Publications

2016

Kim M, Kreig A, Lee C-Y, Rube T, Calvert J, Song JS, Myong S. Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA. Nucleic Acids Res. 2016;44(10):4807–4817.

G-quadruplex (GQ) is a four-stranded DNA structure that can be formed in guanine-rich sequences. GQ structures have been proposed to regulate diverse biological processes including transcription, replication, translation and telomere maintenance. Recent studies have demonstrated the existence of GQ DNA in live mammalian cells and a significant number of potential GQ forming sequences in the human genome. We present a systematic and quantitative analysis of GQ folding propensity on a large set of 438 GQ forming sequences in double-stranded DNA by integrating fluorescence measurement, single-molecule imaging and computational modeling. We find that short minimum loop length and the thymine base are two main factors that lead to high GQ folding propensity. Linear and Gaussian process regression models further validate that the GQ folding potential can be predicted with high accuracy based on the loop length distribution and the nucleotide content of the loop sequences. Our study provides important new parameters that can inform the evaluation and classification of putative GQ sequences in the human genome.

Bell RJA, Rube T, Xavier-Magalhães A, Costa BM, Mancini A, Song JS, Costello JF. Understanding TERT promoter mutations: A common path to immortality. Mol. Cancer Res. 2016;14(4):315–323.

Telomerase (TERT) activation is a fundamental step in tumorigenesis. By maintaining telomere length, telomerase relieves a main barrier on cellular lifespan, enabling limitless proliferation driven by oncogenes. The recently discovered, highly recurrent mutations in the promoter of TERT are found in over 50 cancer types, and are the most common mutation in many cancers. Transcriptional activation of TERT, via promoter mutation or other mechanisms, is the rate-limiting step in production of active telomerase. Although TERT is expressed in stem cells, it is naturally silenced upon differentiation. Thus, the presence of TERT promoter mutations may shed light on whether a particular tumor arose from a stem cell or more differentiated cell type. It is becoming clear that TERT mutations occur early during cellular transformation, and activate the TERT promoter by recruiting transcription factors that do not normally regulate TERT gene expression. This review highlights the fundamental and widespread role of TERT promoter mutations in tumorigenesis, including recent progress on their mechanism of transcriptional activation. These somatic promoter mutations, along with germline variation in the TERT locus also appear to have significant value as biomarkers of patient outcome. Understanding the precise molecular mechanism of TERT activation by promoter mutation and germline variation may inspire novel cancer cell-specific targeted therapies for a large number of cancer patients.

Rube T, Lee W, Hejna M, Chen H, Yasui DH, Hess JF, LaSalle JM, Song JS, Gong Q. Sequence features accurately predict genome-wide MeCP2 binding in vivo. Nat. Commun. 2016;7(1):11025.

Methyl-CpG binding protein 2 (MeCP2) is critical for proper brain development and expressed at near-histone levels in neurons, but the mechanism of its genomic localization remains poorly understood. Using high-resolution MeCP2-binding data, we show that DNA sequence features alone can predict binding with 88% accuracy. Integrating MeCP2 binding and DNA methylation in a probabilistic graphical model, we demonstrate that previously reported genome-wide association with methylation is in part due to MeCP2's affinity to GC-rich chromatin, a result replicated using published data. Furthermore, MeCP2 co-localizes with nucleosomes. Finally, MeCP2 binding downstream of promoters correlates with increased expression in Mecp2-deficient neurons.

Jin H, Rube T, Song JS. Categorical spectral analysis of periodicity in nucleosomal DNA. Nucleic Acids Res. 2016;44(5):2047–2057.

DNA helical twist imposes geometric constraints on the location of histone-DNA interaction sites along nucleosomal DNA. Certain 10.5-bp periodic nucleotides in phase with these geometric constraints have been suggested to facilitate nucleosome positioning. However, the extent of nucleotide periodicity in nucleosomal DNA and its significance in directing nucleosome positioning still remain unclear. We clarify these issues by applying categorical spectral analysis to high-resolution nucleosome maps in two yeast species. We find that only a small fraction of nucleosomal sequences contain significant 10.5-bp periodicity. We further develop a spectral decomposition method to show that the previously observed periodicity in aligned nucleosomal sequences mainly results from proper phasing among nucleosomal sequences, and not from a preponderant occurrence of periodicity within individual sequences. Importantly, we show that this phasing may arise from the histones' proclivity for putting preferred nucleotides at some of the evenly spaced histone-DNA contact points with respect to the dyad axis. We demonstrate that 10.5-bp periodicity, when present, significantly facilitates rotational, but not translational, nucleosome positioning. Finally, although periodicity only moderately affects nucleosome occupancy genome wide, reduced periodicity is an evolutionarily conserved signature of nucleosome-depleted regions around transcription start/termination sites.

2015

Bell RJA, Rube T, Kreig A, Mancini A, Fouse SD, Nagarajan RP, Choi S, Hong C, He D, Pekmezci M, et al. Cancer. The transcription factor GABP selectively binds and activates the mutant TERT promoter in cancer. Science. 2015;348(6238):1036–1039.

Reactivation of telomerase reverse transcriptase (TERT) expression enables cells to overcome replicative senescence and escape apoptosis, which are fundamental steps in the initiation of human cancer. Multiple cancer types, including up to 83% of glioblastomas (GBMs), harbor highly recurrent TERT promoter mutations of unknown function but specific to two nucleotide positions. We identified the functional consequence of these mutations in GBMs to be recruitment of the multimeric GA-binding protein (GABP) transcription factor specifically to the mutant promoter. Allelic recruitment of GABP is consistently observed across four cancer types, highlighting a shared mechanism underlying TERT reactivation. Tandem flanking native E26 transformation-specific motifs critically cooperate with these mutations to activate TERT, probably by facilitating GABP heterotetramer binding. GABP thus directly links TERT promoter mutations to aberrant expression in multiple cancers.

2014

Rube T, Song JS. Quantifying the role of steric constraints in nucleosome positioning. Nucleic Acids Res. 2014;42(4):2147–2158.

Statistical positioning, the localization of nucleosomes packed against a fixed barrier, is conjectured to explain the array of well-positioned nucleosomes at the 5' end of genes, but the extent and precise implications of statistical positioning in vivo are unclear. We examine this hypothesis quantitatively and generalize the idea to include moving barriers as well as nucleosomes actively packed against a barrier. Early experiments noted a similarity between the nucleosome profile aligned and averaged across genes and that predicted by statistical positioning; however, we demonstrate that aligning random nucleosomes also generates the same profile, calling the previous interpretation into question. New rigorous results reformulate statistical positioning as predictions on the variance structure of nucleosome locations in individual genes. In particular, a quantity termed the variance gradient, describing the change in variance between adjacent nucleosomes, is tested against recent high-throughput nucleosome sequencing data. Constant variance gradients provide support for generalized statistical positioning in ∼ 50% of long genes. Genes that deviate from predictions have high nucleosome turnover and cell-to-cell gene expression variability. The observed variance gradient suggests an effective nucleosome size of 158 bp, instead of the commonly perceived 147 bp. Our analyses thus clarify the role of statistical positioning in vivo.

2011

Kallosh R, Linde A, Olive KA, Rube T. Chaotic inflation and supersymmetry breaking. Physical Review D. 2011;84(8):083519.

We investigate the recently proposed class of chaotic inflation models in supergravity with an arbitrary inflaton potential V(\phi). These models are extended to include matter fields in the visible sector and we employ a mechanism of supersymmetry breaking based on a particular phenomenological version of the KKLT mechanism (the KL model). We describe specific features of reheating in this class of models and show how one can solve the cosmological moduli and gravitino problems in this context.

Rube T, Wacker JG. The simplicity of perfect atoms: degeneracies in supersymmetric hydrogen. Journal of mathematical physics. 2011;52(6):062102.

Supersymmetric QED hydrogen-like bound states are remarkably similar to nonsupersymmetric hydrogen, including an accidental degeneracy of the fine structure and is broken by the Lamb shift. This article classifies the states, calculates the leading order spectrum, and illustrates the results in several limits. The relation to other nonrelativistic bound states is explored.

Kallosh R, Linde A, Rube T. General inflaton potentials in supergravity. Physical Review D. 2011.

We describe a way to construct supergravity models with an arbitrary inflaton potential V(\phi) and show that all other scalar fields in this class of models can be stabilized at the inflationary trajectory by a proper choice of the Kähler potential.

Behbahani SR, Jankowiak M, Rube T, Wacker JG. Nearly supersymmetric dark atoms. Advances in High Energy Physics. 2011;2011.

Theories of dark matter that support bound states are an intriguing possibility for the identity of the missing mass of the Universe. This article proposes a class of models of supersymmetric composite dark matter where the interactions with the Standard Model communicate supersymmetry breaking to the dark sector. In these models, supersymmetry breaking can be treated as a perturbation on the spectrum of bound states. Using a general formalism, the spectrum with leading supersymmetry effects is computed without specifying the details of the binding dynamics. The interactions of the composite states with the Standard Model are computed, and several benchmark models are described. General features of nonrelativistic supersymmetric bound states are emphasized.