Transcription factor (TF) binding to genomic DNA elements constitutes one of the key mechanisms that regulate gene expression programs in cells. Both consensus and nonconsensus DNA sequence elements influence the recognition specificity of TFs. Based on the analysis of experimentally determined c-Myc binding preferences to genomic DNA, here we statistically predict that certain repetitive, nonconsensus DNA symmetry elements can relatively reduce TF-DNA binding preferences.
Although myriad protein-protein interactions in nature use polyvalent binding, in which multiple ligands on one entity bind to multiple receptors on another, to date an affinity advantage of polyvalent binding has been demonstrated experimentally only in cases where the target receptor molecules are clustered prior to complex formation. Here, we demonstrate cooperativity in binding affinity (i.e.
In the process of transcription initiation by RNA polymerase, promoter DNA sequences affect multiple reaction pathways determining the productivity of transcription. However, the question of how the molecular mechanism of transcription initiation depends on the sequence properties of promoter DNA remains poorly understood. Here, combining the statistical mechanical approach with high-throughput sequencing results, we characterize abortive transcription and pausing during transcription initiation by RNA polymerase at a genome-wide level.
Transcription factor (TF) recognition is dictated by the underlying DNA motif sequence specific for each TF. Here, we reveal that DNA sequence repeat symmetry plays a central role in defining TF-DNA-binding preferences. In particular, we find that different TFs bind similar symmetry patterns in the context of different developmental layers.
DNA primase synthesizes short RNA primers that initiate DNA synthesis of Okazaki fragments on the lagging strand by DNA polymerase during DNA replication. The binding of prokaryotic DnaG-like primases to DNA occurs at a specific trinucleotide recognition sequence. It is a pivotal step in the formation of Okazaki fragments.
Primases are key enzymes involved in DNA replication. They act on single-stranded DNA and catalyze the synthesis of short RNA primers used by DNA polymerases. Here, we investigate the DNA binding and activity of the bacteriophage T7 primase using a new workflow called high-throughput primase profiling (HTPP).
Transcription of DNA by RNA polymerase (RNAP) takes place in a cell environment dominated by thermal fluctuations. How are transcription reactions, including initiation, elongation, and termination on genomic DNA, so well controlled during such fluctuations? A recent statistical mechanical approach using high-throughput sequencing data reveals that repetitive DNA sequence elements embedded into a genomic sequence provide the key mechanism to functionally bias the fluctuations of transcription elongation complexes. In particular, during elongation pausing, such repetitive sequence elements can increase the magnitude of one-dimensional diffusion of the RNAP enzyme on the DNA upstream of the pausing site, generating a large variation in the dwell times of RNAP pausing under the control of these genomic signals.
The notion that transcription factors bind DNA only through specific, consensus binding sites has been recently questioned. No specific consensus motif for the positioning of the human preinitiation complex (PIC) has been identified. Here, we reveal that a nonconsensus, statistical DNA triplet code provides specificity for the positioning of the human PIC.
In the process of transcription elongation, RNA polymerase (RNAP) pauses at highly nonrandom positions across genomic DNA, broadly regulating transcription; however, molecular mechanisms responsible for the recognition of such pausing positions remain poorly understood. Here, using a combination of statistical mechanical modeling and high-throughput sequencing and biochemical data, we evaluate the effect of thermal fluctuations on the regulation of RNAP pausing. We demonstrate that diffusive backtracking of RNAP, which is biased by repetitive DNA sequence elements, causes transcriptional pausing.
Recent genome-wide experiments in different eukaryotic genomes provide an unprecedented view of transcription factor (TF) binding locations and of nucleosome occupancy. These experiments revealed that a large fraction of TF binding events occur in regions where only a small number of specific TF binding sites (TFBSs) have been detected. Furthermore, in vitro protein-DNA binding measurements performed for hundreds of TFs indicate that TFs are bound with a wide range of affinities to different DNA sequences that lack known consensus motifs.
Proc Natl Acad Sci U S A
December 2014
Until now, it has been reasonably assumed that specific base-pair recognition is the only mechanism controlling the specificity of transcription factor (TF)-DNA binding. Contrary to this assumption, here we show that nonspecific DNA sequences possessing certain repeat symmetries, when present outside of specific TF binding sites (TFBSs), statistically control TF-DNA binding preferences. We used high-throughput protein-DNA binding assays to measure the binding levels and free energies of binding for several human TFs to tens of thousands of short DNA sequences with varying repeat symmetries.
Recent experiments provide an unprecedented view of protein-DNA binding in yeast and human genomes at single-nucleotide resolution. These measurements, performed over large cell populations, show quite generally that sequence-specific transcription regulators with well-defined protein-DNA consensus motifs bind only a fraction of all consensus motifs present in the genome. Alternatively, proteins in vivo often bind DNA regions lacking known consensus sequences.
Genome-wide binding preferences of the key components of the eukaryotic preinitiation complex (PIC) have been recently measured at high resolution in Saccharomyces cerevisiae by Rhee and Pugh. However, the rules determining the PIC binding specificity remain poorly understood. In this study, we show that nonconsensus protein-DNA binding significantly influences PIC binding preferences.
Recent genome-wide measurements of binding preferences of ~200 transcription regulators in the vicinity of transcription start sites in yeast have provided a unique insight into the cis-regulatory code of a eukaryotic genome. Here, we show that nonspecific transcription factor (TF)-DNA binding significantly influences binding preferences of the majority of transcription regulators in promoter regions of the yeast genome. We show that promoters of SAGA-dominated and TFIID-dominated genes can be statistically distinguished based on the landscape of nonspecific protein-DNA binding free energy.
Quantitative understanding of the principles regulating nucleosome occupancy on a genome-wide level is a central issue in eukaryotic genomics. Here, we address this question using budding yeast, Saccharomyces cerevisiae, as a model organism. We perform a genome-wide computational analysis of the nonspecific transcription factor (TF)-DNA binding free-energy landscape and compare this landscape with experimentally determined nucleosome-binding preferences.
We predict analytically that diagonal correlations of amino acid positions within protein sequences statistically enhance protein propensity for nonspecific binding. We use the term "promiscuity" to describe such nonspecific binding. Diagonal correlations represent statistically significant repeats of sequence patterns where amino acids of the same type are clustered together.
Transcription factors (TFs) are regulatory proteins that bind DNA in promoter regions of the genome and either promote or repress gene expression. Here, we predict analytically that enhanced homooligonucleotide sequence correlations, such as poly(dA:dT) and poly(dC:dG) tracts, statistically enhance nonspecific TF-DNA binding affinity. This prediction is generic and qualitatively independent of microscopic parameters of the model.
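As a rough illustration of what a homooligonucleotide tract and a same-base sequence correlation look like in practice, the Python sketch below locates runs of identical bases and measures how often two positions a fixed distance apart carry the same base. This is not the analytical model referred to in the abstract: the function names `homopolymer_tracts` and `same_base_correlation`, the `min_len` threshold, and the example sequence are all hypothetical choices made for illustration only.

```python
from itertools import groupby


def homopolymer_tracts(seq, min_len=4):
    """Return (base, start, length) for every run of identical bases
    that is at least min_len long (min_len is an illustrative threshold)."""
    tracts = []
    pos = 0
    for base, run in groupby(seq.upper()):
        length = sum(1 for _ in run)
        if length >= min_len:
            tracts.append((base, pos, length))
        pos += length
    return tracts


def same_base_correlation(seq, distance):
    """Fraction of position pairs (i, i + distance) that carry the same base.
    A simple stand-in for a sequence correlation measure, not the paper's."""
    seq = seq.upper()
    pairs = len(seq) - distance
    if distance < 1 or pairs <= 0:
        raise ValueError("distance must be >= 1 and shorter than the sequence")
    matches = sum(1 for i in range(pairs) if seq[i] == seq[i + distance])
    return matches / pairs


# Hypothetical sequence with one poly(dA) tract and one poly(dC) tract.
example = "GC" + "A" * 6 + "GT" + "C" * 5 + "TAG"
print(homopolymer_tracts(example))
print(same_base_correlation(example, 1))
```

A sequence rich in such tracts gives a higher nearest-neighbour value from `same_base_correlation` than a shuffled sequence of the same composition, which is one simple way to quantify the kind of enhanced correlation the abstract describes.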
Numerous experiments demonstrate a high level of promiscuity and structural disorder in organismal proteomes. Here, we ask what makes a protein promiscuous, that is, prone to nonspecific interactions, and structurally disordered. We predict that multi-scale correlations of amino acid positions within protein sequences statistically enhance the propensity for promiscuous intra- and inter-protein binding.