A mixture model and statistical method is proposed to interpret the distribution of reads from a nascent transcriptional assay, such as global run-on sequencing (GRO-seq) data. The model is annotation agnostic and leverages on current understanding of the behavior of RNA polymerase II. Briefly, it assumes that polymerase loads at key positions (transcription start sites) within the genome. Once loaded, polymerase either remains in the initiation form (with some probability) or transitions into an elongating form (with the remaining probability). The model can be fit genome-wide, allowing patterns of Pol II behavior to be assessed on each distinct transcript. Furthermore, it allows for the first time a principled approach to distinguishing the initiation signal from the elongation signal; in particular, it implies a data driven method for calculating the pausing index, a commonly used metric that informs on the behavior of RNA polymerase II. We demonstrate that this approach improves on existing analyses of GRO-seq data and uncovers a novel biological understanding of the impact of knocking down the Male Specific Lethal (MSL) complex in Drosophilia melanogaster.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s00285-016-1014-4 | DOI Listing |
bioRxiv
January 2025
Mechanisms of Epigenetic Inheritance, Department of Developmental and Stem Cell Biology, Institut Pasteur, UMR 3738, CNRS, Paris 75015, France.
Transcription introduces torsional stress in the DNA fiber causing it to transition from a relaxed to a supercoiled state that can propagate across several kilobases and modulate the binding and activity of DNA-associated proteins. As a result, transcription at one locus has the potential to impact nearby transcription events. In this study, we asked how DNA supercoiling affects histone modifications and transcription of neighboring genes in the multicellular eukaryote .
View Article and Find Full Text PDFIntroduction: GWAS have identified multiple regions that confer risk for juvenile idiopathic arthritis (JIA). However, identifying the single nucleotide polymorphisms (SNPs) that drive disease risk is impeded by the SNPs' that identify risk loci being in linkage disequilibrium (LD) with hundreds of other SNPs. Since the causal SNPs remain unknown, it is difficult to identify target genes and use genetic information to inform patient care.
View Article and Find Full Text PDFTF profiler is a method of inferring transcription factor regulatory activity, i.e. when a TF is present and actively regulating transcription, directly directly from nascent sequencing assays such as PRO-seq and GRO-seq.
View Article and Find Full Text PDFPac Symp Biocomput
January 2024
Department of Computer Science, BioFrontiers Institute, University of Colorado Boulder, 596 UCB, Boulder, CO 80309, USA,
The problem of microdissection of heterogeneous tissue samples is of great interest for both fundamental biology and biomedical research. Until now, microdissection in the form of supervised deconvolution of mixed sequencing samples has been limited to assays measuring gene expression (RNA-seq) or chromatin accessibility (ATAC-seq). We present here the first attempt at solving the supervised deconvolution problem for run-on nascent sequencing data (GRO-seq and PRO-seq), a readout of active transcription.
View Article and Find Full Text PDFBMC Biol
November 2023
BioFrontiers Institute, University of Colorado, Boulder, 80309, USA.
Background: The increase in DNA copy number in Down syndrome (DS; caused by trisomy 21) has led to the DNA dosage hypothesis, which posits that the level of gene expression is proportional to the gene's DNA copy number. Yet many reports have suggested that a proportion of chromosome 21 genes are dosage compensated back towards typical expression levels (1.0×).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!