Transcription factors (TFs) control specificity and activity of gene transcription, but whether a relationship between these two features exists is unclear. Here we provide evidence for an evolutionary trade-off between the activity and specificity in human TFs encoded as submaximal dispersion of aromatic residues in their intrinsically disordered protein regions. We identified approximately 500 human TFs that encode short periodic blocks of aromatic residues in their intrinsically disordered regions, resembling imperfect prion-like sequences.
View Article and Find Full Text PDFWhile the link between exposure to high levels of ambient particulate matter (PM) and increased incidences of respiratory and cardiovascular diseases is widely recognized, recent epidemiological studies have shown that low PM concentrations are equally associated with adverse health effects. As DNA methylation is one of the main mechanisms by which cells regulate and stabilize gene expression, changes in the methylome could constitute early indicators of dysregulated signaling pathways. So far, little is known about PM-associated DNA methylation changes in the upper airways, the first point of contact between airborne pollutants and the human body.
View Article and Find Full Text PDFBackground: Childhood asthma is a result of a complex interaction of genetic and environmental components causing epigenetic and immune dysregulation, airway inflammation and impaired lung function. Although different microarray based EWAS studies have been conducted, the impact of epigenetic regulation in asthma development is still widely unknown. We have therefore applied unbiased whole genome bisulfite sequencing (WGBS) to characterize global DNA-methylation profiles of asthmatic children compared to healthy controls.
View Article and Find Full Text PDFThe combination of a cell's transcriptional profile and location defines its function in a spatial context. Spatially resolved transcriptomics (SRT) has emerged as the assay of choice for characterizing cells . SRT methods can resolve gene expression up to single-molecule resolution.
View Article and Find Full Text PDFBRG1 and BRM are ATPase core subunits of the human SWI/SNF chromatin remodelling complexes mainly associated with transcriptional initiation. They also have a role in alternative splicing, which has been shown for BRM-containing SWI/SNF complexes at a few genes. Here, we have identified a subset of genes which harbour alternative exons that are affected by SWI/SNF ATPases by expressing the ATPases BRG1 and BRM in C33A cells, a BRG1- and BRM-deficient cell line, and analysed the effect on splicing by RNA sequencing.
View Article and Find Full Text PDFSingle-cell RNA sequencing studies on gene co-expression patterns could yield important regulatory and functional insights, but have so far been limited by the confounding effects of differentiation and cell cycle. We apply a tailored experimental design that eliminates these confounders, and report thousands of intrinsically covarying gene pairs in mouse embryonic stem cells. These covariations form a network with biological properties, outlining known and novel gene interactions.
View Article and Find Full Text PDFThe Mediator complex directs signals from DNA-binding transcription factors to RNA polymerase II (Pol II). Despite this pivotal position, mechanistic understanding of Mediator in human cells remains incomplete. Here we quantified Mediator-controlled Pol II kinetics by coupling rapid subunit degradation with orthogonal experimental readouts.
View Article and Find Full Text PDFExpansions of amino acid repeats occur in >20 inherited human disorders, and many occur in intrinsically disordered regions (IDRs) of transcription factors (TFs). Such diseases are associated with protein aggregation, but the contribution of aggregates to pathology has been controversial. Here, we report that alanine repeat expansions in the HOXD13 TF, which cause hereditary synpolydactyly in humans, alter its phase separation capacity and its capacity to co-condense with transcriptional co-activators.
View Article and Find Full Text PDFA correction has been published and is appended to both the HTML and PDF versions of this paper. The error has not been fixed in the paper.
View Article and Find Full Text PDFProfilin is vital for actin organisation in eukaryotic cells. It controls actin filament formation by binding monomeric actin and numerous proteins involved in polarised actin assembly. Important for the latter is the interaction surface formed by the N- and C-terminal helices, which pack close to each other on one side of the molecule at a distance from the actin site and mediate binding to poly-proline sequences present in many of the targeted proteins.
View Article and Find Full Text PDFBackground: There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptides), but genome-wide experimental or computational identification of functional sORFs remains challenging.
Results: Here, we expand our previously developed method and present results of an integrated computational pipeline for the identification of conserved sORFs in human, mouse, zebrafish, fruit fly, and the nematode C.
The ability of animals to sense and respond to elevated temperature is essential for survival. Transcriptional control of the heat stress response has been much studied, whereas its posttranscriptional regulation by microRNAs (miRNAs) is not well understood. Here we analyzed the miRNA response to heat stress in Caenorhabditis elegans and show that a discrete subset of miRNAs is thermoregulated.
View Article and Find Full Text PDFIdentification of the coding elements in the genome is a fundamental step to understanding the building blocks of living systems. Short peptides (< 100 aa) have emerged as important regulators of development and physiology, but their identification has been limited by their size. We have leveraged the periodicity of ribosome movement on the mRNA to define actively translated ORFs by ribosome footprinting.
View Article and Find Full Text PDFCircular RNAs (circRNAs) in animals are an enigmatic class of RNA with unknown function. To explore circRNAs systematically, we sequenced and computationally analysed human, mouse and nematode RNA. We detected thousands of well-expressed, stable circRNAs, often showing tissue/developmental-stage-specific expression.
View Article and Find Full Text PDFMicroRNAs (miRNAs) constitute an important class of small regulatory RNAs that are derived from distinct hairpin precursors (pre-miRNAs). In contrast to mature miRNAs, which have been characterized in numerous genome-wide studies of different organisms, research on global profiling of pre-miRNAs is limited. Here, using massive parallel sequencing, we have performed global characterization of both mouse mature and precursor miRNAs.
View Article and Find Full Text PDFCurr Protoc Bioinformatics
December 2011
miRNAs comprise an abundant class of small non-coding RNAs that play important roles in a wide range of biological processes by post-transcriptional regulation of a large fraction of animal genes. High-throughput sequencing machines and the availability of completely sequenced genomes make it possible to reliably identify miRNAs with computational methods. This unit documents how to use the miRDeep2 software package to identify novel and known microRNAs in small RNA deep-sequencing data.
View Article and Find Full Text PDFIn animals, RNA binding proteins (RBPs) and microRNAs (miRNAs) post-transcriptionally regulate the expression of virtually all genes by binding to RNA. Recent advances in experimental and computational methods facilitate transcriptome-wide mapping of these interactions. It is thought that the combinatorial action of RBPs and miRNAs on target mRNAs form a post-transcriptional regulatory code.
View Article and Find Full Text PDFmicroRNAs (miRNAs) are a large class of small non-coding RNAs which post-transcriptionally regulate the expression of a large fraction of all animal genes and are important in a wide range of biological processes. Recent advances in high-throughput sequencing allow miRNA detection at unprecedented sensitivity, but the computational task of accurately identifying the miRNAs in the background of sequenced RNAs remains challenging. For this purpose, we have designed miRDeep2, a substantially improved algorithm which identifies canonical and non-canonical miRNAs such as those derived from transposable elements and informs on high-confidence candidates that are detected in multiple independent samples.
View Article and Find Full Text PDFFreshwater planaria are a very attractive model system for stem cell biology, tissue homeostasis, and regeneration. The genome of the planarian Schmidtea mediterranea has recently been sequenced and is estimated to contain >20,000 protein-encoding genes. However, the characterization of its transcriptome is far from complete.
View Article and Find Full Text PDFWe systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs.
View Article and Find Full Text PDFThree-prime untranslated regions (3'UTRs) of metazoan messenger RNAs (mRNAs) contain numerous regulatory elements, yet remain largely uncharacterized. Using polyA capture, 3' rapid amplification of complementary DNA (cDNA) ends, full-length cDNAs, and RNA-seq, we defined approximately 26,000 distinct 3'UTRs in Caenorhabditis elegans for approximately 85% of the 18,328 experimentally supported protein-coding genes and revised approximately 40% of gene models. Alternative 3'UTR isoforms are frequent, often differentially expressed during development.
View Article and Find Full Text PDFBackground: Although baker's yeast is a primary model organism for research on eukaryotic ribosome assembly and nucleoli, the list of its proteins that are functionally associated with nucleoli or ribosomes is still incomplete. We trained a naïve Bayesian classifier to predict novel proteins that are associated with yeast nucleoli or ribosomes based on parts lists of nucleoli in model organisms and large-scale protein interaction data sets. Phylogenetic profiling and gene expression analysis were carried out to shed light on evolutionary and regulatory aspects of nucleoli and ribosome assembly.
View Article and Find Full Text PDF