Messenger RNA precursors (pre-mRNA) generally undergo 3' end processing by cleavage and polyadenylation (CPA), which is specified by a polyadenylation site (PAS) and adjacent RNA sequences and regulated by a large variety of core and auxiliary CPA factors. To date, most of the human CPA factors have been discovered through biochemical and proteomic studies. However, genetic identification of the human CPA factors has been hampered by the lack of a reliable genome-wide screening method.
Alternative polyadenylation (APA) enhances gene regulatory potential by increasing the diversity of mRNA transcripts. 3' UTR shortening through APA correlates with enhanced cellular proliferation and is a widespread phenomenon in tumor cells. Here, we show that the ubiquitously expressed transcription factor Sp1 binds RNA in vivo and is a common repressor of distal poly(A) site usage.
Previous transcriptomic profiling studies have typically focused on separately analyzing mRNA expression, alternative splicing and alternative polyadenylation differences between cell and tissue types. However, the relative contribution of these three transcriptomic regulatory layers to cell type specification is poorly understood. This question is particularly relevant to neurons, given their extensive heterogeneity associated with brain location, morphology and function.
Systematic mapping of genetic interactions (GIs) and interrogation of the functions of sizable genomic segments in mammalian cells represent important goals of biomedical research. To advance these goals, we present a CRISPR (clustered regularly interspaced short palindromic repeats)-based screening system for combinatorial genetic manipulation that employs coexpression of CRISPR-associated nucleases 9 and 12a (Cas9 and Cas12a) and machine-learning-optimized libraries of hybrid Cas9-Cas12a guide RNAs. This system, named Cas Hybrid for Multiplexed Editing and screening Applications (CHyMErA), outperforms genetic screens using Cas9 or Cas12a editing alone.
Regulation of translation during human development is poorly understood, and its dysregulation is associated with Rett syndrome (RTT). To discover shifts in mRNA ribosomal engagement (RE) during human neurodevelopment, we use parallel translating ribosome affinity purification sequencing (TRAP-seq) and RNA sequencing (RNA-seq) on control and RTT human induced pluripotent stem cells, neural progenitor cells, and cortical neurons. We find that 30% of transcribed genes are translationally regulated, including key gene sets (neurodevelopment, transcription and translation factors, and glycolysis).
Alternative splicing (AS) is a widespread process underlying the generation of transcriptomic and proteomic diversity and is frequently misregulated in human disease. Accordingly, an important goal of biomedical research is the development of tools capable of comprehensively, accurately, and efficiently profiling AS. Here, we describe Whippet, an easy-to-use RNA-seq analysis method that rapidly, with hardware requirements compatible with a laptop, models and quantifies AS events of any complexity without loss of accuracy.
Alternative polyadenylation (APA) affects most mammalian genes. The genome-wide investigation of APA has been hampered by an inability to reliably profile it using conventional RNA-seq. We describe 'Quantification of APA' (QAPA), a method that infers APA from conventional RNA-seq data.
Alternative splicing (AS) generates remarkable regulatory and proteomic complexity in metazoans. However, the functions of most AS events are not known, and programs of regulated splicing remain to be identified. To address these challenges, we describe the Vertebrate Alternative Splicing and Transcription Database (VastDB), the largest resource of genome-wide, quantitative profiles of AS events assembled to date.
Global transcriptomic imbalance is a ubiquitous feature associated with cancer, including hepatocellular carcinoma (HCC). Analyses of 1,225 clinical HCC samples revealed that a large number of RNA binding proteins (RBPs) are dysregulated and that RBP dysregulation is associated with poor prognosis. We further identified that oncogenic activation of a top candidate RBP, negative elongation factor E (NELFE), via somatic copy-number alterations enhanced MYC signaling and promoted HCC progression.
RNA-binding proteins recognize RNA sequences and structures, but there is currently no systematic and accurate method to derive large (>12 base) motifs de novo that reflect a combination of intrinsic preference to both sequence and structure. To address this absence, we introduce RNAcompete-S, which couples a single-step competitive binding reaction with an excess of random RNA 40-mers to a custom computational pipeline for interrogation of the bound RNA sequences and derivation of SSMs (Sequence and Structure Models). RNAcompete-S confirms that HuR, QKI, and SRSF1 prefer binding sites that are single stranded, and recapitulates known 8-10 bp sequence and structure preferences for Vts1p and RBMY.
Networks of coordinated alternative splicing (AS) events play critical roles in development and disease. However, a comprehensive knowledge of the factors that regulate these networks is lacking. We describe a high-throughput system for systematically linking trans-acting factors to endogenous RNA regulatory events.
RNA-binding proteins (RBPs) participate in diverse cellular processes and have important roles in human development and disease. The human genome, and that of many other eukaryotes, encodes hundreds of RBPs that contain canonical sequence-specific RNA-binding domains (RBDs) as well as numerous other unconventional RNA-binding proteins (ucRBPs). ucRBPs physically associate with RNA but lack common RBDs.
A progressive increase in MECP2 protein levels is a crucial and precisely regulated event during neurodevelopment, but the underlying mechanism is unclear. We report that MECP2 is regulated post-transcriptionally during in vitro differentiation of human embryonic stem cells (hESCs) into cortical neurons. Using reporters to identify functional RNA sequences in the MECP2 3' UTR and genetic manipulations to explore the role of interacting factors on endogenous MECP2, we discover combinatorial mechanisms that regulate RNA stability and translation.
Isolated cytochrome c oxidase (COX) deficiency is a common cause of mitochondrial disease, yet its genetic basis remains unresolved in many patients. Here, we identified novel compound heterozygous mutations in SCO1 (p.M294V, p.
Background: Congenital nephrotic syndrome arises from a defect in the glomerular filtration barrier that permits the unrestricted passage of protein across the barrier, resulting in proteinuria, hypoalbuminaemia, and severe oedema. While most cases are due to mutations in one of five genes, in up to 15% of cases, a genetic cause is not identified. We investigated two sisters with a presumed recessive form of congenital nephrotic syndrome.
Background: Gene fusions arising from chromosomal translocations have been implicated in cancer. However, the role of gene fusions in BRCA1-related breast cancers is not well understood. Mutations in BRCA1 are associated with an increased risk for breast cancer (up to 80% lifetime risk) and ovarian cancer (up to 50%).
Background: Combined Malonic and Methylmalonic Aciduria (CMAMMA) is a rare recessive inborn error of metabolism characterised by elevations of urine malonic acid (MA) and methylmalonic acid (MMA). Nearly all reported cases are caused by malonyl-CoA decarboxylase (MCD) deficiency. Most patients have metabolic acidosis, developmental delay, seizures and cardiomyopathy.
Expression levels of many human genes are under the genetic control of expression quantitative trait loci (eQTLs). Despite technological advances, the precise molecular mechanisms underlying most eQTLs remain elusive. Here, we use deep mRNA sequencing of two CEU individuals to investigate those mechanisms, with particular focus on the role of splicing control loci (sQTLs).
Van Den Ende-Gupta syndrome (VDEGS) is an extremely rare autosomal-recessive disorder characterized by distinctive craniofacial features, which include blepharophimosis, malar and/or maxillary hypoplasia, a narrow and beaked nose, and an everted lower lip. Other features are arachnodactyly, camptodactyly, peculiar skeletal abnormalities, and normal development and intelligence. We present molecular data on four VDEGS patients from three consanguineous Qatari families belonging to the same highly inbred Bedouin tribe.
Protein coding genes constitute approximately 1% of the human genome but harbor 85% of the mutations with large effects on disease-related traits. Therefore, efficient strategies for selectively sequencing complete coding regions (i.e.
Genomic copy number variation (CNV) is a recently identified form of global genetic variation in the human genome. The Affymetrix GeneChip 100 and 500 K SNP genotyping platforms were used to perform a large-scale population-based study of CNV frequency. We constructed a genomic map of 578 CNV regions, covering approximately 220 Mb (7.
Due to the low complexity associated with their sequences, uncovering the evolutionary and functional relationships in highly repetitive proteins such as elastin, spider silks, resilin and abductin represents a significant challenge. Using the polymeric extracellular protein elastin as a model system, we present a novel computational approach to the study of sequence, function and evolutionary relationships in repetitive proteins. To address the absence of accurate sequence annotation for repetitive proteins such as elastin, we have constructed a new database repository, ElastoDB (http://theileria.