Genome-wide association studies (GWAS) have identified thousands of putative disease-causing variants with unknown regulatory effects. Efforts to connect these variants with splicing quantitative trait loci (sQTLs) have provided functional insights, yet sQTLs reported by existing methods cannot explain many GWAS signals. We show that current sQTL modeling approaches can be improved by considering alternative splicing representation, model calibration, and covariate integration.
The length of 3' untranslated regions (3'UTR) is highly regulated during many transitions in cell state, including T cell activation, through the process of alternative polyadenylation (APA). However, the regulatory mechanisms and functional consequences of APA remain largely unexplored. Here we present a detailed analysis of the temporal and condition-specific regulation of APA following activation of primary human CD4 T cells.
The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, and limitations and, consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data.
Chromatin regulation and alternative splicing are both critical mechanisms guiding gene expression. Studies have demonstrated that histone modifications can influence alternative splicing decisions, but less is known about how alternative splicing may impact chromatin. Here, we demonstrate that several genes encoding histone-modifying enzymes are alternatively spliced downstream of T cell signaling pathways, including HDAC7, a gene previously implicated in controlling gene expression and differentiation in T cells.
The ubiquity of RNA-seq has led to many methods that use RNA-seq data to analyze variations in RNA splicing. However, available methods are not well suited for handling heterogeneous and large datasets. Such datasets scale to thousands of samples across dozens of experimental conditions, exhibit increased variability compared to biological replicates, and involve thousands of unannotated splice variants resulting in increased transcriptome complexity.
Alternative splicing occurs in the vast majority of human genes, giving rise to distinct mRNA and protein isoforms. We, and others, have previously identified hundreds of genes that change their isoform expression upon T cell activation via alternative splicing; however, how these changes link activation input with functional output remains largely unknown. Here, we investigate how costimulation of T cells through the CD28 receptor impacts alternative splicing in T cells activated through the T cell receptor (TCR, CD3) and find that while CD28 signaling alone has minimal impact on splicing, it enhances the extent of change for up to 20% of TCR-induced alternative splicing events.
We performed genome-wide association study meta-analysis to identify genetic determinants of skeletal age (SA) deviating in multiple growth disorders. The joint meta-analysis (N = 4557) in two multiethnic cohorts of school-aged children identified one locus, CYP11B1 (expression confined to the adrenal gland), robustly associated with SA (rs6471570-A; β = 0.14; P = 6.
Delivering a keynote talk at a conference organized by a scientific society or being named as a fellow by such a society indicates that a scientist is held in high regard by their colleagues. To explore if the distribution of such indicators of esteem in the field of bioinformatics reflects the composition of this field, we compared the gender, name origin, and country of affiliation of 412 honorees from the "International Society for Computational Biology" (75 fellows and 337 keynote speakers) with over 170,000 last authorships on computational biology papers between 1993 and 2019. The proportion of honors bestowed on women was similar to that of the field's overall last authorship rate.
Despite the success and rapid adoption of deep learning models in biomedical domains, their lack of interpretability remains an issue. Here, we introduce Enhanced Integrated Gradients (EIG), a method to identify significant features associated with a specific prediction task. Using RNA splicing prediction as well as digit classification as case studies, we demonstrate that EIG improves upon the original Integrated Gradients method and produces sets of informative features.
RNA binding proteins (RBPs) frequently regulate the expression of other RBPs in mammalian cells. Such cross-regulation has been proposed to be important to control networks of coordinated gene expression; however, much remains to be understood about how such networks of cross-regulation are established and what the functional consequence is of coordinated or reciprocal expression of RBPs. Here we demonstrate that the RBPs CELF2 and hnRNP C regulate the expression of each other, such that depletion of one results in reduced expression of the other.
Alternative pre-mRNA splicing has long been proposed to contribute greatly to proteome complexity. However, the extent to which mature mRNA isoforms are successfully translated into protein remains controversial. Here, we used high-throughput RNA sequencing and mass spectrometry (MS)-based proteomics to better evaluate the translation of alternatively spliced mRNAs.
The 3' untranslated region (UTR) of human mRNAs plays a critical role in controlling protein expression and function. Importantly, 3' UTRs of human messages are not invariant for each gene but rather are shaped by alternative polyadenylation (APA) in a cell state-dependent manner, including in response to T cell activation. However, the proteins and mechanisms driving APA regulation remain poorly understood.
Male germ cells of all placental mammals express an ancient nuclear RNA binding protein of unknown function called RBMXL2. Here we find that deletion of the retrogene encoding RBMXL2 blocks spermatogenesis. Transcriptome analyses of age-matched deletion mice show that RBMXL2 controls splicing patterns during meiosis.
Three of the eight RNA segments encoded by the influenza A virus (IAV) undergo alternative splicing to generate distinct proteins. Previously, we found that host proteins hnRNP K and NS1-BP regulate IAV M segment splicing, but the mechanistic details were unknown. Here we show NS1-BP and hnRNP K bind M mRNA downstream of the M2 5' splice site (5'ss).
Summary: Analysis of RNA sequencing (RNA-Seq) data has highlighted the fact that most genes undergo alternative splicing (AS) and that these patterns are tightly regulated. Many of these events are complex, resulting in numerous possible isoforms that quickly become difficult to visualize, interpret and experimentally validate. To address these challenges we developed MAJIQ-SPEL, a web tool that takes as input local splicing variations (LSVs) quantified from RNA-Seq data and provides users with visualization and quantification of gene isoforms associated with those LSVs.
Motivation: Advancements in sequencing technologies have highlighted the role of alternative splicing (AS) in increasing transcriptome complexity. This role of AS, combined with the relation of aberrant splicing to malignant states, motivated two streams of research, experimental and computational. The first involves a myriad of techniques such as RNA-Seq and CLIP-Seq to identify splicing regulators and their putative targets.
The brain is made up of trillions of synaptic connections that together form neural networks needed for normal brain function and behavior. SLM2 is a member of a conserved family of RNA binding proteins, including Sam68 and SLM1, that control splicing of Neurexin1-3 pre-mRNAs. Whether SLM2 affects neural network activity is unknown.
CELF2 is an RNA binding protein that has been implicated in developmental and signal-dependent splicing in the heart, brain and T cells. In the heart, CELF2 expression decreases during development, while in T cells CELF2 expression increases both during development and in response to antigen-induced signaling events. Although hundreds of CELF2-responsive splicing events have been identified in both heart and T cells, the way in which CELF2 functions has not been broadly investigated.
Alternative splicing (AS) can critically affect gene function and disease, yet mapping splicing variations remains a challenge. Here, we propose a new approach to define and quantify mRNA splicing in units of local splicing variations (LSVs). LSVs capture previously defined types of alternative splicing as well as more complex transcript variations.