Learning the sequence determinants of alternative splicing from millions of random sequences.

Cell

Department of Electrical Engineering, University of Washington, Seattle, WA 98195, USA; Department of Computer Science and Engineering, University of Washington, Seattle, WA 98195, USA. Electronic address:

Published: October 2015

Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. Toward better modeling the sequence determinants of alternative splicing, we measured the splicing patterns of over two million (M) synthetic mini-genes, which include degenerate subsequences totaling over 100 M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing, as well as to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5' splicing also enhance (suppress) exon inclusion in alternative 3' or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cell.2015.09.054DOI Listing

Publication Analysis

Top Keywords

alternative splicing
16
sequence determinants
8
determinants alternative
8
splicing
8
enhance suppress
8
suppress exon
8
exon inclusion
8
inclusion alternative
8
alternative
7
learning sequence
4

Similar Publications

The duck industry is vital for supplying high-quality protein, making research into the development of duck skeletal muscle critical for improving meat and egg production. In this study, we leveraged Oxford Nanopore Technologies (ONT) sequencing to perform full-length transcriptome sequencing of myoblasts harvested from the leg muscles of duck embryos at embryonic day 13 (E13), specifically examining both the proliferative (GM) and differentiation (DM) phases. Our analysis identified a total of 5797 novel transcripts along with 2332 long non-coding RNAs (lncRNAs), revealing substantial changes in gene expression linked to muscle development.

View Article and Find Full Text PDF

RNA sequencing (RNA-seq) is widely adopted for transcriptome analysis but has inherent biases that hinder the comprehensive detection and quantification of alternative splicing. To address this, we present an efficient targeted RNA-seq method that greatly enriches for splicing-informative junction-spanning reads. Local splicing variation sequencing (LSV-seq) utilizes multiplexed reverse transcription from highly scalable pools of primers anchored near splicing events of interest.

View Article and Find Full Text PDF

Background: It has been demonstrated that nintedanib can inhibit the proliferation of gastric cancer cells, but the specific mechanism of action is unclear.

Objective: Investigating the changes of key factors involved in gene transcription and post-transcriptional regulation during the process of treating gastric cancer with nintedanib.

Methods: In this study, we performed transcriptome sequencing on gastric cancer cell groups treated with nintedanib and control groups.

View Article and Find Full Text PDF

Dorper sheep is popular among farming enterprises with strong adaptability, disease resistance, and roughage tolerance, and an unique characteristic of natural shedding of wool. In a large number of observations on experimental sheep farms, it was found that the wool of some sheep still had not shed after May, thus manual shearing was required. Therefore, understanding the molecular mechanisms of normal hair follicles (HFs) development is crucial to revealing the improvement of sheep wool-related traits and mammalian skin-related traits.

View Article and Find Full Text PDF

Hereditary spastic paraplegias (HSP) are a diverse group of neurodegenerative diseases characterized by lower limb spasticity and weakness. To date, over 80 genes have been associated with HSP, but many families remain without a molecular diagnosis. In this study, linkage analysis and whole-exome sequencing (WES) were performed to identify the causal gene in a HSP family with autosomal recessive inheritance.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!