Deep learning of the tissue-regulated splicing code.

Bioinformatics

Department of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G4, Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada and Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, CanadaDepartment of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G4, Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada and Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, CanadaDepartment of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G4, Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada and Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, Canada.

Published: June 2014

Motivation: Alternative splicing (AS) is a regulated process that directs the generation of different transcripts from single genes. A computational model that can accurately predict splicing patterns based on genomic features and cellular context is highly desirable, both in understanding this widespread phenomenon, and in exploring the effects of genetic variations on AS.

Methods: Using a deep neural network, we developed a model inferred from mouse RNA-Seq data that can predict splicing patterns in individual tissues and differences in splicing patterns across tissues. Our architecture uses hidden variables that jointly represent features in genomic sequences and tissue types when making predictions. A graphics processing unit was used to greatly reduce the training time of our models with millions of parameters.

Results: We show that the deep architecture surpasses the performance of the previous Bayesian method for predicting AS patterns. With the proper optimization procedure and selection of hyperparameters, we demonstrate that deep architectures can be beneficial, even with a moderately sparse dataset. An analysis of what the model has learned in terms of the genomic features is presented.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058935PMC
http://dx.doi.org/10.1093/bioinformatics/btu277DOI Listing

Publication Analysis

Top Keywords

splicing patterns
12
predict splicing
8
genomic features
8
splicing
5
deep
4
deep learning
4
learning tissue-regulated
4
tissue-regulated splicing
4
splicing code
4
code motivation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!