A deep recurrent neural network discovers complex biological rules to decipher RNA protein-coding potential.

Nucleic Acids Res

School of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA.

Published: September 2018

The current deluge of newly identified RNA transcripts presents a singular opportunity for improved assessment of coding potential, a cornerstone of genome annotation, and for machine-driven discovery of biological knowledge. While traditional, feature-based methods for RNA classification are limited by current scientific knowledge, deep learning methods can independently discover complex biological rules in the data de novo. We trained a gated recurrent neural network (RNN) on human messenger RNA (mRNA) and long noncoding RNA (lncRNA) sequences. Our model, mRNA RNN (mRNN), surpasses state-of-the-art methods at predicting protein-coding potential despite being trained with less data and with no prior concept of what features define mRNAs. To understand what mRNN learned, we probed the network and uncovered several context-sensitive codons highly predictive of coding potential. Our results suggest that gated RNNs can learn complex and long-range patterns in full-length human transcripts, making them ideal for performing a wide range of difficult classification tasks and, most importantly, for harvesting new biological insights from the rising flood of sequencing data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144860PMC
http://dx.doi.org/10.1093/nar/gky567DOI Listing

Publication Analysis

Top Keywords

recurrent neural
8
neural network
8
complex biological
8
biological rules
8
protein-coding potential
8
coding potential
8
rna
5
deep recurrent
4
network discovers
4
discovers complex
4

Similar Publications

Effects of noise and metabolic cost on cortical task representations.

Elife

January 2025

Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, United Kingdom.

Cognitive flexibility requires both the encoding of task-relevant and the ignoring of task-irrelevant stimuli. While the neural coding of task-relevant stimuli is increasingly well understood, the mechanisms for ignoring task-irrelevant stimuli remain poorly understood. Here, we study how task performance and biological constraints jointly determine the coding of relevant and irrelevant stimuli in neural circuits.

View Article and Find Full Text PDF

Comparative Analysis of Recurrent Neural Networks with Conjoint Fingerprints for Skin Corrosion Prediction.

J Chem Inf Model

January 2025

Division of Pharmaceutical Chemistry, Faculty of Pharmaceutical Sciences, Khon Kaen University, Khon Kaen 40002, Thailand.

Skin corrosion assessment is an essential toxicity end point that addresses safety concerns for topical dosage forms and cosmetic products. Previously, skin corrosion assessments required animal testing; however, differences in skin architecture and ethical concerns regarding animal models have fostered the advancement of alternative methods such as and models. This study aimed to develop deep learning (DL) models based on recurrent neural networks (RNNs) for classifying skin corrosion of chemical compounds based on chemical language notation, molecular substructure, physicochemical properties, and a combination of these three properties called conjoint fingerprints.

View Article and Find Full Text PDF

The claustrum complex is viewed as fundamental for higher-order cognition; however, the circuit organization and function of its neuroanatomical subregions are not well understood. We demonstrated that some of the key roles of the CLA complex can be attributed to the connectivity and function of a small group of neurons in its ventral subregion, the endopiriform (EN). We identified a subpopulation of EN neurons by their projection to the ventral CA1 (EN.

View Article and Find Full Text PDF

Background And Objective: Alglucosidase alfa for injection is used as an enzyme replacement therapy for the treatment of Pompe disease. The safety profile of alglucosidase alfa-associated adverse events requires a comprehensive evaluation. In this study, we aimed to identify drug safety alert signals and investigate the real-world safety of alglucosidase alfa to guide clinical decision making and optimize the risk-benefit balance.

View Article and Find Full Text PDF

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM Trans Audio Speech Lang Process

February 2024

CRSS: Center for Robust Speech Systems; Cochlear Implant Processing Laboratory (CILab), Department of Electrical and Computer Engineering, University of Texas at Dallas, USA.

The presence of background noise or competing talkers is one of the main communication challenges for cochlear implant (CI) users in speech understanding in naturalistic spaces. These external factors distort the time-frequency (T-F) content including magnitude spectrum and phase of speech signals. While most existing speech enhancement (SE) solutions focus solely on enhancing the magnitude response, recent research highlights the importance of phase in perceptual speech quality.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!