A computational approach to identify genes for functional RNAs in genomic sequences.

Nucleic Acids Res

Computational and Theoretical Biology Department, Physical Biosciences Division, National Energy Research Scientific Computing Center, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA.

Published: October 2001

AI Article Synopsis

  • A new machine learning approach using neural networks and support vector machines has been developed to identify genes encoding novel functional RNAs in unannotated regions of prokaryotic and archaeal genomes.
  • The method, initially tested on the Escherichia coli genome, showed high accuracy (80-90% for bacteria and 90-99% for hyperthermophilic archaea) using nucleotide composition and further improved when combined with RNA sequence motifs and free energy calculations.
  • The research has successfully identified known fRNAs not included in training datasets and predicted hundreds of new RNAs, suggesting many unidentified RNAs exist in simple genomes, with accessible online resources for user predictions.

Article Abstract

Currently there is no successful computational approach for identification of genes encoding novel functional RNAs (fRNAs) in genomic sequences. We have developed a machine learning approach using neural networks and support vector machines to extract common features among known RNAs for prediction of new RNA genes in the unannotated regions of prokaryotic and archaeal genomes. The Escherichia coli genome was used for development, but we have applied this method to several other bacterial and archaeal genomes. Networks based on nucleotide composition were 80-90% accurate in jackknife testing experiments for bacteria and 90-99% for hyperthermophilic archaea. We also achieved a significant improvement in accuracy by combining these predictions with those obtained using a second set of parameters consisting of known RNA sequence motifs and the calculated free energy of folding. Several known fRNAs not included in the training datasets were identified as well as several hundred predicted novel RNAs. These studies indicate that there are many unidentified RNAs in simple genomes that can be predicted computationally as a precursor to experimental study. Public access to our RNA gene predictions and an interface for user predictions is available via the web.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC60242PMC
http://dx.doi.org/10.1093/nar/29.19.3928DOI Listing

Publication Analysis

Top Keywords

computational approach
8
functional rnas
8
genomic sequences
8
archaeal genomes
8
rnas
5
approach identify
4
identify genes
4
genes functional
4
rnas genomic
4
sequences currently
4

Similar Publications

While the early mental health of girls and boys is similar, as children age, girls tend to report worse mental health than boys. Explanations for these gendered disparities remain elusive. This study seeks to understand the social context in which mental health experiences are shaped.

View Article and Find Full Text PDF

The accurate non-invasive detection and estimation of central aortic pressure waveforms (CAPW) are crucial for reliable treatments of cardiovascular system diseases. But the accuracy and practicality of current estimation methods need to be improved. Our study combines a meta-learning neural network and a physics-driven method to accurately estimate CAPW based on personalized physiological indicators.

View Article and Find Full Text PDF

Aim: Dynamic cancer control is a current health system priority, yet methods for achieving it are lacking. This study aims to review the application of system dynamics modeling (SDM) on cancer control and evaluate the research quality.

Methods: Articles were searched in PubMed, Web of Science, and Scopus from the inception of the study to November 15th, 2023.

View Article and Find Full Text PDF

Comprehensive Non-invasive Versus Invasive Approach to Evaluate Cardiac Allograft Vasculopathy in Heart Transplantation: The CCTA-HTx Study.

Circ Cardiovasc Imaging

January 2025

Cardiovascular Center Aalst, Onze-Lieve-Vrouwziekenhuis (OLV) Clinic, Aalst, Belgium (M. Belmonte, P.P., M.M.V., M. Beles, H.O., R.S., G.E., M.S., R.D., W.H., J.V.K., J.B., M.V.).

Background: Coronary computed tomography angiography (CCTA) is emerging as a valuable tool for noninvasive surveillance of cardiac allograft vasculopathy (CAV) in patients with heart transplant (HTx). We assessed the diagnostic performance of a comprehensive CCTA-based approach compared with the invasive reference, which includes invasive coronary angiography, intravascular ultrasound, and fractional flow reserve, for detecting CAV.

Methods: This was a multicenter prospective study including 37 patients with HTx who underwent CCTA, invasive coronary angiography, intravascular ultrasound, and fractional flow reserve.

View Article and Find Full Text PDF

This study investigates the structural and dynamic properties of ternary mixtures composed of NaPF, ethylene carbonate (EC), and the ionic liquid choline glycine (ChGly), with a focus on their potential as electrolytes for supercapacitors. The combination of NaPF-EC, known for its high ionic conductivity, with the biodegradable and environmentally friendly ChGly offers a promising approach to enhancing electrolyte performance. Through molecular simulations, we analyze how the inclusion of small concentrations of ChGly affects key properties such as density, cohesive energy, and ion mobility.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!