Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm.

Nucleic Acids Res

Biotechnology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand; Bioinformatics and Systems Biology Program, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand

Published: June 2014

To identify non-coding RNA (ncRNA) signals within genomic regions, a classification tool was developed based on a hybrid random forest (RF) with a logistic regression model to efficiently discriminate short ncRNA sequences as well as long complex ncRNA sequences. This RF-based classifier was trained on a well-balanced dataset with a discriminative set of features and achieved an accuracy, sensitivity and specificity of 92.11%, 90.7% and 93.5%, respectively. The selected feature set includes a newly proposed feature, SCORE. This feature is generated by a logistic regression function that combines five significant features (structure, sequence, modularity, structural robustness and coding potential) to enable improved characterization of long ncRNA (lncRNA) elements. The use of SCORE improved the performance of the RF-based classifier in the identification of Rfam lncRNA families. A genome-wide ncRNA classification framework was applied to a wide variety of organisms, with an emphasis on those of economic, social, public health, environmental and agricultural significance, such as various bacterial genomes, the Arthrospira (Spirulina) genome, and rice and human genomic regions. Our framework was able to identify known ncRNAs with sensitivities of greater than 90% and 77.7% for prokaryotic and eukaryotic sequences, respectively. Our classifier is available at http://ncrna-pred.com/HLRF.htm.
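As a rough illustration of the idea described above, the following Python sketch shows how a logistic function could fold five component features into a single composite value and append it to a feature vector, and how accuracy, sensitivity and specificity are computed from binary predictions. It is not the authors' implementation: the weights, the intercept, the feature names and the helper functions `score`, `with_score` and `evaluate` are all hypothetical, since the abstract does not report the fitted coefficients or the exact feature encoding. In the approach the abstract describes, the extended feature vector would then be passed to a random forest classifier, which is omitted here.

```python
import math

# Hypothetical coefficients for illustration only; the abstract does not
# report the fitted logistic regression weights or intercept.
WEIGHTS = {
    "structure": 1.2,
    "sequence": 0.8,
    "modularity": 0.5,
    "robustness": 0.9,
    "coding_potential": -1.5,
}
INTERCEPT = -0.3


def score(features):
    """Combine the five component features with a logistic function.

    Returns a value strictly between 0 and 1.
    """
    z = INTERCEPT + sum(WEIGHTS[name] * features[name] for name in WEIGHTS)
    return 1.0 / (1.0 + math.exp(-z))


def with_score(features):
    """Return a copy of the feature dict extended with a composite SCORE entry."""
    extended = dict(features)
    extended["SCORE"] = score(features)
    return extended


def evaluate(y_true, y_pred):
    """Accuracy, sensitivity and specificity for binary labels (1 = ncRNA, 0 = other)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else float("nan"),
        # Sensitivity: fraction of true ncRNAs that were recovered.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Specificity: fraction of non-ncRNAs that were correctly rejected.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


if __name__ == "__main__":
    # Made-up feature values for a single candidate sequence.
    example = {
        "structure": 0.7,
        "sequence": 0.4,
        "modularity": 0.2,
        "robustness": 0.6,
        "coding_potential": 0.1,
    }
    print(with_score(example))
    print(evaluate([1, 1, 0, 0], [1, 0, 0, 0]))
```

Keeping the composite value as one extra column alongside the other features, rather than replacing them, lets a downstream classifier weigh the combined signal against the individual ones.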


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4066759
DOI: http://dx.doi.org/10.1093/nar/gku325

