Automatic recognition of complementary strands: lessons regarding machine learning abilities in RNA folding.

Front Genet

Institute for Research in Immunology and Cancer, Montréal, QC, Canada.

Published: September 2023

Prediction of RNA secondary structure from single sequences still needs substantial improvements. The application of machine learning (ML) to this problem has become increasingly popular. However, ML algorithms are prone to overfitting, limiting the ability to learn more about the inherent mechanisms governing RNA folding. It is natural to use high-capacity models when solving such a difficult task, but poor generalization is expected when too few examples are available. Here, we report the relation between capacity and performance on a fundamental related problem: determining whether two sequences are fully complementary. Our analysis focused on the impact of model architecture and capacity as well as dataset size and nature on classification accuracy. We observed that low-capacity models are better suited for learning with mislabelled training examples, while large capacities improve the ability to generalize to structurally dissimilar data. It turns out that neural networks struggle to grasp the fundamental concept of base complementarity, especially in lengthwise extrapolation context. Given a more complex task like RNA folding, it comes as no surprise that the scarcity of useable examples hurdles the applicability of machine learning techniques to this field.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507318	PMC
http://dx.doi.org/10.3389/fgene.2023.1254226	DOI Listing

Publication Analysis

Top Keywords

machine learning

rna folding

automatic recognition

recognition complementary

complementary strands

strands lessons

lessons machine

learning

learning abilities

rna

Similar Publications

Cadmium and selenium blood levels in association with congestive heart failure in diabetic and prediabetic patients: a cross-sectional study from the national health and nutrition examination survey.

Diabetol Metab Syndr

January 2025

School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, 7 Sassoon Road, Pok Fu Lam, Hong Kong, SAR, China.

Renyue Ji Haisheng Wu Hongli Lin Yang Li Yumeng Shi

Background: Epidemiological research on the association between heavy metals and congestive heart failure (CHF) in individuals with abnormal glucose metabolism is scarce. The study addresses this research gap by examining the link between exposure to heavy metals and the odds of CHF in a population with dysregulated glucose metabolism.

Method: This cross-sectional study includes 7326 patients with diabetes and prediabetes from the National Health and Nutrition Examination Survey from 2011 to 2018.

View Article and Find Full Text PDF

Similar Publications

OSBPL3 modulates the immunosuppressive microenvironment and predicts therapeutic outcomes in pancreatic cancer.

Biol Direct

January 2025

School of Medicine, South China University of Technology, Guangzhou, 510006, China.

Qihui Sun Xiaoqi Zhu Qi Zou Yang Chen Tingting Wen

Background: Pancreatic cancer is characterized by a complex tumor microenvironment that hinders effective immunotherapy. Identifying key factors that regulate the immunosuppressive landscape is crucial for improving treatment strategies.

Methods: We constructed a prognostic and risk assessment model for pancreatic cancer using 101 machine learning algorithms, identifying OSBPL3 as a key gene associated with disease progression and prognosis.

View Article and Find Full Text PDF

Similar Publications

Prediction of urinary tract infection using machine learning methods: a study for finding the most-informative variables.

BMC Med Inform Decis Mak

January 2025

Department of Pediatrics, School of Medicine, Ekbatan Hospital, Hamadan University of Medical Sciences, Hamadan, Iran.

Sajjad Farashi Hossein Emad Momtaz

Background: Urinary tract infection (UTI) is a frequent health-threatening condition. Early reliable diagnosis of UTI helps to prevent misuse or overuse of antibiotics and hence prevent antibiotic resistance. The gold standard for UTI diagnosis is urine culture which is a time-consuming and also an error prone method.

View Article and Find Full Text PDF

Similar Publications

A machine learning model accurately identifies glycogen storage disease Ia patients based on plasma acylcarnitine profiles.

Orphanet J Rare Dis

January 2025

Laboratory of Metabolic Diseases, Department of Laboratory Medicine, University Medical Center Groningen, University of Groningen, Hanzeplein 1, Postbus, Groningen, 30001 - 9700 RB, the Netherlands.

Joost Groen Bas M de Haan Ruben J Overduin Andrea B Haijer-Schreuder Terry Gj Derks

Background: Glycogen storage disease (GSD) Ia is an ultra-rare inherited disorder of carbohydrate metabolism. Patients often present in the first months of life with fasting hypoketotic hypoglycemia and hepatomegaly. The diagnosis of GSD Ia relies on a combination of different biomarkers, mostly routine clinical chemical markers and subsequent genetic confirmation.

View Article and Find Full Text PDF

Similar Publications

Integrated bioinformatics analysis and experimental validation of exosome-related gene signature in steroid-induced osteonecrosis of the femoral head.

J Orthop Surg Res

January 2025

Department of Hand-Foot Microsurgery, Shenzhen Nanshan People's Hospital, The 6th Affiliated Hospital of Shenzhen University Health Science Center, Shenzhen, China.

Renqun Mao Wen Bi Mengyue Yang Lei Qin Wenqing Li

Background: Steroid-induced osteonecrosis of the femoral head (SIONFH) is a universal hip articular disease and is very hard to perceive at an early stage. The understanding of the pathogenesis of SIONFH is still limited, and the identification of efficient diagnostic biomarkers is insufficient. This research aims to recognize and validate the latent exosome-related molecular signature in SIONFH diagnosis by employing bioinformatics to investigate exosome-related mechanisms in SIONFH.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!