Towards parsimonious generative modeling of RNA families.

Nucleic Acids Res

Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative - LCQB, Paris, France.

Published: June 2024

Generative probabilistic models emerge as a new paradigm in data-driven, evolution-informed design of biomolecular sequences. This paper introduces a novel approach, called Edge Activation Direct Coupling Analysis (eaDCA), tailored to the characteristics of RNA sequences, with a strong emphasis on simplicity, efficiency, and interpretability. eaDCA explicitly constructs sparse coevolutionary models for RNA families, achieving performance levels comparable to more complex methods while utilizing a significantly lower number of parameters. Our approach demonstrates efficiency in generating artificial RNA sequences that closely resemble their natural counterparts in both statistical analyses and SHAPE-MaP experiments, and in predicting the effect of mutations. Notably, eaDCA provides a unique feature: estimating the number of potential functional sequences within a given RNA family. For example, in the case of cyclic di-AMP riboswitches (RF00379), our analysis suggests the existence of approximately 10^39 functional nucleotide sequences. While huge compared to the known <4000 natural sequences, this number represents only a tiny fraction of the vast pool of nearly 10^82 possible nucleotide sequences of the same length (136 nucleotides). These results underscore the promise of sparse and interpretable generative models, such as eaDCA, in enhancing our understanding of the expansive RNA sequence space.
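The orders of magnitude quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is not part of eaDCA and does not reproduce how the paper estimates the number of functional sequences. It only assumes a four-letter nucleotide alphabet (A, C, G, U) and takes the figure of about 10^39 functional sequences from the abstract as given. The function names are illustrative.

```python
import math


def log10_sequence_space(length: int, alphabet_size: int = 4) -> float:
    """log10 of the number of possible sequences of the given length."""
    return length * math.log10(alphabet_size)


def log10_functional_fraction(log10_functional: float, length: int) -> float:
    """log10 of the functional share of the full sequence space."""
    return log10_functional - log10_sequence_space(length)


# Values taken from the abstract: 136 nucleotides, about 10^39 functional sequences.
LENGTH = 136
LOG10_FUNCTIONAL = 39.0

total = log10_sequence_space(LENGTH)
fraction = log10_functional_fraction(LOG10_FUNCTIONAL, LENGTH)

print(f"log10(4^{LENGTH}) = {total:.2f}")                # about 81.88, i.e. nearly 10^82
print(f"log10(functional fraction) = {fraction:.2f}")  # about -42.88
```

Under these assumptions, the estimated functional sequences occupy roughly one part in 10^43 of the full sequence space, which is what "a tiny fraction" amounts to numerically.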

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11162787
DOI: http://dx.doi.org/10.1093/nar/gkae289
