The complex 'language' of plant RNA encodes a vast array of biological regulatory elements that orchestrate crucial aspects of plant growth, development and adaptation to environmental stresses. Recent advancements in foundation models (FMs) have demonstrated their unprecedented potential to decipher complex 'language' in biology. In this study, we introduced PlantRNA-FM, a high-performance and interpretable RNA FM specifically designed for plants.
RNA structure constitutes a new layer of gene regulatory mechanisms. RNA binding proteins can modulate RNA secondary structures, thus participating in post-transcriptional regulation. The DEAH-box helicase 36 (DHX36) is known to bind and unwind RNA G-quadruplex (rG4) structures, but the transcriptome-wide RNA structure remodeling induced by DHX36 binding and its impact on RNA fate remain poorly understood.
Nat Rev Mol Cell Biol
October 2024
The development of high-throughput RNA structure profiling methods in the past decade has greatly facilitated our ability to map and characterize different aspects of RNA structures transcriptome-wide in cell populations, single cells and single molecules. The resulting high-resolution data have provided insights into the static and dynamic nature of RNA structures, revealing their complexity as they perform their respective functions in the cell. In this Review, we discuss recent technical advances in the determination of RNA structures, and the roles of RNA structures in RNA biogenesis and functions, including in transcription, processing, translation, degradation, localization and RNA structure-dependent condensates.
DNA, beyond its canonical B-form double helix, adopts various alternative conformations, among which the i-motif, emerging in cytosine-rich sequences under acidic conditions, holds significant biological implications in transcription modulation and telomere biology. Despite recognizing the crucial role of i-motifs, predictive software for i-motif forming sequences has been limited. Addressing this gap, we introduce 'iM-Seeker', an innovative computational platform designed for the prediction and evaluation of i-motifs.
Plants, as sessile organisms, deploy transcriptional dynamics for adapting to extreme growth conditions such as cold stress. Emerging evidence suggests that chromatin architecture contributes to transcriptional regulation. However, the relationship between chromatin architectural dynamics and transcriptional reprogramming in response to cold stress remains unclear.
i-Motifs (iMs) are secondary structures formed in cytosine-rich DNA sequences and are involved in multiple functions in the genome. Although putative iM-forming sequences are widely distributed in the human genome, the folding status and strength of putative iMs vary dramatically. Much previous research on iMs has focused on assessing iM folding properties using biophysical experiments.
RNA decay serves as a crucial mechanism for maintaining cellular homeostasis and regulating gene expression. Large-scale analyses indicate that altered rates of decay contribute significantly to changes in mRNA levels, with up to half of these changes attributed to decay. The regulation of RNA decay is, at least in part, through structured RNA elements, especially in the non-coding regions of the mRNAs.
Loss-of-function mutations in the CYP24A1 protein-coding region causing reduced 25-hydroxyvitamin D (25OHD) and 1,25-dihydroxyvitamin D (1,25(OH)2D) catabolism have been observed in some cases of infantile hypercalcemia type 1 (HCINF1), which can manifest as nephrocalcinosis, hypercalcemia and adult-onset hypercalciuria, and renal stone formation. Some cases present with apparent CYP24A1 phenotypes but do not exhibit pathogenic mutations. Here, we assessed the molecular mechanisms driving apparent HCINF1 where there was a lack of CYP24A1 mutation.
Subcellular mRNA quantities and spatial distributions are fundamental for driving gene regulatory programmes. Single molecule RNA fluorescence in situ hybridization (smFISH) uses fluorescent probes to label individual mRNA molecules, thereby facilitating both localization and quantitative studies. Validated reference mRNAs function as positive controls and are required for calibration.
The study of RNAs has become one of the most influential research fields in contemporary biology and biomedicine. In the last few years, new sequencing technologies have produced an explosion of new and exciting discoveries in the field but have also given rise to many open questions. Defining these questions, together with old, long-standing gaps in our knowledge, is the spirit of this article.
Nucleotide composition is suggested to infer gene functionality and ecological adaptation of species to distinct environments. However, the underlying biological function of nucleotide composition dictating environmental adaptations is largely unknown. Here, we systematically analyze the nucleotide composition of transcriptomes across 1000 plants (1KP) and their corresponding habitats.
RNA G-quadruplex (rG4) is a vital RNA tertiary structure motif that involves the base pairs on both Hoogsteen and Watson-Crick faces of guanines. rG4 is of great importance in the post-transcriptional regulation of gene expression. Experimental technologies have advanced to identify in vitro and in vivo rG4s across diverse transcriptomes.
Cellular RNAs are heterogeneous with respect to their alternative processing and secondary structures, but the functional importance of this complexity is still poorly understood. A set of alternatively processed antisense non-coding transcripts, which are collectively called COOLAIR, are generated at the Arabidopsis floral-repressor locus FLOWERING LOCUS C (FLC). Different isoforms of COOLAIR influence FLC transcriptional output in warm and cold conditions.
RNA structures are essential to support RNA functions and regulation in various biological processes. Recently, a range of novel technologies have been developed to decode genome-wide RNA structures and novel modes of functionality across a wide range of species. In this review, we summarize key strategies for probing the RNA structurome and discuss the pros and cons of representative technologies.
Deep learning, or artificial neural networks, is a type of machine learning algorithm that can decipher underlying relationships from large volumes of data and has been successfully applied to solve structural biology questions, such as RNA structure prediction. RNA can fold into complex structures by forming hydrogen bonds, thereby playing an essential role in biological processes. While experimental effort has enabled resolving RNA structure at the genome-wide scale, deep learning has been more recently introduced for studying RNA structure and its functionality.
Most plant pentatricopeptide repeat (PPR) proteins localize to and function inside plastids and mitochondria. However, the function of PPRs that only localize to the cytoplasm remains unknown. Here, we demonstrated that the rice (Oryza sativa) PPR protein CYTOPLASM-LOCALIZED PPR1 (OsCPPR1) contributes to pollen development and localizes to the cytoplasm.
Background: Polyploidy, especially allopolyploidy, which entails merging divergent genomes via hybridization and whole-genome duplication (WGD), is a major route to speciation in plants. The duplication among the parental genomes (subgenomes) often leads to one subgenome becoming dominant over the other(s), resulting in subgenome asymmetry in gene content and expression. Polyploid wheats are allopolyploids with most genes present in two (tetraploid) or three (hexaploid) functional copies, which commonly show subgenome expression asymmetry.
RNA folding is an intrinsic property of RNA that serves a key role in every step of post-transcriptional regulation of gene expression, from RNA maturation to translation in plants. Recent developments of genome-wide RNA structure profiling methods have transformed research in this area, enabling the focus to shift from individual molecules to the study of tens of thousands of RNAs. Here, we provide a comprehensive review of recent advances in the field.
RNA structural elements occur in numerous single-stranded positive-sense RNA viruses. The stem-loop 2 motif (s2m) is one such element with an unusually high degree of sequence conservation, being found in the 3' untranslated region (UTR) in the genomes of many astroviruses, some picornaviruses and noroviruses, and a variety of coronaviruses, including severe acute respiratory syndrome coronavirus (SARS-CoV) and SARS-CoV-2. The evolutionary conservation and its occurrence in all viral subgenomic transcripts imply a key role for s2m in the viral infection cycle.
Background: mRNA processing is critical for gene expression. A challenge in regulating mRNA processing is how to recognize the actual mRNA processing sites, such as splice and polyadenylation sites, when the sequence content is insufficient for this purpose. Previous studies suggested that RNA structure affects mRNA processing.