We present the nuclear magnetic resonance spectroscopy (NMR) solution structure of the 5'-terminal stem loop 5_SL1 (SL1) of the SARS-CoV-2 genome. SL1 contains two A-form helical elements and two regions with non-canonical structure, namely an apical pyrimidine-rich loop and an asymmetric internal loop with one and two nucleotides at the 5'- and 3'-terminal part of the sequence, respectively. The conformational ensemble representing the averaged solution structure of SL1 was validated using NMR residual dipolar coupling (RDC) and small-angle X-ray scattering (SAXS) data.
View Article and Find Full Text PDFBoth experimental and theoretical structure determinations of RNAs have remained challenging due to the intrinsic dynamics of RNAs. We report here an integrated nuclear magnetic resonance/molecular dynamics (NMR/MD) structure determination approach to describe the dynamic structure of the CUUG tetraloop. We show that the tetraloop undergoes substantial dynamics, leading to averaging of the experimental data.
View Article and Find Full Text PDFProteins play important roles in biology, biotechnology and pharmacology, and missense variants are a common cause of disease. Discovering functionally important sites in proteins is a central but difficult problem because of the lack of large, systematic data sets. Sequence conservation can highlight residues that are functionally important but is often convoluted with a signal for preserving structural stability.
View Article and Find Full Text PDFRNA viruses have evolved elaborate strategies to protect their genomes, including 5' capping. However, until now no RNA 5' cap has been identified for hepatitis C virus (HCV), which causes chronic infection, liver cirrhosis and cancer. Here we demonstrate that the cellular metabolite flavin adenine dinucleotide (FAD) is used as a non-canonical initiating nucleotide by the viral RNA-dependent RNA polymerase, resulting in a 5'-FAD cap on the HCV RNA.
View Article and Find Full Text PDFBackground: The application of Machine Learning (ML) to genetic individual-level data represents a foreseeable advancement for the field, which is still in its infancy. Here, we aimed to evaluate the feasibility and accuracy of an ML-based model for disease risk prediction applied to Primary Biliary Cholangitis (PBC).
Methods: Genome-wide significant variants identified in subjects of European ancestry in the recently released second international meta-analysis of GWAS in PBC were used as input data.
We describe the conformational ensemble of the single-stranded r(UCAAUC) oligonucleotide obtained using extensive molecular dynamics (MD) simulations and Rosetta's FARFAR2 algorithm. The conformations observed in MD consist of A-form-like structures and variations thereof. These structures are not present in the pool generated using FARFAR2.
View Article and Find Full Text PDFThe 5' untranslated region (UTR) of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome is a conserved, functional and structured genomic region consisting of several RNA stem-loop elements. While the secondary structure of such elements has been determined experimentally, their three-dimensional structures are not known yet. Here, we predict structure and dynamics of five RNA stem loops in the 5'-UTR of SARS-CoV-2 by extensive atomistic molecular dynamics simulations, more than 0.
View Article and Find Full Text PDFNanodiscs are membrane mimetics that consist of a protein belt surrounding a lipid bilayer, and are broadly used for characterization of membrane proteins. Here, we investigate the structure, dynamics and biophysical properties of two small nanodiscs, MSP1D1ΔH5 and ΔH4H5. We combine our SAXS and SANS experiments with molecular dynamics simulations and previously obtained NMR and EPR data to derive and validate a conformational ensemble that represents the structure and dynamics of the nanodisc.
View Article and Find Full Text PDFWe provide an atomic-level description of the structure and dynamics of the UUCG RNA stem-loop by combining molecular dynamics simulations with experimental data. The integration of simulations with exact nuclear Overhauser enhancements data allowed us to characterize two distinct states of this molecule. The most stable conformation corresponds to the consensus three-dimensional structure.
View Article and Find Full Text PDFMany proteins contain multiple folded domains separated by flexible linkers, and the ability to describe the structure and conformational heterogeneity of such flexible systems pushes the limits of structural biology. Using the three-domain protein TIA-1 as an example, we here combine coarse-grained molecular dynamics simulations with previously measured small-angle scattering data to study the conformation of TIA-1 in solution. We show that while the coarse-grained potential (Martini) in itself leads to too compact conformations, increasing the strength of protein-water interactions results in ensembles that are in very good agreement with experiments.
View Article and Find Full Text PDFProg Mol Biol Transl Sci
January 2021
Molecular simulations and biophysical experiments can be used to provide independent and complementary insights into the molecular origin of biological processes. A particularly useful strategy is to use molecular simulations as a modeling tool to interpret experimental measurements, and to use experimental data to refine our biophysical models. Thus, explicit integration and synergy between molecular simulations and experiments is fundamental for furthering our understanding of biological processes.
View Article and Find Full Text PDFWe describe a Bayesian/Maximum entropy (BME) procedure and software to construct a conformational ensemble of a biomolecular system by integrating molecular simulations and experimental data. First, an initial conformational ensemble is constructed using, for example, Molecular Dynamics or Monte Carlo simulations. Due to potential inaccuracies in the model and finite sampling effects, properties predicted from simulations may not agree with experimental data.
View Article and Find Full Text PDFEmpirical force fields for biomolecular systems are usually derived from quantum chemistry calculations and validated against experimental data. We here show how it is possible to refine the full dihedral-angle potential of the Amber RNA force field by using solution NMR data as well as stability of known structural motifs. The procedure can be used to mix multiple systems and heterogeneous experimental information and crucially depends on a regularization term chosen with a cross-validation procedure.
View Article and Find Full Text PDFRNA molecules are highly dynamic systems characterized by a complex interplay between sequence, structure, dynamics, and function. Molecular simulations can potentially provide powerful insights into the nature of these relationships. The analysis of structures and molecular trajectories of nucleic acids can be nontrivial because it requires processing very high-dimensional data that are not easy to visualize and interpret.
View Article and Find Full Text PDFA fundamental challenge in biological research is achieving an atomic-level description and mechanistic understanding of the function of biomolecules. Techniques for biomolecular simulations have undergone substantial developments, and their accuracy and scope have expanded considerably. Progress has been made through an increasingly tight integration of experiments and simulations, with experiments being used to refine simulations and simulations used to interpret experiments.
View Article and Find Full Text PDFRNA molecules are key players in numerous cellular processes and are characterized by a complex relationship between structure, dynamics, and function. Despite their apparent simplicity, RNA oligonucleotides are very flexible molecules, and understanding their internal dynamics is particularly challenging using experimental data alone. We show how to reconstruct the conformational ensemble of four RNA tetranucleotides by combining atomistic molecular dynamics simulations with nuclear magnetic resonance spectroscopy data.
View Article and Find Full Text PDFWith both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules.
View Article and Find Full Text PDFNucleic Acids Res
February 2018
We introduce the SPlit-and-conQueR (SPQR) model, a coarse-grained (CG) representation of RNA designed for structure prediction and refinement. In our approach, the representation of a nucleotide consists of a point particle for the phosphate group and an anisotropic particle for the nucleoside. The interactions are, in principle, knowledge-based potentials inspired by the $\mathcal {E}$SCORE function, a base-centered scoring function.
View Article and Find Full Text PDFCoarse-grained models can be of great help to address the problem of structure prediction in nucleic acids. On one hand they can make the prediction more efficient, while on the other hand they can also help to identify the essential degrees of freedom and interactions for the description of a number of structures. With the aim to provide an all-atom representation in an explicit solvent to the predictions of our SPlit and conQueR (SPQR) coarse-grained model of RNA, we recently introduced a backmapping procedure which enforces the predicted structure into an atomistic one by means of steered molecular dynamics.
View Article and Find Full Text PDFWe report a map of RNA tetraloop conformations constructed by calculating pairwise distances among all experimentally determined four-nucleotide hairpin loops. Tetraloops with similar structures are clustered together and, as expected, the two largest clusters are the canonical GNRA and UNCG folds. We identify clusters corresponding to known tetraloop folds such as GGUG, RNYA, AGNN, and CUUG.
View Article and Find Full Text PDFWe report the folding thermodynamics of ccUUCGgg and ccGAGAgg RNA tetraloops using atomistic molecular dynamics simulations. We obtain a previously unreported estimation of the folding free energy using parallel tempering in combination with well-tempered metadynamics. A key ingredient is the use of a recently developed metric distance, eRMSD, as a biased collective variable.
View Article and Find Full Text PDFJ Chem Theory Comput
September 2016
The computer-aided folding of biomolecules, particularly RNAs, is one of the most difficult challenges in computational structural biology. RNA tetraloops are fundamental RNA motifs playing key roles in RNA folding and RNA-RNA and RNA-protein interactions. Although state-of-the-art Molecular Dynamics (MD) force fields correctly describe the native state of these tetraloops as a stable free-energy basin on the microsecond time scale, enhanced sampling techniques reveal that the native state is not the global free energy minimum, suggesting yet unidentified significant imbalances in the force fields.
View Article and Find Full Text PDF