Molecular dynamics (MD) simulation is a well-established method for studying protein motion at the atomic scale. However, it is computationally intensive and generates massive amounts of data. One way of addressing the dual challenges of computation efficiency and data analysis is to construct simplified models of long-timescale protein motion from MD simulation data. In this direction, we propose to use Markov models with hidden states, in which the Markovian states represent potentially overlapping probabilistic distributions over protein conformations. We also propose a principled criterion for evaluating the quality of a model by its ability to predict long-timescale protein motions. Our method was tested on 2D synthetic energy landscapes and two extensively studied peptides, alanine dipeptide and the villin headpiece subdomain (HP-35 NleNle). One interesting finding is that although a widely accepted model of alanine dipeptide contains six states, a simpler model with only three states is equally good for predicting long-timescale motions. We also used the constructed Markov models to estimate important kinetic and dynamic quantities for protein folding, in particular, mean first-passage time. The results are consistent with available experimental measurements.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881362 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btq177 | DOI Listing |
J Chem Phys
December 2024
Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, USA.
Studying the kinetics of long-timescale rare events is a fundamental challenge in molecular simulation. To address this problem, we propose an integration of two different rare-event sampling philosophies: biased enhanced sampling and unbiased path sampling. Enhanced sampling methods, e.
View Article and Find Full Text PDFbioRxiv
November 2024
Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, 1218 S 5th Ave, Monrovia, CA 91016.
Bayesian network modeling (BN modeling, or BNM) is an interpretable machine learning method for constructing probabilistic graphical models from the data. In recent years, it has been extensively applied to diverse types of biomedical datasets. Concurrently, our ability to perform long-timescale molecular dynamics (MD) simulations on proteins and other materials has increased exponentially.
View Article and Find Full Text PDFPhys Rev Lett
November 2024
Freie Universität Berlin, Fachbereich Physik, 14195 Berlin, Germany.
Protein folding is an intrinsically multitimescale problem. While it is accepted that non-Markovian effects are present on short timescales, it is unclear whether memory-dependent friction influences long-timescale protein folding reaction kinetics. We combine friction memory-kernel extraction techniques with recently published extensive all-atom simulations of the α3D protein under neutral and reduced pH conditions, and we show that the pH reduction modifies the friction acting on the folding protein by dramatically decreasing the friction memory decay time.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
November 2024
Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230027, China.
Animal behavior is organized into nested temporal patterns that span multiple timescales. This behavior hierarchy is believed to arise from a hierarchical neural architecture: Neurons near the top of the hierarchy are involved in planning, selecting, initiating, and maintaining motor programs, whereas those near the bottom of the hierarchy act in concert to produce fine spatiotemporal motor activity. In , behavior on a long timescale emerges from ordered and flexible transitions between different behavioral states, such as forward, reversal, and turn.
View Article and Find Full Text PDFAdv Exp Med Biol
September 2024
Faculty of Medicine, Department of General Surgery, Gazi University, Besevler, Ankara, Turkey.
Epigenetic changes have long-lasting impacts, which influence the epigenome and are maintained during cell division. Thus, human genome changes have required a very long timescale to become a major contributor to the current obesity pandemic. Whereas bidirectional effects of coronavirus disease 2019 (COVID-19) and obesity pandemics have given the opportunity to explore, how the viral microribonucleic acids (miRNAs) use the human's transcriptional machinery that regulate gene expression at a posttranscriptional level.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!