The time scales of long-time atomistic molecular dynamics simulations are typically reported in microseconds, while the time scales for experiments studying the kinetics of amyloid fibril formation are typically reported in minutes or hours. This time scale deficit of roughly 9 orders of magnitude presents a major challenge in the design of computer simulation methods for studying protein aggregation events. Coarse-grained molecular simulations offer a computationally tractable path forward for exploring the molecular mechanism driving the formation of these structures, which are implicated in diseases such as Alzheimer's, Parkinson's, and type-II diabetes.
View Article and Find Full Text PDFCoarse-graining is a powerful tool for extending the reach of dynamic models of proteins and other biological macromolecules. Topological coarse-graining, in which biomolecules or sets thereof are represented via graph structures, is a particularly useful way of obtaining highly compressed representations of molecular structures, and simulations operating via such representations can achieve substantial computational savings. A drawback of coarse-graining, however, is the loss of atomistic detail-an effect that is especially acute for topological representations such as protein structure networks (PSNs).
View Article and Find Full Text PDFAmyloid fibril formation is central to the etiology of a wide range of serious human diseases, such as Alzheimer's disease and prion diseases. Despite an ever growing collection of amyloid fibril structures found in the Protein Data Bank (PDB) and numerous clinical trials, therapeutic strategies remain elusive. One contributing factor to the lack of progress on this challenging problem is incomplete understanding of the mechanisms by which these locally ordered protein aggregates self-assemble in solution.
View Article and Find Full Text PDFSimulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP.
View Article and Find Full Text PDFAmyloid fibrils are locally ordered protein aggregates that self-assemble under a variety of physiological and in vitro conditions. Their formation is of fundamental interest as a physical chemistry problem and plays a central role in Alzheimer's disease, Type II diabetes, and other human diseases. As the number of known amyloid fibril structures has grown, the need has arisen for a nomenclature for describing and classifying fibril types, as well as a theoretical description of the physics that gives rise to the self-assembly of these structures.
View Article and Find Full Text PDFA machine learning-based methodology for the prediction of chemical reaction products, along with automated elucidation of mechanistic details via phase space analysis of reactive trajectories, is introduced using low-dimensional heuristic models and then applied to ab initio computer simulations of the photodissociation of acetaldehyde, an important chemical system in atmospheric chemistry. Our method is centered around training Support Vector Machines (SVMs) to identify optimal separatrices that delineate the regions of phase space that lead to different chemical reaction products. In contrast to the more common "black box" type machine learning methodologies for analyzing chemical simulation data, this SVM-based methodology allows for mechanistic insight to be gleaned from further analysis of the positioning of the phase space points used to train the SVM with respect to the separatrices.
View Article and Find Full Text PDFThe milestoning algorithm of Elber and co-workers creates a framework for computing the time scale of processes that are too long and too complex to be studied using simply brute force simulations. The fundamental objects involved in the milestoning algorithm are the first passage time distributions () between adjacent conformational milestones and . The method proposed herein aims to further enhance milestoning (or other interface based sampling methods) by employing an artificially applied force, akin to a wind that blows the trajectories from their initial to their final states, and by subsequently applying corrective weights to the trajectories to yield the true first passage time distributions () in a fraction of the computation time required for unassisted calculations.
View Article and Find Full Text PDFIn the milestoning framework, and more generally in related transition interface sampling schemes, one significantly enhances the calculation of relaxation rates for complex equilibrium kinetics from molecular dynamics simulations between the milestones or interfaces. The goal of the present paper is to advance milestoning applications into the realm of non-equilibrium statistical mechanics, in particular, to calculate entire time correlation functions. In order to accomplish this, we introduce a novel methodology for obtaining the flux through a given milestone configuration as a function of both time and initial configuration and build upon it with a novel formalism describing autocorrelation for Langevin motion in a discrete configuration space.
View Article and Find Full Text PDFSeveral recent implementations of algorithms for sampling reaction pathways employ a strategy for placing interfaces or milestones across the reaction coordinate manifold. Interfaces can be introduced such that the full feature space describing the dynamics of a macromolecule is divided into Voronoi (or other) cells, and the global kinetics of the molecular motions can be calculated from the set of fluxes through the interfaces between the cells. Although some methods of this type are exact for an arbitrary set of cells, in practice, the calculations will converge fastest when the interfaces are placed in regions where they can best capture transitions between configurations corresponding to local minima.
View Article and Find Full Text PDFThe B-DNA double helix can dynamically accommodate G-C and A-T base pairs in either Watson-Crick or Hoogsteen configurations. Here, we show that G-C(+) (in which + indicates protonation) and A-U Hoogsteen base pairs are strongly disfavored in A-RNA. As a result,N(1)-methyladenosine and N(1)-methylguanosine, which occur in DNA as a form of alkylation damage and in RNA as post-transcriptional modifications, have dramatically different consequences.
View Article and Find Full Text PDFA novel aspect in the area of mechano-chemistry concerns the effect of external forces on enzyme activity, i.e., the existence of mechano-catalytic coupling.
View Article and Find Full Text PDF