Relative free energy (RFE) calculations are now widely used in academia and the industry, but their accuracy is often limited by poor sampling of the complexes' conformational ensemble. To help address conformational sampling problems when simulating many relative binding free energies, we developed a novel method termed multiple topology replica exchange of expanded ensembles (MT-REXEE). This method enables parallel expanded ensemble calculations, facilitating iterative RFE computations while allowing conformational exchange between parallel transformations.
View Article and Find Full Text PDFThe development of reliable and extensible molecular mechanics (MM) force fields-fast, empirical models characterizing the potential energy surface of molecular systems-is indispensable for biomolecular simulation and computer-aided drug design. Here, we introduce a generalized and extensible machine-learned MM force field, espaloma-0.3, and an end-to-end differentiable framework using graph neural networks to overcome the limitations of traditional rule-based methods.
View Article and Find Full Text PDFThe engineering of novel protein-ligand binding interactions, particularly for complex drug-like molecules, is an unsolved problem, which could enable many practical applications of protein biosensors. In this work, we analyzed two engineered biosensors, derived from the plant hormone sensor PYR1, to recognize either the agrochemical mandipropamid or the synthetic cannabinoid WIN55,212-2. Using a combination of quantitative deep mutational scanning experiments and molecular dynamics simulations, we demonstrated that mutations at common positions can promote protein-ligand shape complementarity and revealed prominent differences in the electrostatic networks needed to complement diverse ligands.
View Article and Find Full Text PDFGeneralized ensemble methods such as Hamiltonian replica exchange (HREX) and expanded ensemble (EE) have been shown effective in free energy calculations for various contexts, given their ability to circumvent free energy barriers via nonphysical pathways defined by states with different modified Hamiltonians. However, both HREX and EE methods come with drawbacks, such as limited flexibility in parameter specification or the lack of parallelizability for more complicated applications. To address this challenge, we present the method of replica exchange of expanded ensembles (REXEE), which integrates the principles of HREX and EE methods by periodically exchanging coordinates of EE replicas sampling different yet overlapping sets of alchemical states.
View Article and Find Full Text PDFForce fields are a key component of physics-based molecular modeling, describing the energies and forces in a molecular system as a function of the positions of the atoms and molecules involved. Here, we provide a review and scientific status report on the work of the Open Force Field (OpenFF) Initiative, which focuses on the science, infrastructure and data required to build the next generation of biomolecular force fields. We introduce the OpenFF Initiative and the related OpenFF Consortium, describe its approach to force field development and software, and discuss accomplishments to date as well as future plans.
View Article and Find Full Text PDFComputing free energy differences between metastable states characterized by nonoverlapping Boltzmann distributions is often a computationally intensive endeavor, usually requiring chains of intermediate states to connect them. Targeted free energy perturbation (TFEP) can significantly lower the computational cost of FEP calculations by choosing a set of invertible maps used to directly connect the distributions of interest, achieving the necessary statistically significant overlaps without sampling any intermediate states. Probabilistic generative models (PGMs) based on normalizing flow architectures can make it much easier via machine learning to train invertible maps needed for TFEP.
View Article and Find Full Text PDFThe engineering of novel protein-ligand binding interactions, particularly for complex drug-like molecules, is an unsolved problem which could enable many practical applications of protein biosensors. In this work, we analyzed two engineer ed biosensors, derived from the plant hormone sensor PYR1, to recognize either the agrochemical mandipropamid or the synthetic cannabinoid WIN55,212-2. Using a combination of quantitative deep mutational scanning experiments and molecular dynamics simulations, we demonstrated that mutations at common positions can promote protein-ligand shape complementarity and revealed prominent differences in the electrostatic networks needed to complement diverse ligands.
View Article and Find Full Text PDFLiquid-liquid phase separation (LLPS) is thought to be a main driving force in the formation of membraneless organelles. Examples of such organelles include the centrosome, central spindle, and stress granules. Recently, it has been shown that coiled-coil (CC) proteins, such as the centrosomal proteins pericentrin, spd-5, and centrosomin, might be capable of LLPS.
View Article and Find Full Text PDFPolymer and chemically modified biopolymer systems present unique challenges to traditional molecular simulation preparation workflows. First, typical polymer and biomolecular input formats, such as Protein Data Bank (PDB) files, lack adequate chemical information needed for the parameterization of new chemistries. Second, polymers are typically too large for accurate partial charge generation methods.
View Article and Find Full Text PDFMembraneless liquid compartments based on phase-separating biopolymers have been observed in diverse cell types and attributed to weak multivalent interactions predominantly based on intrinsically disordered domains. The design of liquid-liquid phase separated (LLPS) condensates based on de novo designed tunable modules that interact in a well-understood, controllable manner could improve our understanding of this phenomenon and enable the introduction of new features. Here we report the construction of CC-LLPS in mammalian cells, based on designed coiled-coil (CC) dimer-forming modules, where the stability of CC pairs, their number, linkers, and sequential arrangement govern the transition between diffuse, liquid and immobile condensates and are corroborated by coarse-grained molecular simulations.
View Article and Find Full Text PDFProtein tyrosine phosphatases (PTPs) are emerging drug targets for many diseases, including cancer, autoimmunity, and neurological disorders. A high degree of structural similarity between their catalytic domains, however, has hindered the development of selective pharmacological agents. Our previous research uncovered two unfunctionalized terpenoid inhibitors that selectively inhibit PTP1B over T-cell PTP (TCPTP), two PTPs with high sequence conservation.
View Article and Find Full Text PDFNeutral mutational drift is an important source of biological diversity that remains underexploited in fundamental studies of protein biophysics. This study uses a synthetic transcriptional circuit to study neutral drift in protein tyrosine phosphatase 1B (PTP1B), a mammalian signaling enzyme for which conformational changes are rate limiting. Kinetic assays of purified mutants indicate that catalytic activity, rather than thermodynamic stability, guides enrichment under neutral drift, where neutral or mildly activating mutations can mitigate the effects of deleterious ones.
View Article and Find Full Text PDFLiquid-liquid phase separation (LLPS) is thought to be a main driving force in the formation of membraneless organelles. Examples of such organelles include the centrosome, central spindle, and stress granules. Recently, it has been shown that coiled-coil (CC) proteins, such as the centrosomal proteins pericentrin, spd-5, and centrosomin, might be capable of LLPS.
View Article and Find Full Text PDFAccurate representations of van der Waals dispersion-repulsion interactions play an important role in high-quality molecular dynamics simulations. Training the force field parameters used in the Lennard Jones (LJ) potential typically used to represent these interactions is challenging, generally requiring adjustment based on simulations of macroscopic physical properties. The large computational expense of these simulations, especially when many parameters must be trained simultaneously, limits the size of training data set and number of optimization steps that can be taken, often requiring modelers to perform optimizations within a local parameter region.
View Article and Find Full Text PDFWe introduce the Open Force Field (OpenFF) 2.0.0 small molecule force field for drug-like molecules, code-named Sage, which builds upon our previous iteration, Parsley.
View Article and Find Full Text PDFProtein tyrosine phosphatases (PTPs) are emerging drug targets for many diseases, including type 2 diabetes, obesity, and cancer. However, a high degree of structural similarity between the catalytic domains of these enzymes has made the development of selective pharmacological inhibitors an enormous challenge. Our previous research uncovered two unfunctionalized terpenoid inhibitors that selectively inhibit PTP1B over TCPTP, two PTPs with high sequence conservation.
View Article and Find Full Text PDFPerforming alchemical transformations, in which one molecular system is nonphysically changed to another system, is a popular approach adopted in performing free energy calculations associated with various biophysical processes, such as protein-ligand binding or the transfer of a molecule between environments. While the sampling of alchemical intermediate states in either parallel (e.g.
View Article and Find Full Text PDFAmphiphilic monomers in polar solvents can self-assemble into lyotropic liquid crystal (LLC) bicontinuous cubic structures under the right composition and temperature conditions. After cross-linking, the resulting polymer membranes with three-dimensional (3D) continuous uniform channels are excellent candidates for filtration applications. Designing such membranes with the desired physical and chemical properties requires molecular-level understanding of the structure, which can be obtained through molecular modeling.
View Article and Find Full Text PDFMolecular simulations such as molecular dynamics (MD) and Monte Carlo (MC) simulations are powerful tools allowing the prediction of experimental observables in the study of systems such as proteins, membranes, and polymeric materials. The quality of predictions based on molecular simulations depend on the validity of the underlying physical assumptions. physical_validation allows users of molecular simulation programs to perform simple yet powerful tests of physical validity on their systems and setups.
View Article and Find Full Text PDFProtein tyrosine phosphatases (PTPs) are promising drug targets for treating a wide range of diseases such as diabetes, cancer, and neurological disorders, but their conserved active sites have complicated the design of selective therapeutics. This study examines the allosteric inhibition of PTP1B by amorphadiene (AD), a terpenoid hydrocarbon that is an unusually selective inhibitor. Molecular dynamics (MD) simulations carried out in this study suggest that AD can stably sample multiple neighboring sites on the allosterically influential C-terminus of the catalytic domain.
View Article and Find Full Text PDFNon-biological foldamers are a promising class of macromolecules that share similarities to classical biopolymers such as proteins and nucleic acids. Currently, designing novel foldamers is a non-trivial process, often involving many iterations of trial synthesis and characterization until folded structures are observed. In this work, we aim to tackle these foldamer design challenges using computational modeling techniques.
View Article and Find Full Text PDFDeveloping a sufficiently accurate classical force field representation of molecules is key to realizing the full potential of molecular simulations as a route to gaining a fundamental insight into a broad spectrum of chemical and biological phenomena. This is only possible, however, if the many complex interactions between molecules of different species in the system are accurately captured by the model. Historically, the intermolecular van der Waals (vdW) interactions have primarily been trained against densities and enthalpies of vaporization of pure (single-component) systems, with occasional usage of hydration free energies.
View Article and Find Full Text PDFDeveloping accurate classical force field representations of molecules is key to realizing the full potential of molecular simulations, both as a powerful route to gaining fundamental insights into a broad spectrum of chemical and biological phenomena and for predicting physicochemical and mechanical properties of substances. The Open Force Field Consortium is an industry-funded open science effort to this end, developing open-source tools for rapidly generating new high-quality small-molecule force fields. An integral aspect of this is the parameterization and assessment of force fields against high-quality, condensed-phase physical property data, curated from open data sources such as the NIST ThermoML Archive, alongside quantum chemical data.
View Article and Find Full Text PDFInsulin has been commonly adopted as a peptide drug to treat diabetes as it facilitates the uptake of glucose from the blood. The development of oral insulin remains elusive over decades owing to its susceptibility to the enzymes in the gastrointestinal tract and poor permeability through the intestinal epithelium upon dimerization. Recent experimental studies have revealed that certain O-linked glycosylation patterns could enhance insulin's proteolytic stability and reduce its dimerization propensity, but understanding such phenomena at the molecular level is still difficult.
View Article and Find Full Text PDFA high level of physical detail in a molecular model improves its ability to perform high accuracy simulations but can also significantly affect its complexity and computational cost. In some situations, it is worthwhile to add complexity to a model to capture properties of interest; in others, additional complexity is unnecessary and can make simulations computationally infeasible. In this work, we demonstrate the use of Bayesian inference for molecular model selection, using Monte Carlo sampling techniques accelerated with surrogate modeling to evaluate the Bayes factor evidence for different levels of complexity in the two-centered Lennard-Jones + quadrupole (2CLJQ) fluid model.
View Article and Find Full Text PDF