Epik version 7 is a software program that uses machine learning for predicting the pKa values and protonation state distribution of complex, druglike molecules. Using an ensemble of atomic graph convolutional neural networks (GCNNs) trained on over 42,000 pKa values across broad chemical space from both experimental and computed origins, the model predicts pKa values with 0.42 and 0.
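For orientation on what a protonation state distribution means, the following minimal sketch applies the textbook Henderson-Hasselbalch relation to a single ionizable site. It is not Epik's GCNN model; the function name `protonated_fraction` and the example pKa of 9.0 are assumptions chosen for illustration only.

```python
def protonated_fraction(pka: float, ph: float) -> float:
    """Fraction of a single acid/base site that is protonated at a given pH.

    Henderson-Hasselbalch: [HA] / ([HA] + [A-]) = 1 / (1 + 10**(pH - pKa)).
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


if __name__ == "__main__":
    # Hypothetical site with pKa 9.0, evaluated at pH 7.4.
    frac = protonated_fraction(pka=9.0, ph=7.4)
    print(f"protonated: {frac:.3f}, deprotonated: {1.0 - frac:.3f}")
```

A molecule with several coupled ionizable sites needs a full microstate treatment, which this one-site formula does not attempt.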
The recently developed AlphaFold2 (AF2) algorithm predicts proteins' 3D structures from amino acid sequences. The open AlphaFold protein structure database covers the complete human proteome. Using an industry-leading molecular docking method (Glide), we investigated the virtual screening performance of 37 common drug targets, each with an AF2 structure and known holo and apo structures from the DUD-E data set.
J Chem Theory Comput
November 2021
With the advent of make-on-demand commercial libraries, the number of purchasable compounds available for virtual screening and assay has grown explosively in recent years, with several libraries eclipsing one billion compounds. Today's screening libraries are larger and more diverse, enabling the discovery of more-potent hit compounds and unlocking new areas of chemical space, represented by new core scaffolds. Applying physics-based in silico screening methods in an exhaustive manner, where every molecule in the library must be enumerated and evaluated independently, is increasingly cost-prohibitive.
Aim: We introduce AutoQSAR, an automated machine-learning application to build, validate and deploy quantitative structure-activity relationship (QSAR) models.
Methodology/results: The process of descriptor generation, feature selection and the creation of a large number of QSAR models has been automated into a single workflow within AutoQSAR. The models are built using a variety of machine-learning methods, and each model is scored using a novel approach.
We have developed a new methodology for protein-ligand docking and scoring, WScore, incorporating a flexible description of explicit water molecules. The locations and thermodynamics of the waters are derived from a WaterMap molecular dynamics simulation. The water structure is employed to provide an atomic level description of ligand and protein desolvation.
Glide SP mode enrichment results for two preparations of the DUD dataset and native ligand docking RMSDs for two preparations of the Astex dataset are presented. Following a best-practices preparation scheme, an average RMSD of 1.140 Å for native ligand docking with Glide SP is computed.
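The native ligand docking RMSD quoted above is a root-mean-square deviation between docked and crystallographic ligand coordinates. A minimal sketch of that calculation follows, assuming both poses list the same atoms in the same order and already share the receptor frame; the name `pose_rmsd` is hypothetical, and this is not the tool used to produce the reported number.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def pose_rmsd(docked: Sequence[Coord], native: Sequence[Coord]) -> float:
    """In-place RMSD (in the input units) between two ligand poses.

    Assumes both sequences hold the same atoms in the same order and share
    the receptor frame, so no superposition is applied.
    """
    if len(docked) != len(native) or not docked:
        raise ValueError("poses must be non-empty and have matching atom counts")
    sq_sum = sum(
        (a - b) ** 2
        for p, q in zip(docked, native)
        for a, b in zip(p, q)
    )
    return math.sqrt(sq_sum / len(docked))
```

The sketch ignores symmetry-equivalent atom orderings (for example, a flipped phenyl ring), which a careful benchmark would account for.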
Glide is a ligand docking program for predicting protein-ligand binding modes and ranking ligands via high-throughput virtual screening. Glide utilizes two different scoring functions, SP and XP GlideScore, to rank-order compounds. Three modes of sampling ligand conformational and positional degrees of freedom are available to determine the optimal ligand orientation relative to a rigid protein receptor geometry.
While it may seem intuitive that using an ensemble of multiple conformations of a receptor in structure-based virtual screening experiments would necessarily yield improved enrichment of actives relative to using just a single receptor, it turns out that at least in the p38 MAP kinase model system studied here, a very large majority of all possible ensembles do not yield improved enrichment of actives. However, there are combinations of receptor structures that do lead to improved enrichment results. We present here a method to select the ensembles that produce the best enrichments that does not rely on knowledge of active compounds or sophisticated analyses of the 3D receptor structures.
A novel scoring function to estimate protein-ligand binding affinities has been developed and implemented as the Glide 4.0 XP scoring function and docking protocol. In addition to unique water desolvation energy terms, protein-ligand structural motifs leading to enhanced binding affinity are included: (1) hydrophobic enclosure where groups of lipophilic ligand atoms are enclosed on opposite faces by lipophilic protein atoms, (2) neutral-neutral single or correlated hydrogen bonds in a hydrophobically enclosed environment, and (3) five categories of charged-charged hydrogen bonds.
We provide an overview of the IMPACT molecular mechanics program with an emphasis on recent developments and a description of its current functionality. With respect to core molecular mechanics technologies we include a status report for the fixed charge and polarizable force fields that can be used with the program and illustrate how the force fields, when used together with new atom typing and parameter assignment modules, have greatly expanded the coverage of organic compounds and medicinally relevant ligands. As we discuss in this review, explicit solvent simulations have been used to guide our design of implicit solvent models based on the generalized Born framework and a novel nonpolar estimator that have recently been incorporated into the program.
Unlike other methods for docking ligands to the rigid 3D structure of a known protein receptor, Glide approximates a complete systematic search of the conformational, orientational, and positional space of the docked ligand. In this search, an initial rough positioning and scoring phase that dramatically narrows the search space is followed by torsionally flexible energy optimization on an OPLS-AA nonbonded potential grid for a few hundred surviving candidate poses. The very best candidates are further refined via a Monte Carlo sampling of pose conformation; in some cases, this is crucial to obtaining an accurate docked pose.
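The staged search described above follows a general funnel pattern: score everything cheaply, keep a small set of survivors, then spend the expensive refinement only on those. The sketch below shows that pattern in generic form. It is a toy illustration, not Glide's implementation; `docking_funnel`, its callable parameters, and the default `keep=300` (a stand-in for "a few hundred") are assumptions.

```python
from typing import Callable, List, TypeVar

Pose = TypeVar("Pose")


def docking_funnel(
    poses: List[Pose],
    rough_score: Callable[[Pose], float],
    refine: Callable[[Pose], Pose],
    final_score: Callable[[Pose], float],
    keep: int = 300,
) -> Pose:
    """Return the best pose after a cheap screen and a costly refinement.

    Lower scores are better. `keep` caps how many poses survive the cheap
    screen; 300 is a placeholder for "a few hundred".
    """
    if not poses:
        raise ValueError("no candidate poses supplied")
    # Stage 1: rank every candidate with the inexpensive score.
    survivors = sorted(poses, key=rough_score)[:keep]
    # Stage 2: apply the expensive refinement only to the survivors.
    refined = [refine(p) for p in survivors]
    # Stage 3: pick the best refined pose with the final score.
    return min(refined, key=final_score)
```

The design trades completeness for cost: a pose discarded by the rough score is never refined, so the cheap stage has to be permissive enough not to lose the correct binding mode.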
The new semiempirical methods, PDDG/PM3 and PDDG/MNDO, have been parameterized for halogens. For comparison, the original MNDO and PM3 were also reoptimized for the halogens using the same training set; these modified methods are referred to as MNDO' and PM3'. For 442 halogen-containing molecules, the smallest mean absolute error (MAE) in heats of formation is obtained with PDDG/PM3 (5.
In this article a wide variety of computational approaches (molecular mechanics force fields, semiempirical formalisms, and hybrid methods, namely ONIOM calculations) have been used to calculate the energy and geometry of the supramolecular system 2-(2'-hydroxyphenyl)-4-methyloxazole (HPMO) encapsulated in beta-cyclodextrin (beta-CD). The main objective of the present study has been to examine the performance of these computational methods when describing the short range H···H intermolecular interactions between guest (HPMO) and host (beta-CD) molecules.
The rate enhancement provided by the chorismate mutase (CM) enzyme for the Claisen rearrangement of chorismate to prephenate has been investigated by application of the concept of near attack conformations (NACs). Using a combined QM/MM Monte Carlo/free-energy perturbation (MC/FEP) method, 82% and 100% of chorismate conformers were found to be NAC structures in water and in the CM active site, respectively. Consequently, the conversion of non-NACs to NACs does not contribute to the free energy of activation from preorganization of the substrate into NACs.
Solvent effects on the rate of the Claisen rearrangement of chorismate to prephenate have been examined in water and methanol. The preequilibrium free-energy differences between diaxial and diequatorial conformers of chorismate, which had previously been implicated as the sole basis for the observed 100-fold rate increase in water over methanol, have been reframed using the near attack conformation (NAC) concept of Bruice and co-workers. Using a combined QM/MM Monte Carlo/free-energy perturbation (MC/FEP) method, 82%, 57%, and 1% of chorismate conformers were found to be NAC structures (NACs) in water, methanol, and the gas phase, respectively.
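To put the NAC populations in the abstract above on an energy scale, a population fraction f can be converted into a free-energy cost with the standard relation ΔG = -RT ln f. The sketch below does that arithmetic; the temperature of 298.15 K and the function name `nac_free_energy` are assumptions, and the calculation is not the MC/FEP procedure used in the study.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def nac_free_energy(fraction: float, temperature: float = 298.15) -> float:
    """Free-energy cost (kcal/mol) of reaching a NAC: -RT ln(fraction)."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    return -R_KCAL * temperature * math.log(fraction)


if __name__ == "__main__":
    # NAC populations quoted in the abstract above.
    for medium, frac in [("water", 0.82), ("methanol", 0.57), ("gas phase", 0.01)]:
        print(f"{medium:>9}: {nac_free_energy(frac):.2f} kcal/mol")
    # Free-energy equivalent of a 100-fold rate ratio, for scale.
    print(f"100-fold rate: {R_KCAL * 298.15 * math.log(100.0):.2f} kcal/mol")
```

Under these assumptions the water and methanol NAC populations differ by roughly 0.2 kcal/mol, while a 100-fold rate ratio corresponds to about 2.7 kcal/mol.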
Two new semiempirical methods employing a Pairwise Distance Directed Gaussian modification have been developed: PDDG/PM3 and PDDG/MNDO; they are easily implemented in existing software, and yield heats of formation for compounds containing C, H, N, and O atoms with significantly improved accuracy over the standard NDDO schemes, PM5, PM3, AM1, and MNDO. The PDDG/PM3 results for heats of formation also show substantial improvement over density functional theory with large basis sets. The PDDG modifications consist of a single function, which is added to the existing pairwise core repulsion functions within PM3 and MNDO, a reparameterized semiempirical parameter set, and modified computation of the energy of formation of a gaseous atom.
Deficiencies in energetics obtained using the common semiempirical methods, AM1, PM3, and MNDO, may partly be traced to the use of pseudoatomic equivalents for conversion of molecular energies to heats of formation at 298 K. We present an alternative scheme based on the use of bond and group equivalents. Values for the 61 bond and group equivalents necessary for treatment of molecules containing the common organic elements, hydrogen, carbon, nitrogen, and oxygen have been derived.