Publications by authors named "Garrett M Morris"

Extended-connectivity fingerprints (ECFPs) are a ubiquitous tool in current cheminformatics and molecular machine learning, and one of the most prevalent molecular feature extraction techniques used for chemical prediction. Atom features learned by graph neural networks can be aggregated to compound-level representations using a large spectrum of graph pooling methods. In contrast, sets of detected ECFP substructures are by default transformed into bit vectors using only a simple hash-based folding procedure.

View Article and Find Full Text PDF

A novel class of protein misfolding characterized by either the formation of non-native noncovalent lasso entanglements in the misfolded structure or loss of native entanglements has been predicted to exist and found circumstantial support through biochemical assays and limited-proteolysis mass spectrometry data. Here, we examine whether it is possible to design small molecule compounds that can bind to specific folding intermediates and thereby avoid these misfolded states in computer simulations under idealized conditions (perfect drug-binding specificity, zero promiscuity, and a smooth energy landscape). Studying two proteins, type III chloramphenicol acetyltransferase (CAT-III) and D-alanyl-D-alanine ligase B (DDLB), that were previously suggested to form soluble misfolded states through a mechanism involving a failure-to-form of native entanglements, we explore two different drug design strategies using coarse-grained structure-based models.

View Article and Find Full Text PDF

The last few years have seen the development of numerous deep learning-based protein-ligand docking methods. They offer huge promise in terms of speed and accuracy. However, despite claims of state-of-the-art performance in terms of crystallographic root-mean-square deviation (RMSD), upon closer inspection, it has become apparent that they often produce physically implausible molecular structures.

View Article and Find Full Text PDF
Article Synopsis
  • - The COVID Moonshot was a collaborative, open-science effort focused on finding a new drug to inhibit the SARS-CoV-2 main protease, which is crucial for the virus's survival.
  • - Researchers developed a novel noncovalent, nonpeptidic inhibitor that stands out from existing drugs targeting the same protease, employing advanced techniques like machine learning and high-throughput structural biology.
  • - Over 18,000 compound designs, 490 ligand-bound x-ray structures, and extensive assay data were generated and shared openly, creating a comprehensive and accessible knowledge base for future drug discovery efforts against coronaviruses.
View Article and Find Full Text PDF

CC and CXC-chemokines are the primary drivers of chemotaxis in inflammation, but chemokine network redundancy thwarts pharmacological intervention. Tick evasins promiscuously bind CC and CXC-chemokines, overcoming redundancy. Here we show that short peptides that promiscuously bind both chemokine classes can be identified from evasins by phage-display screening performed with multiple chemokines in parallel.

View Article and Find Full Text PDF

Introduction And Methodology: Pairs of similar compounds that only differ by a small structural modification but exhibit a large difference in their binding affinity for a given target are known as activity cliffs (ACs). It has been hypothesised that QSAR models struggle to predict ACs and that ACs thus form a major source of prediction error. However, the AC-prediction power of modern QSAR methods and its quantitative relationship to general QSAR-prediction performance is still underexplored.

View Article and Find Full Text PDF

The rapid and accurate prediction of protein-ligand binding free energies or binding affinities has the potential to transform drug discovery. In recent years, there has been a rapid growth of interest in deep learning methods for the prediction of protein-ligand binding affinities based on the structural information of protein-ligand complexes. These structure-based scoring functions often obtain better results than classical scoring functions when applied within their applicability domain.

View Article and Find Full Text PDF

TRIM33 is a member of the tripartite motif (TRIM) family of proteins, some of which possess E3 ligase activity and are involved in the ubiquitin-dependent degradation of proteins. Four of the TRIM family proteins, TRIM24 (TIF1α), TRIM28 (TIF1β), TRIM33 (TIF1γ) and TRIM66, contain C-terminal plant homeodomain (PHD) and bromodomain (BRD) modules, which bind to methylated lysine (KMe) and acetylated lysine (KAc), respectively. Here we investigate the differences between the two isoforms of TRIM33, TRIM33α and TRIM33β, using structural and biophysical approaches.

View Article and Find Full Text PDF

Drug resistance caused by mutations is a public health threat for existing and emerging viral diseases. A wealth of evidence about these mutations and their clinically associated phenotypes is scattered across the literature, but a comprehensive perspective is usually lacking. This work aimed to produce a clinically relevant view for the case of Hepatitis B virus (HBV) mutations by combining a chronic HBV clinical study with a compendium of genetic mutations systematically gathered from the scientific literature.

View Article and Find Full Text PDF

The SARS-CoV-2 coronavirus is the causal agent of the current global pandemic. SARS-CoV-2 belongs to an order, Nidovirales, with very large RNA genomes. It is proposed that the fidelity of coronavirus (CoV) genome replication is aided by an RNA nuclease complex, comprising the non-structural proteins 14 and 10 (nsp14-nsp10), an attractive target for antiviral inhibition.

View Article and Find Full Text PDF

The main protease (M) of SARS-CoV-2 is central to viral maturation and is a promising drug target, but little is known about structural aspects of how it binds to its 11 natural cleavage sites. We used biophysical and crystallographic data and an array of biomolecular simulation techniques, including automated docking, molecular dynamics (MD) and interactive MD in virtual reality, QM/MM, and linear-scaling DFT, to investigate the molecular features underlying recognition of the natural M substrates. We extensively analysed the subsite interactions of modelled 11-residue cleavage site peptides, crystallographic ligands, and docked COVID Moonshot-designed covalent inhibitors.

View Article and Find Full Text PDF

Machine learning scoring functions for protein-ligand binding affinity have been found to consistently outperform classical scoring functions when trained and tested on crystal structures of bound protein-ligand complexes. However, it is less clear how these methods perform when applied to docked poses of complexes. We explore how the use of docked rather than crystallographic poses for both training and testing affects the performance of machine learning scoring functions.

View Article and Find Full Text PDF

Scoring functions for the prediction of protein-ligand binding affinity have seen renewed interest in recent years when novel machine learning and deep learning methods started to consistently outperform classical scoring functions. Here we explore the use of atomic environment vectors (AEVs) and feed-forward neural networks, the building blocks of several neural network potentials, for the prediction of protein-ligand binding affinity. The AEV-based scoring function, which we term AEScore, is shown to perform as well or better than other state-of-the-art scoring functions on binding affinity prediction, with an RMSE of 1.

View Article and Find Full Text PDF

The calculation of the entropy of flexible molecules can be challenging, since the number of possible conformers can grow exponentially with molecule size and many low-energy conformers may be thermally accessible. Different methods have been proposed to approximate the contribution of conformational entropy to the molecular standard entropy, including performing thermochemistry calculations with all possible stable conformations and developing empirical corrections from experimental data. We have performed conformer sampling on over 120,000 small molecules generating some 12 million conformers, to develop models to predict conformational entropy across a wide range of molecules.

View Article and Find Full Text PDF

The geometry of a molecule plays a significant role in determining its physical and chemical properties. Despite its importance, there are relatively few studies on ring puckering and conformations, often focused on small cycloalkanes, 5- and 6-membered carbohydrate rings, and specific macrocycle families. We lack a general understanding of the puckering preferences of medium-sized rings and macrocycles.

View Article and Find Full Text PDF

A key challenge in conformer sampling is finding low-energy conformations with a small number of energy evaluations. We recently demonstrated the Bayesian Optimization Algorithm (BOA) is an effective method for finding the lowest energy conformation of a small molecule. Our approach balances between exploitation and exploration, and is more efficient than exhaustive or random search methods.

View Article and Find Full Text PDF

Motivation: Machine learning scoring functions for protein-ligand binding affinity prediction have been found to consistently outperform classical scoring functions. Structure-based scoring functions for universal affinity prediction typically use features describing interactions derived from the protein-ligand complex, with limited information about the chemical or topological properties of the ligand itself.

Results: We demonstrate that the performance of machine learning scoring functions are consistently improved by the inclusion of diverse ligand-based features.

View Article and Find Full Text PDF

We present Ligity, a hybrid ligand-structure-based, non-superpositional method for virtual screening of large databases of small molecules. Ligity uses the relative spatial distribution of pharmacophoric interaction points (PIPs) derived from the conformations of small molecules. These are compared with the PIPs derived from key interaction features found in protein-ligand complexes and are used to prioritize likely binders.

View Article and Find Full Text PDF

Generating low-energy molecular conformers is a key task for many areas of computational chemistry, molecular modeling and cheminformatics. Most current conformer generation methods primarily focus on generating geometrically diverse conformers rather than finding the most probable or energetically lowest minima. Here, we present a new stochastic search method called the Bayesian optimization algorithm (BOA) for finding the lowest energy conformation of a given molecule.

View Article and Find Full Text PDF

Novel drugs to treat tuberculosis are required and the identification of potential targets is important. Piperidinols have been identified as potential antimycobacterial agents (MIC < 5 μg/mL), which also inhibit mycobacterial arylamine N-acetyltransferase (NAT), an enzyme essential for mycobacterial survival inside macrophages. The NAT inhibition involves a prodrug-like mechanism in which activation leads to the formation of bioactive phenyl vinyl ketone (PVK).

View Article and Find Full Text PDF

The potassium efflux system, Kef, protects bacteria against the detrimental effects of electrophilic compounds via acidification of the cytoplasm. Kef is inhibited by glutathione (GSH) but activated by glutathione-S-conjugates (GS-X) formed in the presence of electrophiles. GSH and GS-X bind to overlapping sites on Kef, which are located in a cytosolic regulatory domain.

View Article and Find Full Text PDF

A major goal in computational chemistry has been to discover the set of rules that can accurately predict the binding affinity of any protein-drug complex, using only a single snapshot of its three-dimensional structure. Despite the continual development of structure-based models, predictive accuracy remains low, and the fundamental factors that inhibit the inference of all-encompassing rules have yet to be fully explored. Using statistical learning theory and information theory, here we prove that even the very best generalized structure-based model is inherently limited in its accuracy, and protein-specific models are always likely to be better.

View Article and Find Full Text PDF

There is a growing recognition of the importance of cloud computing for large-scale and data-intensive applications. The distinguishing features of cloud computing and their relationship to other distributed computing paradigms are described, as are the strengths and weaknesses of the approach. We review the use made to date of cloud computing for molecular modelling projects and the availability of front ends for molecular modelling applications.

View Article and Find Full Text PDF
Article Synopsis
  • The research introduces a new method to improve the identification of protein targets for small molecules (drugs), addressing limitations of existing methods by considering 3D structures of both the drugs and their targets.
  • The method, called ElectroShape, quickly compares the shapes and charge distributions of drugs to predict their on-target activities, potential off-targets, and side effects using data from the DrugBank database.
  • An open-access web tool has been created for users to explore the relationships between drugs and their targets, allowing for the prediction of effects for new compounds and aiming to raise awareness of these interactions in pharmacology.
View Article and Find Full Text PDF

The use of computer-aided structure-based drug design prior to synthesis has proven to be generally valuable in suggesting improved binding analogues of existing ligands. Here we describe the application of the program AutoDock to the design of a focused library that was used in the "click chemistry in-situ" generation of the most potent noncovalent inhibitor of the native enzyme acetylcholinesterase (AChE) yet developed (K(d) = ~100 fM). AutoDock version 3.

View Article and Find Full Text PDF