Identifying bioactive conformations of small molecules is an essential process for virtual screening applications relying on three-dimensional structure such as molecular docking. For most small molecules, conformer generators retrieve at least one bioactive-like conformation, with an atomic root-mean-square deviation (ARMSD) lower than 1 Å, among the set of low-energy conformers generated. However, there is currently no general method to prioritise these likely target-bound conformations in the ensemble. In this work, we trained atomistic neural networks (AtNNs) on 3D information of generated conformers of a curated subset of PDBbind ligands to predict the ARMSD to their closest bioactive conformation, and evaluated the early enrichment of bioactive-like conformations when ranking conformers by AtNN prediction. AtNN ranking was compared with bioactivity-unaware baselines such as ascending Sage force field energy ranking, and a slower bioactivity-based baseline ranking by ascending Torsion Fingerprint Deviation to the Maximum Common Substructure to the most similar molecule in the training set (TFD2SimRefMCS). On test sets from random ligand splits of PDBbind, ranking conformers using ComENet, the AtNN encoding the most 3D information, leads to early enrichment of bioactive-like conformations with a median BEDROC of 0.29 ± 0.02, outperforming the best bioactivity-unaware Sage energy ranking baseline (median BEDROC of 0.18 ± 0.02), and performing on a par with the bioactivity-based TFD2SimRefMCS baseline (median BEDROC of 0.31 ± 0.02). The improved performance of the AtNN and TFD2SimRefMCS baseline is mostly observed on test set ligands that bind proteins similar to proteins observed in the training set. On a more challenging subset of flexible molecules, the bioactivity-unaware baselines showed median BEDROCs up to 0.02, while AtNNs and TFD2SimRefMCS showed median BEDROCs between 0.09 and 0.13. When performing rigid ligand re-docking of PDBbind ligands with GOLD using the 1% top-ranked conformers, ComENet ranked conformers showed a higher successful docking rate than bioactivity-unaware baselines, with a rate of 0.48 ± 0.02 compared to CSD probability baseline with a rate of 0.39 ± 0.02. Similarly, on a pharmacophore searching experiment, selecting the 20% top-ranked conformers ranked by ComENet showed higher hit rate compared to baselines. Hence, the approach presented here uses AtNNs successfully to focus conformer ensembles towards bioactive-like conformations, representing an opportunity to reduce computational expense in virtual screening applications on known targets that require input conformations.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10740246 | PMC |
http://dx.doi.org/10.1186/s13321-023-00794-w | DOI Listing |
J Cheminform
December 2023
Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Rd, Cambridge, CB2 1EW, UK.
Identifying bioactive conformations of small molecules is an essential process for virtual screening applications relying on three-dimensional structure such as molecular docking. For most small molecules, conformer generators retrieve at least one bioactive-like conformation, with an atomic root-mean-square deviation (ARMSD) lower than 1 Å, among the set of low-energy conformers generated. However, there is currently no general method to prioritise these likely target-bound conformations in the ensemble.
View Article and Find Full Text PDFExpert Opin Drug Discov
October 2010
15 HaGeula St., Hod Hasharon, 45272, Israel
Importance Of The Field: In silico or virtual screening has become a common practice in contemporary computer-aided drug discovery efforts and currently constitutes a reasonably mature paradigm. Application of ligand-based approaches to virtual screening requires the ability to identify the bioactive conformers of drug-like compounds as these conformers are expected to elicit the biological activity. However, given the complexity of the energy potential surfaces of such ligands and in particular those exhibiting some degree of flexibility and the limitation of contemporary energy functions, this is not an easy task.
View Article and Find Full Text PDFJ Chem Inf Model
November 2009
Epix Pharmaceuticals Ltd., 3 Hayetzira St., Ramat Gan 52521, Israel.
Computational approaches that rely on ligand-based information for lead discovery and optimization are often required to spend considerable resources analyzing compounds with large conformational ensembles. In order to reduce such efforts, we have developed a new filtration tool which reduces the total number of ligand conformations while retaining in the final set a reasonable number of conformations that are similar (rmsd < or = 1 A) to those observed in ligand-protein cocrystals (bioactive-like conformations). Our tool consists of the following steps: (1) Prefiltration aimed at removing ligands for which the probability of finding bioactive-like conformations is low.
View Article and Find Full Text PDFJ Mol Recognit
April 2000
Departament de Química Orgànica, Universitat de Barcelona, Barcelona, Spain.
Antigenic site A of foot-and-mouth disease virus (serotype C) has been reproduced by means of cyclic versions of peptide A15, YTASARGDLAHLTTT, corresponding to residues 136-150 of envelope protein VP1. A structural basis for the design of the cyclic peptides is provided by crystallographic data from complexes between the Fab fragments of anti-site A monoclonal antibodies and A15, in which the bound peptide is folded into a quasi-cyclic pattern. Head-to-tail cyclizations of A15 do not provide peptides of superior antigenicity.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!