Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning.

J Chem Inf Model

Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California C.P. 22860, Mexico.

Published: November 2021

One of the main challenges of structure-based virtual screening (SBVS) is the incorporation of the receptor's flexibility, as its explicit representation in every docking run implies a high computational cost. Therefore, a common alternative to include the receptor's flexibility is the approach known as ensemble docking. Ensemble docking consists of using a set of receptor conformations and performing the docking assays over each of them. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies exhibit slight improvement regarding the single-structure approach. Here, we claim that using machine learning (ML) methodologies over the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as study cases: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best performance case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML to improve the ensemble docking performance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8865842PMC
http://dx.doi.org/10.1021/acs.jcim.1c00511DOI Listing

Publication Analysis

Top Keywords

ensemble docking
32
docking
11
structure-based virtual
8
virtual screening
8
ensemble
8
machine learning
8
receptor's flexibility
8
consensus strategies
8
improving structure-based
4
screening ensemble
4

Similar Publications

Preparation, separation and identification of novel hypocholesterolemic peptides from wheat germ: An in vitro and in silico study.

Food Chem

December 2024

Engineering and Technology Center for Grain Processing of Shandong Province, Key Laboratory of Food Nutrition and Healthy in Universities of Shandong, Laboratory of Food Processing Technology and Quality Control in Shandong Province, College of Food Science and Engineering, Shandong Agricultural University, 61 Daizong Avenue, Tai'an 271018, China. Electronic address:

The aim of this study was to prepare, isolate, and identify hypocholesterolemic peptides from wheat germ protein and explore their efficacy. Wheat germ protein was hydrolyzed using four commercial enzymes. Hydrolysate, with the highest in vitro hypocholesterolemic activity was isolated using ultrafiltration and macroporous resin.

View Article and Find Full Text PDF

Saccharomyces cerevisiae meiosis-specific Hop1, a structural constituent of the synaptonemal complex, also facilitates the formation of programmed DNA double-strand breaks and the pairing of homologous chromosomes. Here, we reveal a serendipitous discovery that Hop1 possesses robust DNA-independent ATPase activity, although it lacks recognizable sequence motifs required for ATP binding and hydrolysis. By leveraging molecular docking combined with molecular dynamics simulations and biochemical assays, we identified an ensemble of five amino acid residues in Hop1 that could potentially participate in ATP-binding and hydrolysis.

View Article and Find Full Text PDF

Homology modeling of the orthoflavivirus NS1 protein for virtual screening of potential ligands.

Biomed Khim

December 2024

Chumakov Federal Scientific Center for Research and Development of Immune-and-Biological Products of Russian Academy of Sciences (Institute of Poliomyelitis), Moscow, Russia; Institute of Translational Medicine and Biotechnology, Sechenov First Moscow State Medical University, Moscow, Russia.

The orthoflavivirus NS1 protein is a relatively understudied target for the design of broad-spectrum anti-orthoflaviviral drugs. Currently, the NS1 protein structures of tick-borne orthoflaviviruses have not been published yet, but these structures can be modelled by homology, thus generating a large amount of structural data. We performed homology modelling of the NS1 protein structures of epidemiologically significant orthoflaviviruses and analysed the possibility of using these models in ensemble docking-based virtual screening.

View Article and Find Full Text PDF

Anxiety disorders are one of the most common mental health pathologies in the world. They require searc h and development of novel effective pharmacologically active substances. Thus, the development of new approaches to the search for anxiolytic substances by artificial intelligence methods is an important area of modern bioinformatics and pharmacology.

View Article and Find Full Text PDF

In virtual drug screening, consensus docking is a standard in-silico approach consisting of a combined result from optimized docking experiments, a minimum of two results combination. Therefore, consensus docking is subjected to a lower success rate than the best docking method due to its mathematical nature, an unavoidable limitation. This study aims to overcome this drawback via random forest, an ensemble machine learning model.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!