Maximizing the Performance of Similarity-Based Virtual Screening Methods by Generating Synergy from the Integration of 2D and 3D Approaches.

Int J Mol Sci

Center for Bioinformatics (ZBH), Department of Informatics, Faculty of Mathematics, Informatics and Natural Sciences, Universität Hamburg, 20146 Hamburg, Germany.

Published: July 2022

Methods for the pairwise comparison of 2D and 3D molecular structures are established approaches in virtual screening. In this work, we explored three strategies for maximizing the virtual screening performance of these methods: (i) the merging of hit lists obtained from multi-compound screening using a single screening method, (ii) the merging of the hit lists obtained from 2D and 3D screening by parallel selection, and (iii) the combination of both of these strategies in an integrated approach. We found that any of these strategies led to a boost in virtual screening performance, with the clearest advantages observed for the integrated approach. On test sets for virtual screening, covering 50 pharmaceutically relevant proteins, the integrated approach, using sets of five query molecules, yielded, on average, an area under the receiver operating characteristic curve (AUC) of 0.84, an early enrichment among the top 1% of ranked compounds (EF1%) of 53.82 and a scaffold recovery rate among the top 1% of ranked compounds (SRR1%) of 0.50. In comparison, the 2D and 3D methods on their own (when using a single query molecule) yielded AUC values of 0.68 and 0.54, EF1% values of 19.96 and 17.52, and SRR1% values of 0.20 and 0.17, respectively. In conclusion, based on these results, the integration of 2D and 3D methods, via a (balanced) parallel selection strategy, is recommended, and, in particular, when combined with multi-query screening.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9322642PMC
http://dx.doi.org/10.3390/ijms23147747DOI Listing

Publication Analysis

Top Keywords

virtual screening
20
integrated approach
12
screening
9
screening performance
8
merging hit
8
hit lists
8
parallel selection
8
top ranked
8
ranked compounds
8
virtual
5

Similar Publications

Improving Molecular Design with Direct Inverse Analysis of QSAR/QSPR Model.

Mol Inform

January 2025

Department of Applied Chemistry, School of Science and Technology, Meiji University, 1-1-1 Higashi-Mita, Tama-ku, Kawasaki, Kanagawa 214-8571, Japan.

Recent advances in machine learning have significantly impacted molecular design, notably the molecular generation method combining the chemical variational autoencoder (VAE) with Gaussian mixture regression (GMR). In this method, a mathematical model is constructed with X as the latent variable of the molecule and Y as the target properties and activities. Through direct inverse analysis of this model, it is possible to generate molecules with the desired target properties.

View Article and Find Full Text PDF

COX-2 Inhibitor Prediction With KNIME: A Codeless Automated Machine Learning-Based Virtual Screening Workflow.

J Comput Chem

January 2025

Pharmaceutical Chemistry Research Laboratory 1, Department of Pharmaceutical Engineering & Technology, Indian Institute of Technology (Banaras Hindu University), Varanasi, India.

Cyclooxygenase-2 (COX-2) is an enzyme that plays a crucial role in inflammation by converting arachidonic acid into prostaglandins. The overexpression of enzyme is associated with conditions such as cancer, arthritis, and Alzheimer's disease (AD), where it contributes to neuroinflammation. In silico virtual screening is pivotal in early-stage drug discovery; however, the absence of coding or machine learning expertise can impede the development of reliable computational models capable of accurately predicting inhibitor compounds based on their chemical structure.

View Article and Find Full Text PDF

Endometrial cancer is the most prevalent gynecologic cancer in the United States and has rising incidence and mortality. Endometrial intraepithelial neoplasia or atypical endometrial hyperplasia (EIN-AEH), a precancerous neoplasm, is surgically managed with hysterectomy in patients who have completed childbearing because of risk of progression to cancer. Concurrent endometrial carcinoma (EC) is also present on hysterectomy specimens in up to 50% of cases.

View Article and Find Full Text PDF

The fuel system serves as the core component of marine diesel engines, and timely and effective fault diagnosis is the prerequisite for the safe navigation of ships. To address the challenge of current data-driven fault-diagnosis-based methods, which have difficulty in feature extraction and low accuracy under small samples, this paper proposes a fault diagnosis method based on digital twin (DT), Siamese Vision Transformer (SViT), and K-Nearest Neighbor (KNN). Firstly, a diesel engine DT model is constructed by integrating the mathematical, mechanism, and three-dimensional physical models of the Medium-speed diesel engines of 6L21/31 Marine, completing the mapping from physical entity to virtual entity.

View Article and Find Full Text PDF

Eumycetoma, a chronic fungal infection caused by , is a neglected tropical disease characterized by tumor-like growths that can lead to permanent disability and deformities if untreated. Predominantly affecting regions in Africa, South America, and Asia, it imposes significant physical, social, and economic burdens. Current treatments, including antifungal drugs like itraconazole, often show variable efficacy, with severe cases necessitating surgical intervention or amputation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!