GA(M)E-QSAR: a novel, fully automatic genetic-algorithm-(meta)-ensembles approach for binary classification in ligand-based drug design.

J Chem Inf Model

Computational Modeling Lab-CoMo, Department of Computer Sciences, Faculty of Sciences, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussel, Belgium.

Published: September 2012

Computer-aided drug design has become an important component of the drug discovery process. Despite advances in this field, no single modeling approach can be successfully applied to the whole range of problems faced in QSAR modeling. Feature selection and ensemble modeling are active areas of research in ligand-based drug design. Here we introduce the GA(M)E-QSAR algorithm, which combines the search and optimization capabilities of Genetic Algorithms with the simplicity of the Adaboost ensemble-based classification algorithm to solve binary classification problems. We also explore the usefulness of Meta-Ensembles trained with Adaboost and Voting schemes to further improve the accuracy, generalization, and robustness of the optimal Adaboost Single Ensemble derived from the Genetic Algorithm optimization. We evaluated the performance of our algorithm on five data sets from the literature and found that it yields classification results similar to or better than those previously reported for these data sets, with a higher enrichment of active compounds relative to the whole actives subset when only the most active chemicals are considered. More importantly, we compared our methodology with state-of-the-art feature selection and classification approaches and found that it can provide highly accurate, robust, and generalizable models. In the case of the Adaboost Ensembles derived from the Genetic Algorithm search, the final models are quite simple, since they consist of a weighted sum of the outputs of single-feature classifiers. Furthermore, the Adaboost scores can be used as a ranking criterion to prioritize chemicals for synthesis and biological evaluation after virtual screening experiments.
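The idea of a genetic algorithm choosing descriptors for an Adaboost ensemble of single-feature classifiers can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the weak learner, the fitness function, or the genetic operators, so the threshold stumps, the training-accuracy fitness, the one-point crossover, the single-bit mutation, the all-descriptors seed individual, the toy data, and every function name below are assumptions made for illustration only.

```python
import math
import random

def fit_stump(X, y, w, features):
    """Return (error, feature, threshold, sign) of the best single-feature rule."""
    best = None
    for j in features:
        for t in sorted({row[j] for row in X}):
            for sign in (1, -1):
                preds = [sign if row[j] >= t else -sign for row in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, sign)
    return best

def adaboost(X, y, features, rounds=5):
    """Discrete AdaBoost over threshold stumps; labels are +1 (active) / -1."""
    w = [1.0 / len(X)] * len(X)
    model = []
    for _ in range(rounds):
        err, j, t, sign = fit_stump(X, y, w, features)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, j, t, sign))
        # Up-weight misclassified compounds, then renormalize.
        w = [wi * math.exp(-alpha * yi * (sign if row[j] >= t else -sign))
             for wi, row, yi in zip(w, X, y)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model

def score(model, row):
    """Weighted sum of stump outputs; can double as a ranking score."""
    return sum(a * (s if row[j] >= t else -s) for a, j, t, s in model)

def accuracy(model, X, y):
    return sum((score(model, r) >= 0) == (yi == 1) for r, yi in zip(X, y)) / len(X)

def ga_select(X, y, n_features, pop_size=8, generations=10, seed=0):
    """Toy GA over descriptor bitmasks (assumed fitness: training accuracy)."""
    rng = random.Random(seed)

    def fitness(mask):
        feats = [j for j, bit in enumerate(mask) if bit]
        return accuracy(adaboost(X, y, feats), X, y) if feats else 0.0

    # Seed one individual with all descriptors switched on (an assumption).
    pop = [[1] * n_features]
    pop += [[rng.randint(0, 1) for _ in range(n_features)]
            for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]      # elitism: keep the top half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)
            child = a[:cut] + b[cut:]       # one-point crossover
            k = rng.randrange(n_features)
            child[k] = 1 - child[k]         # single-bit mutation
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

# Made-up toy data: only descriptor 0 separates actives from inactives.
X = [[0.1, 5, 1], [0.2, 3, 0], [0.3, 4, 1], [0.8, 5, 0], [0.9, 3, 1], [0.7, 4, 0]]
y = [-1, -1, -1, 1, 1, 1]

mask = ga_select(X, y, n_features=3)
selected = [j for j, bit in enumerate(mask) if bit]
model = adaboost(X, y, selected)
ranking = sorted(range(len(X)), key=lambda i: score(model, X[i]), reverse=True)
```

Here `ranking` orders the toy compounds by their ensemble score, mirroring the use of Adaboost scores to prioritize chemicals. A realistic fitness function would rely on cross-validated or held-out performance rather than training accuracy, which overfits.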

Source
http://dx.doi.org/10.1021/ci300146h
