A benchmark of several popular methods, Associative Neural Networks (ANN), Support Vector Machines (SVM), k Nearest Neighbors (kNN), Maximal Margin Linear Programming (MMLP), Radial Basis Function Neural Network (RBFNN), and Multiple Linear Regression (MLR), is reported for quantitative-structure property relationships (QSPR) of stability constants logK1 for the 1:1 (M:L) and logbeta2 for 1:2 complexes of metal cations Ag+ and Eu3+ with diverse sets of organic molecules in water at 298 K and ionic strength 0.1 M. The methods were tested on three types of descriptors: molecular descriptors including E-state values, counts of atoms determined for E-state atom types, and substructural molecular fragments (SMF). Comparison of the models was performed using a 5-fold external cross-validation procedure. Robust statistical tests (bootstrap and Kolmogorov-Smirnov statistics) were employed to evaluate the significance of calculated models. The Wilcoxon signed-rank test was used to compare the performance of methods. Individual structure-complexation property models obtained with nonlinear methods demonstrated a significantly better performance than the models built using multilinear regression analysis (MLRA). However, the averaging of several MLRA models based on SMF descriptors provided as good of a prediction as the most efficient nonlinear techniques. Support Vector Machines and Associative Neural Networks contributed in the largest number of significant models. Models based on fragments (SMF descriptors and E-state counts) had higher prediction ability than those based on E-state indices. The use of SMF descriptors and E-state counts provided similar results, whereas E-state indices lead to less significant models. The current study illustrates the difficulties of quantitative comparison of different methods: conclusions based only on one data set without appropriate statistical tests could be wrong.

Download full-text PDF

Source
http://dx.doi.org/10.1021/ci0504216DOI Listing

Publication Analysis

Top Keywords

smf descriptors
12
associative neural
8
neural networks
8
support vector
8
vector machines
8
fragments smf
8
models
8
statistical tests
8
models based
8
descriptors e-state
8

Similar Publications

Background: Approximately 50% of dental amalgam is elemental mercury by weight. Accumulating body of evidence now shows that not only static magnetic fields (SMF) but both ionizing and non-ionizing electromagnetic radiations can increase the rate of mercury release from dental amalgam fillings. Iranian scientists firstly addressed this issue in 2008 but more than 10 years later, it became viral worldwide.

View Article and Find Full Text PDF

Design of phosphoryl containing podands with Li/Na selectivity using machine learning.

SAR QSAR Environ Res

July 2021

Laboratory of Novel Physicochemical Problems, A.N. Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, Moscow, Russian Federation.

In this work we demonstrated, that machine learning opens a way for real design of ligands with required metal ion selectivity. We performed the ensemble QSPR modelling of the Li/Na complexation selectivity and the stability constants for the LiL and NaL complexes of phosphoryl podands in nonaqueous solvent THF/СНCl (4:1 v/v). The models were built and cross-validated using MLR with the ISIDA QSPR program and SVM with the libSVM package.

View Article and Find Full Text PDF

A comparison of different QSAR approaches to modeling CYP450 1A2 inhibition.

J Chem Inf Model

June 2011

eADMET GmbH, Ingolstädter Landstrasse 1, D-85764 Neuherberg, Germany.

Prediction of CYP450 inhibition activity of small molecules poses an important task due to high risk of drug-drug interactions. CYP1A2 is an important member of CYP450 superfamily and accounts for 15% of total CYP450 presence in human liver. This article compares 80 in-silico QSAR models that were created by following the same procedure with different combinations of descriptors and machine learning methods.

View Article and Find Full Text PDF

A benchmark of several popular methods, Associative Neural Networks (ANN), Support Vector Machines (SVM), k Nearest Neighbors (kNN), Maximal Margin Linear Programming (MMLP), Radial Basis Function Neural Network (RBFNN), and Multiple Linear Regression (MLR), is reported for quantitative-structure property relationships (QSPR) of stability constants logK1 for the 1:1 (M:L) and logbeta2 for 1:2 complexes of metal cations Ag+ and Eu3+ with diverse sets of organic molecules in water at 298 K and ionic strength 0.1 M. The methods were tested on three types of descriptors: molecular descriptors including E-state values, counts of atoms determined for E-state atom types, and substructural molecular fragments (SMF).

View Article and Find Full Text PDF

"In silico" design of potential anti-HIV actives using fragment descriptors.

Comb Chem High Throughput Screen

August 2005

Laboratoire d'Infochimie, UMR 7551 CNRS, Université Louis Pasteur, 4, rue B. Pascal, Strasbourg, 67000, France.

Substructural Molecular Fragments (SMF) method was applied for computer-aided design of new compounds potentially possessing high anti-HIV activities: tetrahydroimidazobenzodiazepinone (TIBO) derivatives and 1-[2-hydroxyethoxy)methyl]-6-(phenylthio)thymine (HEPT) derivatives. Using available experimental data, the SMF method was first applied to build QSAR models based on fragment descriptors (atom/bond sequences and "augmented atoms"). The focused virtual combinatorial libraries containing 891 TIBO derivatives and 2640 HEPT derivatives were then generated systematically attaching selected substituents to corresponding Markush structures.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!