Validation subset selections for extrapolation oriented QSPAR models.

Mol Divers

Cooperative Research Center, Semmelweis University, Pf 131, Budapest 5, Hungary, 1367.

Published: August 2004

One of the most important features of QSPAR models is their predictive ability. The predictive ability of QSPAR models should be checked by external validation. In this work we examined three different types of external validation set selection methods for their usefulness in in-silico screening. The usefulness of the selection methods was studied in such a way that: 1) We generated thousands of QSPR models and stored them in 'model banks'. 2) We selected a final top model from the model banks based on three different validation set selection methods. 3) We predicted large data sets, which we called 'chemical universe sets', and calculated the corresponding SEPs. The models were generated from small fractions of the available water solubility data during a GA Variable Subset Selection procedure. The external validation sets were constructed by random selections, uniformly distributed selections or by perimeter-oriented selections. We found that the best performing models on the perimeter-oriented external validation sets usually gave the best validation results when the remaining part of the available data was overwhelmingly large, i.e., when the model had to make a lot of extrapolations. We also compared the top final models obtained from external validation set selection methods in three independent and different sizes of 'chemical universe sets'.

Download full-text PDF

Source
http://dx.doi.org/10.1023/b:modi.0000006538.99122.00DOI Listing

Publication Analysis

Top Keywords

external validation
20
selection methods
16
qspar models
12
validation set
12
set selection
12
validation
8
predictive ability
8
'chemical universe
8
universe sets'
8
validation sets
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!