Supervised self-organizing maps in drug discovery. 2. Improvements in descriptor selection and model validation.

J Chem Inf Model

Molecular Design Group, Targacept Inc., Winston-Salem, North Carolina 27101-4165, USA.

Published: April 2006

The modeling of nonlinear descriptor-target relationships is a topic of considerable interest in drug discovery. We, herein, continue reporting the use of the self-organizing map-a nonlinear, topology-preserving pattern recognition technique that exhibits considerable promise in modeling and decoding these relationships. Since simulated annealing is an efficient tool for solving optimization problems, we combined the supervised self-organizing map with simulated annealing to build high-quality, highly predictive quantitative structure-activity/property relationship models. This technique was applied to six data sets representing a variety of biological endpoints. Since a high statistical correlation in the training set does not indicate a highly predictive model, the quality of all the models was confirmed by withholding a portion of each data set for external validation. Finally, we introduce new cross-validation and dynamic partitioning techniques to address model overfitting and assessment.

Download full-text PDF

Source
http://dx.doi.org/10.1021/ci0500841DOI Listing

Publication Analysis

Top Keywords

supervised self-organizing
8
drug discovery
8
simulated annealing
8
highly predictive
8
self-organizing maps
4
maps drug
4
discovery improvements
4
improvements descriptor
4
descriptor selection
4
selection model
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!