Automatic QSAR modeling of ADME properties: blood-brain barrier penetration and aqueous solubility.

J Comput Aided Mol Des

BioFocus DPI Ltd., Darwin Building, Chesterford Research Park, Saffron Walden, CB10 1XL, UK.

Published: August 2008

In this article, we present an automatic model generation process for building QSAR models using Gaussian Processes, a powerful machine learning modeling method. We describe the stages of the process that ensure models are built and validated within a rigorous framework: descriptor calculation, splitting data into training, validation and test sets, descriptor filtering, application of modeling techniques and selection of the best model. We apply this automatic process to data sets of blood-brain barrier penetration and aqueous solubility and compare the resulting automatically generated models with 'manually' built models using external test sets. The results demonstrate the effectiveness of the automatic model generation process for two types of data sets commonly encountered in building ADME QSAR models, a small set of in vivo data and a large set of physico-chemical data.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s10822-008-9193-8DOI Listing

Publication Analysis

Top Keywords

blood-brain barrier
8
barrier penetration
8
penetration aqueous
8
aqueous solubility
8
automatic model
8
model generation
8
generation process
8
qsar models
8
test sets
8
data sets
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!