This paper introduces principal component analysis (PCA), partial least squares projections to latent structures (PLS), and statistical molecular design (SMD) as useful tools in deriving multi- and megavariate quantitative structure-activity relationship (QSAR) models. Two QSAR data sets from the fields of environmental toxicology and environmental chemistry are worked out in detail, showing the benefits of PCA, PLS and SMD. PCA is useful when overviewing a data set and exploring relationships among compounds and relationships among variables. PLS is the regression extension of PCA and is used for establishing QSARs. SMD is essential for selecting informative training and test sets of compounds for QSAR calibration and validation.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s11030-006-9024-6DOI Listing

Publication Analysis

Top Keywords

qsar data
8
principal component
8
component analysis
8
analysis pca
8
pca partial
8
partial squares
8
pls statistical
8
statistical molecular
8
molecular design
8
design smd
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!