A comparison of model performance for six quantitative structure-activity relationship packages that predict acute toxicity to fish.

Dwayne R J Moore Roger L Breton Drew B MacDonald

Environ Toxicol Chem

The Cadmus Group, Suite 204, 411 Roosevelt Avenue, Ottawa, Ontario K2A 3X9, Canada.

Published: August 2003

Some regulatory programs rely on quantitative structure-activity relationship (QSAR) models to predict toxic effects to biota. Many currently existing QSAR models can predict the effects of a wide range of substances to biota, particularly aquatic biota. The difficulty for regulatory programs is in choosing the appropriate QSAR model or models for application in their new and existing substances programs. We evaluated model performance of six QSAR modeling packages: Ecological Structure Activity Relationship (ECOSAR), TOPKAT, a Probabilistic Neural Network (PNN), a Computational Neural Network (CNN), the QSAR components of the Assessment Tools for the Evaluation of Risk (ASTER) system, and the Optimized Approach Based on Structural Indices Set (OASIS) system. Using a testing data set of 130 substances that had not been included in the training data sets of the QSAR models under consideration, we compared model predictions for 96-h median lethal concentrations (LC50s) to fathead minnows to the corresponding measured toxicity values available in the AQUIRE database. The testing data set was heavily weighted with neutral organics of low molecular weight and functionality. Many of the testing data set substances also had a nonpolar narcosis mode of action and/or were chlorinated. A variety of statistical measures (correlation coefficient, slope and intercept from a linear regression analysis, mean absolute and squared difference between log prediction and log measured toxicity, and the percentage of predictions within factors of 2, 5, 10, 100, and 1,000 of measured toxicity values) indicated that the PNN model had the best model performance for the full testing data set of 130 substances. The rank order of the remainder of the models depended on the statistical measure employed. TOPKAT also had excellent model performance for substances within its optimum prediction space. Only 37% of the substances in the testing data set, however, fell within this optimum prediction space.

Download full-text PDF	Source
http://dx.doi.org/10.1897/00-361	DOI Listing

Publication Analysis

Top Keywords

testing data

data set

model performance

qsar models

measured toxicity

quantitative structure-activity

structure-activity relationship

regulatory programs

models predict

neural network

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!