Multiple Versus Single Set Validation of Multivariate Models to Avoid Mistakes.

Crit Rev Anal Chem

Center for Intelligent Chemical Instrumentation, Ohio University, Clippinger Laboratories, Athens, OH, USA.

Published: January 2018

Validation of multivariate models is of current importance for a wide range of chemical applications. Although important, it is neglected. The common practice is to use a single external validation set for evaluation. This approach is deficient and may mislead investigators with results that are specific to that single validation set. In addition, it yields no statistics on the precision of a derived figure of merit (FOM). A statistical approach using bootstrapped Latin partitions is advocated. This validation method makes efficient use of the data because each object is used once for validation. It was reviewed a decade earlier, but primarily for the optimization of chemometric models; this review presents the reasons it should be used for generalized statistical validation. Average FOMs with confidence intervals are reported, and powerful matched-sample statistics may be applied for comparing models and methods. Examples demonstrate the problems with single validation sets.
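The abstract gives no algorithmic detail, so the following is only a minimal sketch of how bootstrapped Latin partitions might be implemented, not the authors' procedure. It assumes that each bootstrap randomly divides the objects into partitions with class proportions kept roughly equal, that each partition serves once as the validation set, and that the pooled predictions give one FOM (here, classification accuracy) per bootstrap. The function names, the nearest-class-mean classifier, and the normal-approximation confidence interval are all illustrative assumptions.

```python
import numpy as np


def latin_partitions(labels, n_partitions, rng):
    """Assign every object to exactly one partition, class by class,
    so that class proportions are roughly equal across partitions."""
    labels = np.asarray(labels)
    assignment = np.empty(len(labels), dtype=int)
    for cls in np.unique(labels):
        idx = rng.permutation(np.flatnonzero(labels == cls))
        # Deal the shuffled members of this class out to partitions in turn.
        assignment[idx] = np.arange(len(idx)) % n_partitions
    return assignment


def nearest_mean_predict(X_train, y_train, X_test):
    """Hypothetical stand-in model: assign each test object to the class
    whose training mean is closest in Euclidean distance."""
    classes = np.unique(y_train)
    means = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    dist = ((X_test[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
    return classes[dist.argmin(axis=1)]


def bootstrapped_latin_partitions(X, y, n_bootstraps=10, n_partitions=4, seed=0):
    """Return (mean accuracy, CI half-width, per-bootstrap accuracies)."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    accuracies = np.empty(n_bootstraps)
    for b in range(n_bootstraps):
        part = latin_partitions(y, n_partitions, rng)
        pred = np.empty_like(y)
        for p in range(n_partitions):
            test = part == p
            # Train on all other partitions, validate on this one.
            pred[test] = nearest_mean_predict(X[~test], y[~test], X[test])
        # Every object was predicted exactly once; pool into one FOM.
        accuracies[b] = np.mean(pred == y)
    # Assumption: normal-approximation 95% interval; a t quantile would be
    # more appropriate when the number of bootstraps is small.
    half_width = 1.96 * accuracies.std(ddof=1) / np.sqrt(n_bootstraps)
    return accuracies.mean(), half_width, accuracies
```

Because the partitions depend only on the seed and the labels, rerunning the loop with a different model and the same seed would give matched per-bootstrap FOMs, which is what a paired comparison of two models would require.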

Source
http://dx.doi.org/10.1080/10408347.2017.1361314
