Partial least squares-discriminant analysis (PLS-DA) is a versatile algorithm that can be used for predictive and descriptive modelling as well as for discriminative variable selection. However, versatility is both a blessing and a curse and the user needs to optimize a wealth of parameters before reaching reliable and valid outcomes. Over the past two decades, PLS-DA has demonstrated great success in modelling high-dimensional datasets for diverse purposes, e.g. product authentication in food analysis, diseases classification in medical diagnosis, and evidence analysis in forensic science. Despite that, in practice, many users have yet to grasp the essence of constructing a valid and reliable PLS-DA model. As the technology progresses, across every discipline, datasets are evolving into a more complex form, i.e. multi-class, imbalanced and colossal. Indeed, the community is welcoming a new era called big data. In this context, the aim of the article is two-fold: (a) to review, outline and describe the contemporary PLS-DA modelling practice strategies, and (b) to critically discuss the respective knowledge gaps that have emerged in response to the present big data era. This work could complement other available reviews or tutorials on PLS-DA, to provide a timely and user-friendly guide to researchers, especially those working in applied research.

Download full-text PDF

Source
http://dx.doi.org/10.1039/c8an00599kDOI Listing

Publication Analysis

Top Keywords

partial squares-discriminant
8
squares-discriminant analysis
8
analysis pls-da
8
practice strategies
8
knowledge gaps
8
big data
8
pls-da
6
analysis
4
pls-da classification
4
classification high-dimensional
4

Similar Publications

This work deals with the development of a greener RP-HPLC method and chemical pattern recognition for the identification of L. collected from different natural sources and samples traded as '' in Indian herbal drug markets. The simultaneous quantification of α- and β-asarone was performed using 0.

View Article and Find Full Text PDF

: Obstructive Sleep Apnea (OSA) is a prevalent sleep disorder characterized by intermittent upper airway obstruction, leading to significant health consequences. Traditional diagnostic methods, such as polysomnography, are time-consuming and resource-intensive. : This study explores the potential of proton-transfer-reaction mass spectrometry (PTR-MS) in identifying volatile organic compound (VOC) biomarkers for the non-invasive detection of OSA.

View Article and Find Full Text PDF

This research examined the distinction between organic and conventional mango fruits, chips, and juice using portable near-infrared (NIR) spectroscopy. A comprehensive analysis was conducted on a sample of 100 mangoes (comprising 50 organic and 50 conventional) utilising a portable NIR spectrometer that spans a wavelength range from 900 to 1700 nm. The mangoes were assessed in their entirety and their juice and chip forms.

View Article and Find Full Text PDF

The popularity of roasted pork among Chinese consumers is largely attributed to its rich aroma profile. However, the suitability of different pork species for roasting remains uncertain. In this study, the effect of various pork species on the aroma profiles of roasted pork was systematically investigated using gas chromatography-olfactometry-mass spectrometry (GC-O-MS).

View Article and Find Full Text PDF

To investigate the impact of genetic factors on wine aroma, wines made from 22 clones of five grape varieties ( L.) were used to analyze the volatile compounds by headspace solid phase microextraction gas chromatography mass spectrometer (HS-SPME-GC-MS) and headspace gas chromatography-ion mobility spectrometry (HS-GC-IMS). Results showed that 52 and 49 aroma compounds were identified from 22 clones of wines by two technologies, respectively.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!