Random forest as one-class classifier and infrared spectroscopy for food adulteration detection.

Food Chem

Institute of Chemistry, University of Campinas, 13084-971 Campinas, SP, Brazil. Electronic address:

Published: September 2019

This paper proposes the use of random forest for adulteration detection purposes, combining the random forest algorithm with the artificial generation of outliers from the authentic samples. This proposal was applied in two food adulteration studies: evening primrose oils using ATR-FTIR spectroscopy and ground nutmeg using NIR diffuse reflectance spectroscopy. The primrose oil was adulterated with soybean, corn and sunflower oils, and the model was validated using these adulterated oils and other different oils, such as rosehip and andiroba, in pure and adulterated forms. The ground nutmeg was adulterated with cumin, commercial monosodium glutamate, soil, roasted coffee husks and wood sawdust. For the primrose oil, the proposed method presented superior performance than PLS-DA and similar performance to SIMCA and for the ground nutmeg, the random forest was superior to PLS-DA and SIMCA. Also, in both applications using the random forest, no sample was excluded from the external validation set.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.foodchem.2019.04.073DOI Listing

Publication Analysis

Top Keywords

random forest
20
ground nutmeg
12
food adulteration
8
adulteration detection
8
primrose oil
8
random
5
forest one-class
4
one-class classifier
4
classifier infrared
4
infrared spectroscopy
4

Similar Publications

Background: Major depressive disorder (MDD) comes along with an increased risk of recurrence and poor course of illness. Machine learning has recently shown promise in the prediction of mental illness, yet models aiming to predict MDD course are still rare and do not quantify the predictive value of established MDD recurrence risk factors.

Methods: We analyzed N = 571 MDD patients from the Marburg-Münster Affective Disorder Cohort Study (MACS).

View Article and Find Full Text PDF

Integrating machine learning, suspect and nontarget screening reveal the interpretable fates of micropollutants and their transformation products in sludge.

J Hazard Mater

January 2025

School of Environmental Studies, China University of Geosciences, Wuhan, Hubei 430074, China; National Engineering Research Center of Industrial Wastewater Detoxication and Resource Recovery, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China. Electronic address:

Activated sludge enriches vast amounts of micropollutants (MPs) when wastewater is treated, posing potential environmental risks. While standard methods typically focus on target analysis of known compounds, the identity, structure, and concentration of transformation products (TPs) of MPs remain less understood. Here, we employed a novel approach that integrates machine learning for the quantification of nontarget TPs with advanced target, suspect, and nontarget screening strategies.

View Article and Find Full Text PDF

Plastic waste management is one of the key issues in global environmental protection. Integrating spectroscopy acquisition devices with deep learning algorithms has emerged as an effective method for rapid plastic classification. However, the challenges in collecting plastic samples and spectroscopy data have resulted in a limited number of data samples and an incomplete comparison of relevant classification algorithms.

View Article and Find Full Text PDF

IntroductionAsthma attacks are set off by triggers such as pollutants from the environment, respiratory viruses, physical activity and allergens. The aim of this research is to create a machine learning model using data from mobile health technology to predict and appropriately warn a patient to avoid such triggers.MethodsLightweight machine learning models, XGBoost, Random Forest, and LightGBM were trained and tested on cleaned asthma data with a 70-30 train-test split.

View Article and Find Full Text PDF

Purpose: The incidence of cancer, which is a serious public health concern, is increasing. A predictive analysis driven by machine learning was integrated with haematology parameters to create a method for the simultaneous diagnosis of several malignancies at different stages.

Patients And Methods: We analysed a newly collected dataset from various hospitals in Jordan comprising 19,537 laboratory reports (6,280 cancer and 13,257 noncancer cases).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!