In this work, different approaches for variable selection are studied in the context of near-infrared (NIR) multivariate calibration of textile. First, a model-based regression method is proposed. It consists in genetic algorithm optimisation combined with partial least squares regression (GA-PLS). The second approach is a relevance measure of spectral variables based on mutual information (MI), which can be performed independently of any given regression model. As MI makes no assumption on the relationship between X and Y, non-linear methods such as feed-forward artificial neural network (ANN) are thus encouraged for modelling in a prediction context (MI-ANN). GA-PLS and MI-ANN models are developed for NIR quantitative prediction of cotton content in cotton-viscose textile samples. The results are compared to full-spectrum (480 variables) PLS model (FS-PLS). The model requires 11 latent variables and yielded a 3.74% RMS prediction error in the range 0-100%. GA-PLS provides more robust model based on 120 variables and slightly enhanced prediction performance (3.44% RMS error). Considering MI variable selection procedure, great improvement can be obtained as 12 variables only are retained. On the basis of these variables, a 12 inputs ANN model is trained and the corresponding prediction error is 3.43% RMS error.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.aca.2007.03.024DOI Listing

Publication Analysis

Top Keywords

variable selection
12
genetic algorithm
8
algorithm optimisation
8
optimisation combined
8
combined partial
8
partial squares
8
squares regression
8
prediction error
8
rms error
8
variables
6

Similar Publications

Use of the FHTHWA Index as a Novel Approach for Predicting the Incidence of Diabetes in a Japanese Population Without Diabetes: Data Analysis Study.

JMIR Med Inform

January 2025

Department of Endocrinology and Metabolism, The First Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China.

Background: Many tools have been developed to predict the risk of diabetes in a population without diabetes; however, these tools have shortcomings that include the omission of race, inclusion of variables that are not readily available to patients, and low sensitivity or specificity.

Objective: We aimed to develop and validate an easy, systematic index for predicting diabetes risk in the Asian population.

Methods: We collected the data from the NAGALA (NAfld [nonalcoholic fatty liver disease] in the Gifu Area, Longitudinal Analysis) database.

View Article and Find Full Text PDF

Background: Amebiasis represents a significant global health concern. This is especially evident in developing countries, where infections are more common. The primary diagnostic method in laboratories involves the microscopy of stool samples.

View Article and Find Full Text PDF

Background: Fetal growth restriction (FGR) is a leading risk factor for stillbirth, yet the diagnosis of FGR confers considerable prognostic uncertainty, as most infants with FGR do not experience any morbidity. Our objective was to use data from a large, deeply phenotyped observational obstetric cohort to develop a probabilistic graphical model (PGM), a type of "explainable artificial intelligence (AI)", as a potential framework to better understand how interrelated variables contribute to perinatal morbidity risk in FGR.

Methods: Using data from 9,558 pregnancies delivered at ≥ 20 weeks with available outcome data, we derived and validated a PGM using randomly selected sub-cohorts of 80% (n = 7645) and 20% (n = 1,912), respectively, to discriminate cases of FGR resulting in composite perinatal morbidity from those that did not.

View Article and Find Full Text PDF

Background: The diagnosis of depression or anxiety treated by SSRIs has become relatively common in women of childbearing age. However, the impact of gestational SSRI treatment on newborn thyroid function is lacking. We explored the impact of gestational SSRI treatment on newborn thyroid function as measured by the National Newborn Screening (NBS) Program and identified contributory factors.

View Article and Find Full Text PDF

The shifting of buffer crop repertoires in pre-industrial north-eastern Europe.

Sci Rep

January 2025

Department of Archaeology, Faculty of History, Vilnius University, Universiteto St. 7, Vilnius, 01513, Lithuania.

This study explores how major climatic shifts, together with socioeconomic factors over the past two millennia, influenced buffer crop selection, focusing on five crops: rye, millet, buckwheat, oat, and hemp. For this study, we analyzed archaeobotanical data from 135 archaeological contexts and historical data from 242 manor inventories across the northeastern Baltic region, spanning the period from 100 to 1800 AD. Our findings revealed that rye remained a main staple crop throughout the studied periods reflecting environmental adaptation to northern latitudes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!