A novel interval sparse evolutionary algorithm for efficient spectral variable selection.

Anal Chim Acta

School of Automation, Central South University, 410083, Changsha, China. Electronic address:

Published: March 2025

In spectral analysis, selecting the right spectral variables is crucial for effective modeling. It reduces data dimensionality, removes irrelevant wavelength points, and improves both the generalization ability and computational efficiency of the model. However, the number of available samples often falls short of the total possible combinations of wavelengths, making variable selection a non-deterministic polynomial-time (NP) hard optimization problem. The current dedicated variable selection and heuristic algorithms fail to balance the effectiveness and speed of variable selection. Therefore, there is a great need for a more advanced approach to address this problem. (92) RESULTS: In this paper, we adopt a different perspective by considering variable selection as a large-scale sparse multi-objective optimization problem, modeled with fewer variables to achieve lower prediction errors. Then a novel interval sparse evolutionary algorithm (ISEA) was proposed, merging the benefits of dedicated variable selection algorithms and evolutionary algorithms. It incorporates a roulette probability mechanism and enhances the selection probability of key informative variables through a sparse population initialization strategy (SPIS) and a regional sparse evolution strategy (RSES). Specifically, the SPIS prioritizes variable regions through interval partial least squares (iPLS) and initializes the sparse population based on regional roulette probability, thereby enhancing the likelihood of selection of important regional variables in the initial sparse population. The RSES further focuses on more important regions, ensuring the variables in more important regions have a higher survival probability in subsequent generations. (138) SIGNIFICANCE: Applied to datasets of corn oil, soil, and diesel fuels, ISEA outperforms nine state-of-the-art methods by maintaining both the effectiveness of variable selection and running speed. Additionally, unlike dedicated variable selection algorithms, it is suitable for both specific variable selection scenarios and other large-scale sparse problems, such as critical node detection, frequent pattern mining, and neural network node training, demonstrating wide application potential. (63).

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.aca.2025.343655DOI Listing

Publication Analysis

Top Keywords

variable selection
36
dedicated variable
12
sparse population
12
selection
11
variable
10
novel interval
8
sparse
8
interval sparse
8
sparse evolutionary
8
evolutionary algorithm
8

Similar Publications

Costimulation blockade: the next generation.

Curr Opin Organ Transplant

January 2025

Division of Nephrology, Virginia Commonwealth University, Richmond, Virginia, USA.

Purpose Of The Review: Calcineurin inhibitors (CNIs) are central to immunosuppression in kidney transplantation (KT), improving short-term outcomes but falling short in enhancing long-term outcomes due to cardiovascular, metabolic, and renal complications. Belatacept, an FDA-approved costimulation blocker, offers a less toxic alternative to CNIs but is limited by its intravenous administration and reduced efficacy in high-immunological-risk patients.

Recent Findings: Emerging therapies target more specific pathways to improve efficacy and accessibility.

View Article and Find Full Text PDF

Fixed start method for repetitive project scheduling with simulated annealing.

Heliyon

January 2025

Engineering School of Sustainable Infrastructure and Environment, University of Florida, Gainesville, FL, United States.

This research deals with the high project completion variability by presenting a new method to decrease such variability in repetitive construction projects. To achieve this, a Fixed Start Method (FSM) -where the starts of each activity with a high level of probabilistic confidence for the planned project duration are fixed- was applied, where the high level of probabilistic confidence obtained was optimized with the use of the metaheuristic algorithm called Simulated Annealing (SA). This procedure evaluated the project completion in a case study based on the coefficient of variance (COV) of the resulting standard deviation, mean, and temperature selected for the SA.

View Article and Find Full Text PDF

Predicting competitive alpine skiing performance by multivariable statistics-the need for individual profiling.

Front Sports Act Living

January 2025

Section of Sports Medicine, Department of Community Medicine and Rehabilitation, Umeå School of Sport Sciences, Umeå University, Umeå, Sweden.

Introduction: Predicting competitive alpine skiing performance using conventional statistical methods has proven challenging. Many studies assessing the relationship between physiological performance and skiing outcomes have employed statistical methods of questionable validity. Furthermore, the reliance on Fédération Internationale de Ski (FIS) points as a performance outcome variable presents additional limitations due to its potential unreliability in reflecting short-term, sport-specific performance.

View Article and Find Full Text PDF

Background: Postural Orthostatic Tachycardia Syndrome (POTS) is a complex form of dysautonomia that presents with abnormal autonomic reflexes upon standing, leading to symptoms such as lightheadedness, tachycardia, fatigue, and cognitive impairment. The COVID-19 pandemic has brought renewed attention to POTS due to its overlap with post-acute sequelae of COVID-19 (PASC). Studies have found that a substantial percentage of COVID-19 survivors exhibit symptoms resembling POTS, elevating POTS diagnoses to previously unseen levels.

View Article and Find Full Text PDF

Background: Globally, in ~50% of epilepsy cases, the underlying cause remains unknown, despite the fact that various disease pathways may contribute to the condition. Nearly 80% of people with epilepsy live in low- and middle-income countries and the risk of premature death in people with epilepsy is up to three times higher than that for the general population. Identifying the determinants of epilepsy is important for applying evidence-based interventions to achieve a better outcome.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!