Optimizing the soft independent modeling of class analogy (SIMCA) using statistical prediction regions.

Anal Chim Acta

Vibra-Santé Hub, Laboratory of Pharmaceutical and Analytical Chemistry, Department of Pharmacy, CIRM, University of Liège, Avenue Hippocrate 15, 4000, Liège, Belgium.

Published: October 2022

The ultimate goal of a one-class classifier like the "rigorous" soft independent modeling of class analogy (SIMCA) is to predict with a certain confidence probability, the conformity of future objects with a given reference class. However, the SIMCA model, as currently implemented often suffers from an undercoverage problem, meaning that its observed sensitivity often falls far below the desired theoretical confidence probability, hence undermining its intended use as a predictive tool. To overcome the issue, the most reported strategy in the literature, involves incrementing the nominal confidence probability until the desired sensitivity is obtained in cross-validation. This article proposes a statistical prediction interval-based strategy as an alternative strategy to properly overcome this undercoverage issue. The strategy uses the concept of predictive distributions sensu stricto to construct statistical prediction regions for the metrics. Firstly, a procedure based on goodness-of-fit criteria is used to select the best-fitting family of probability models for each metric or its monotonic transformation, among several plausible candidate families of right-skewed probability distributions for positive random variables, including the gamma and the lognormal families. Secondly, assuming the best-fitting distribution, a generalized linear model is fitted to each metric data using the Bayesian method. This method enables to conveniently estimate uncertainties about the parameters of the selected distribution. Propagating these uncertainties to the best-fitting probability model of the metric enables to derive its so-called posterior predictive distribution, which is then used to set its critical limit. Overall, the evaluation of the proposed approach on a diversity of real datasets shows that it yields unbiased and more accurate sensitivities than existing methods which are not based on predictive densities. It can even yield better specificities than the strategy that attempts to improve sensitivities of existing methods by "optimizing" the type 1 error, especially in low sample sizes' contexts.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.aca.2022.340339DOI Listing

Publication Analysis

Top Keywords

statistical prediction
12
confidence probability
12
soft independent
8
independent modeling
8
modeling class
8
class analogy
8
analogy simca
8
prediction regions
8
sensitivities existing
8
existing methods
8

Similar Publications

Background: In 2024, the Korean Ministry of Health and Welfare enforced a policy to increase the number of medical school students by 2,000 over the next 5 years, despite opposition from doctors. This study aims to predict the trend of excess or shortage of medical personnel in Korea due to the policy of increasing the number of medical school students by 2035.

Methods: Data from multiple sources, including the Ministry of Health and Welfare, National Health Insurance Corporation, and the Korean Medical Association, were used to estimate supply and demand.

View Article and Find Full Text PDF

Background: Bullying, encompassing physical, psychological, social, or educational harm, affects approximately 1 in 20 United States teens aged 12-18. The prevalence and impact of bullying, including online bullying, necessitate a deeper understanding of risk and protective factors to enhance prevention efforts. This study investigated the key risk and protective factors most highly associated with adolescent bullying victimization.

View Article and Find Full Text PDF

Background: Evidence is lacking on whether chronic pain is related to the risk of cancer mortality. This study seeks to unveil the association between chronic pain and all-cause, cancer, as well as non-cancer death in cancer patients based on the National Health and Nutrition Examination Survey (NHANES) database.

Methods: Cancer survivors aged at least 20 (n = 1369) from 3 NHANES (1999-2004) cycles were encompassed.

View Article and Find Full Text PDF

CSPG4 overexpression implicates higher risks of recurrence and tumorigenesis after surgical intervention of vocal fold Leukoplakia.

Eur Arch Otorhinolaryngol

January 2025

ENT institute and Department of Otorhinolaryngology, Eye & ENT Hospital, Fudan University, 83 FenYang Road, Shanghai, 200031, China.

Background: Vocal fold leukoplakia (VFL), a precancerous lesion of the larynx, is characterized by white plaques on the vocal fold mucous membrane. Currently, there are no reliable biomarkers to predict the recurrence and malignant transformation of VFL. Considering chondroitin sulfate proteoglycan 4 (CSPG4) as a biomarker for malignant tumors such as laryngeal squamous cell carcinoma (LSCC), we conducted this cohort study to evaluate the prognostic influence of CSPG4 expression on VFL patients.

View Article and Find Full Text PDF

Measurement and Assessment of Head-to-Helmet Contact Forces.

Ann Biomed Eng

January 2025

Department of Mechanical and Industrial Engineering, Northeastern University, 360 Huntington Avenue, Boston, MA, 02115, USA.

Purpose: To evaluate the population variation in head-to-helmet contact forces in helmet users.

Methods: Four different size Kevlar composite helmets were instrumented with contact pressure sensors and chinstrap tension meters. A total number of 89 volunteers (25 female and 64 male volunteers) participated in the study.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!