Cross validation is a useful way of comparing predictive generalizability of theoretically plausible a priori models in structural equation modeling (SEM). A number of overall or local cross validation indices have been proposed for existing factor-based and component-based approaches to SEM, including covariance structure analysis and partial least squares path modeling. However, there is no such cross validation index available for generalized structured component analysis (GSCA) which is another component-based approach. We thus propose a cross validation index for GSCA, called Out-of-bag Prediction Error (OPE), which estimates the expected prediction error of a model over replications of so-called in-bag and out-of-bag samples constructed through the implementation of the bootstrap method. The calculation of this index is well-suited to the estimation procedure of GSCA, which uses the bootstrap method to obtain the standard errors or confidence intervals of parameter estimates. We empirically evaluate the performance of the proposed index through the analyses of both simulated and real data.

Download full-text PDF

Source
http://dx.doi.org/10.1080/00273171.2018.1540340DOI Listing

Publication Analysis

Top Keywords

cross validation
20
prediction error
12
out-of-bag prediction
8
validation generalized
8
generalized structured
8
structured component
8
component analysis
8
bootstrap method
8
cross
5
validation
5

Similar Publications

Machine learning prediction model for oral mucositis risk in head and neck radiotherapy: a preliminary study.

Support Care Cancer

January 2025

Oral Diagnosis Department, Faculdade de Odontolodia de Piracicaba, Universidade de Campinas (UNICAMP), Piracicaba, São Paulo, Brazil.

Purpose: Oral mucositis (OM) reflects a complex interplay of several risk factors. Machine learning (ML) is a promising frontier in science, capable of processing dense information. This study aims to assess the performance of ML in predicting OM risk in patients undergoing head and neck radiotherapy.

View Article and Find Full Text PDF

Hemorrhagic stroke is a known complication of glioma, yet the underlying mechanisms remain poorly understood. This study aims to investigate key biomarkers of glioma-related hemorrhage to provide insights into glioma molecular therapies. Data were obtained from the Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) databases to analyze differentially expressed genes (DEGs) in glioma by contrasting glioblastoma (GBM) with low-grade gliomas (LGGs).

View Article and Find Full Text PDF

Background: In Saudi Arabia, cervical cancer, frequently caused by human papillomavirus (HPV) infection, is a common cancer. The usual procedures for screening and diagnosing cervical cancer include Pap smears and HPV tests, even though they have considerable drawbacks, particularly for older women (> 60 years) who have limited access to or compliance with these tests. Urinalysis is a simple, noninvasive test that has been suggested as an alternative procedure.

View Article and Find Full Text PDF

Identifying cancer prognosis genes through causal learning.

Brief Bioinform

November 2024

School of Artificial Intelligence, Jilin University, 3003 Qianjin Street, 130012 Changchun, China.

Accurate identification of causal genes for cancer prognosis is critical for estimating disease progression and guiding treatment interventions. In this study, we propose CPCG (Cancer Prognosis's Causal Gene), a two-stage framework identifying gene sets causally associated with patient prognosis across diverse cancer types using transcriptomic data. Initially, an ensemble approach models gene expression's impact on survival with parametric and semiparametric hazard models.

View Article and Find Full Text PDF

Objective: This study aims to develop and validate a machine learning model for identifying individuals within the nursing population experiencing severe subjective cognitive decline (SCD) during the menopause transition, along with their associated factors.

Methods: A secondary analysis was performed using cross-sectional data from 1,264 nurses undergoing the menopause transition. The data set was randomly split into training (75%) and validation sets (25%), with the Bortua algorithm employed for feature selection.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!