Model selection in high-dimensional settings has received substantial attention in recent years, however, similar advancements in the low-dimensional setting have been lacking. In this article, we introduce a new variable selection procedure for low to moderate scale regressions (n>p). This method repeatedly splits the data into two sets, one for estimation and one for validation, to obtain an empirically optimized threshold which is then used to screen for variables to include in the final model. In an extensive simulation study, we show that the proposed variable selection technique enjoys superior performance compared with candidate methods (backward elimination via repeated data splitting, univariate screening at 0.05 level, adaptive LASSO, SCAD), being amongst those with the lowest inclusion of noisy predictors while having the highest power to detect the correct model and being unaffected by correlations among the predictors. We illustrate the methods by applying them to a cohort of patients undergoing hepatectomy at our institution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8547352PMC
http://dx.doi.org/10.1002/sim.8538DOI Listing

Publication Analysis

Top Keywords

variable selection
12
repeated data
8
data splitting
8
optimized variable
4
selection
4
selection repeated
4
splitting model
4
model selection
4
selection high-dimensional
4
high-dimensional settings
4

Similar Publications

Background: In difficult colorectal cases, surgeons may opt for a hand-assisted laparoscopic (HALS) colectomy or attempt a laparoscopic surgery that may require an unplanned conversion to open (LCOS). We aimed to compare the clinical outcomes of these 2 types of surgeries.

Methods: Colectomies for acute diverticulitis with a HALS or LCOS surgery were selected from the American College of Surgeons (ACS) National Surgical Quality Improvement Program (NSQIP) 2022 Targeted Colectomy Database.

View Article and Find Full Text PDF
Article Synopsis
  • The emergence of Next Generation Sequencing (NGS) technology has transformed clinical diagnostics, providing extensive microbiome data for personalized medicine.
  • Despite its potential, microbiome data's complexity and variability pose challenges for traditional statistical and machine learning approaches, including deep learning.
  • The paper presents a novel feature engineering technique that combines two data feature sets, significantly improving the Deep Neural Network's performance in colorectal cancer detection, raising the Area Under the Curve (AUC) from 0.800 to 0.923, thus enhancing microbiome data analysis and disease detection capabilities.
View Article and Find Full Text PDF

Introduction: Anxiety is a prevalent psychological issue among cancer patients, significantly affecting their quality of life and potentially influencing treatment outcomes. This study aims to investigate the prevalence and associated factors of anxiety among cancer patients at the University of Gondar Comprehensive Specialized Hospital in Northwest Ethiopia.

Methods: A cross-sectional study design was used, involving 384 cancer patients, selected by systematic random sampling technique, from the oncology ward.

View Article and Find Full Text PDF

Background: Previous studies have separately suggested a possible association between the vitamin exposure, blood biochemical indicators, and bone density. Our study aimed to investigate the relationship between vitamin exposure serum concentrations, blood biochemical indicator serum concentrations, and BMC and BMD using the NHANES 2017-2018 nutrient survey data. This population-based cross-sectional study aimed to explore these associations.

View Article and Find Full Text PDF

Objectives: To compare the utility values of Spondyloarthritis (SpA)-specific ASAS Health Index (U-ASAS-HI) to generic utilities and to understand the contribution of health outcomes, personal- and country-level factors to the U-ASAS-HI.

Methods: Ancillary analysis of the ASAS-HI international validation study. SpA patients who completed the ASAS-HI, 5-level EuroQol-5D (EQ-5D-5L) and Short Form-36 (SF-36) questionnaires were selected, and utilities calculated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!