Objective: This study evaluates regularization variants in logistic regression (L1, L2, ElasticNet, Adaptive L1, Adaptive ElasticNet, Broken adaptive ridge [BAR], and Iterative hard thresholding [IHT]) for discrimination and calibration performance, focusing on both internal and external validation.

Materials And Methods: We use data from 5 US claims and electronic health record databases and develop models for various outcomes in a major depressive disorder patient population. We externally validate all models in the other databases. We use a train-test split of 75%/25% and evaluate performance with discrimination and calibration. Statistical analysis for difference in performance uses Friedman's test and critical difference diagrams.

Results: Of the 840 models we develop, L1 and ElasticNet emerge as superior in both internal and external discrimination, with a notable AUC difference. BAR and IHT show the best internal calibration, without a clear external calibration leader. ElasticNet typically has larger model sizes than L1. Methods like IHT and BAR, while slightly less discriminative, significantly reduce model complexity.

Conclusion: L1 and ElasticNet offer the best discriminative performance in logistic regression for healthcare predictions, maintaining robustness across validations. For simpler, more interpretable models, L0-based methods (IHT and BAR) are advantageous, providing greater parsimony and calibration with fewer features. This study aids in selecting suitable regularization techniques for healthcare prediction models, balancing performance, complexity, and interpretability.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11187433PMC
http://dx.doi.org/10.1093/jamia/ocae109DOI Listing

Publication Analysis

Top Keywords

logistic regression
8
discrimination calibration
8
internal external
8
methods iht
8
iht bar
8
models
6
elasticnet
5
calibration
5
performance
5
comparing penalization
4

Similar Publications

Background: It has been suggested that dog walking may protect against falls and mobility problems in later life, but little work to date has examined this.The aim of this study was to assess if regular dog walking was associated with reduced likelihood of falls, fear of falling and mobility problems in a large cohort of community-dwelling older people.

Methods: Participants ≥60 years at Wave 5 of The Irish Longitudinal Study on Ageing were included.

View Article and Find Full Text PDF

The current study was deployed to evaluate the role of metastasis-associated lung adenocarcinoma transcript 1 (MALAT1) and miR-155, along with the inflammatory markers, TNFα and IL-6, and the adhesion molecule, cluster of differentiation 106 (CD106), in Behçet's disease (BD) pathogenesis. The study also assessed MALAT1/miR-155 as promising diagnostic and prognostic biomarkers for BD. The current retrospective case-control study included 74 Egyptian BD patients and 50 age and sex-matched controls.

View Article and Find Full Text PDF

The COVID-19 pandemic led to significant shifts in societal norms and individual behaviors, including changes in physical activity levels. This study examines the relationship between socioeconomic and sociodemographic factors and changes in physical activity levels during the pandemic compared to pre-pandemic levels among adult Arkansans. Survey data were collected from 1,205 adult Arkansans in July and August 2020, capturing socioeconomic and sociodemographic characteristics and information on physical activity changes since the onset of the pandemic.

View Article and Find Full Text PDF

Background: Bariatric surgery is the most effective intervention for severe pediatric obesity, but a subset of youth experience suboptimal weight loss and/or recurrent weight gain. Early re-initiation of obesity pharmacotherapy postoperatively may improve outcomes, though this has not been evaluated in pediatric populations.

Methods: A retrospective cohort study at a tertiary care children's hospital evaluated the safety and efficacy of reintroducing obesity pharmacotherapy within six weeks after laparoscopic sleeve gastrectomy (LSG).

View Article and Find Full Text PDF

Cohort-based nomogram for forensic prediction of SCD: a single-center pilot study.

Forensic Sci Med Pathol

January 2025

Department of Forensic Pathology, School of Forensic Medicine, China Medical University, Shenyang, 110122, P. R. China.

Forensic diagnosis of sudden cardiac death (SCD) is an extremely important part of routine forensic practice. The present study aimed to develop and validate nomograms for predicting the probability of SCD with special regards to ischemic heart disease-induced SCD (IHD-induced SCD) based on multiple autopsy variables. A total of 3322 cases, were enrolled and randomly assigned into a training cohort (n = 2325) and a validation cohort (n = 997), respectively.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!