Objectives: To provide a foundational methodology for differentiating comorbidity patterns in subphenotypes through investigation of a multi-site dementia patient dataset.

Materials and Methods: Employing the National Clinical Cohort Collaborative Tenant Pilot (N3C Clinical) dataset, our approach integrates two machine learning algorithms, logistic regression and eXtreme Gradient Boosting (XGBoost), with a diagnostic hierarchical model for nuanced classification of dementia subtypes based on comorbidities and gender. The methodology is enhanced by multi-site EHR data and addresses class imbalance with a hybrid sampling strategy that combines 65% Synthetic Minority Over-sampling Technique (SMOTE), 35% Random Under-Sampling (RUS), and Tomek Links. The hierarchical model further refines the analysis, allowing for a layered understanding of disease patterns.
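
The hybrid sampling strategy described above can be illustrated with a minimal, standard-library-only sketch for a binary problem. The abstract does not say how the 65%/35% split is applied, so this sketch assumes that 65% of the gap between class sizes is closed by SMOTE-style synthetic minority samples and the remaining 35% by randomly dropping majority samples, followed by removal of the majority member of each Tomek link. The function names, the parameters (`smote_share`, `k`, `seed`), and the label convention (0 = majority, 1 = minority) are all hypothetical and are not taken from the study.

```python
import math
import random


def smote(minority, n_new, k, rng):
    """Create n_new synthetic points by interpolating a randomly chosen
    minority point toward one of its k nearest minority neighbours.
    Assumes the minority class has at least two points."""
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(base, p))[:k]
        target = rng.choice(neighbours)
        step = rng.random()  # position along the segment base -> target
        synthetic.append(tuple(b + step * (t - b) for b, t in zip(base, target)))
    return synthetic


def remove_tomek_links(X, y, majority_label):
    """Drop the majority member of every Tomek link, i.e. every pair of
    mutual nearest neighbours that carry different labels."""
    def nearest(i):
        return min((j for j in range(len(X)) if j != i),
                   key=lambda j: math.dist(X[i], X[j]))

    nn = [nearest(i) for i in range(len(X))]
    drop = {i for i, j in enumerate(nn)
            if nn[j] == i and y[i] != y[j] and y[i] == majority_label}
    keep = [i for i in range(len(X)) if i not in drop]
    return [X[i] for i in keep], [y[i] for i in keep]


def hybrid_resample(X, y, smote_share=0.65, k=5, seed=0):
    """Balance a binary dataset (0 = majority, 1 = minority).

    ASSUMPTION (not from the source): smote_share of the class-size gap
    is closed by oversampling, the rest by random under-sampling.
    """
    rng = random.Random(seed)
    minority = [x for x, label in zip(X, y) if label == 1]
    majority = [x for x, label in zip(X, y) if label == 0]

    gap = len(majority) - len(minority)
    n_new = round(smote_share * gap)   # synthetic minority samples to add
    n_drop = gap - n_new               # majority samples to discard

    minority = minority + smote(minority, n_new, k, rng)
    majority = rng.sample(majority, len(majority) - n_drop)

    X_bal = majority + minority
    y_bal = [0] * len(majority) + [1] * len(minority)
    return remove_tomek_links(X_bal, y_bal, majority_label=0)
```

Calling `hybrid_resample(X, y)` on a list of feature tuples `X` and a list of 0/1 labels `y` returns the resampled features and labels. Because the Tomek step only removes majority points, the final classes can be slightly unequal. In a real pipeline, resampling of this kind is normally applied to the training split only, so that evaluation data keeps its original class distribution.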

Results: The study identified significant comorbidity patterns associated with diagnosis of Alzheimer's, Vascular, and Lewy Body dementia subtypes. The classification models achieved accuracies up to 69% for Alzheimer's/Vascular dementia and highlighted challenges in distinguishing Dementia with Lewy Bodies. The hierarchical model elucidates the complexity of diagnosing Dementia with Lewy Bodies and reveals the potential impact of regional clinical practices on dementia classification.

Conclusion: Our methodology underscores the importance of leveraging multi-site datasets and tailored sampling techniques for dementia research. This framework holds promise for extending to other disease subtypes, offering a pathway to more nuanced and generalizable insights into dementia and its complex interplay with comorbid conditions.

Discussion: This study underscores the critical role of multi-site data analyses in understanding the relationship between comorbidities and disease subtypes. By utilizing diverse healthcare data, we emphasize the need to consider site-specific differences in clinical practices and patient demographics. Despite challenges like class imbalance and variability in EHR data, our findings highlight the essential contribution of multi-site data to developing accurate and generalizable models for disease classification.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11316614
DOI: http://dx.doi.org/10.1093/jamiaopen/ooae076
