AI Article Synopsis

  • * Using a dataset of 779 patients, researchers analyzed various ML models and found that the eXtreme Gradient Boosting (XGBoost) algorithm outperformed others in predicting true positive CHD cases with a high positive predictive value (PPV) of 94%.
  • * Overall, the findings suggest that implementing ML techniques can improve the identification of CHD in large datasets, thus strengthening public health surveillance and data reliability.

Article Abstract

Introduction: International Classification of Diseases (ICD) codes recorded in administrative data are often used to identify congenital heart defects (CHD). However, these codes may inaccurately identify true positive (TP) CHD individuals. CHD surveillance could be strengthened by accurate CHD identification in administrative records using machine learning (ML) algorithms.

Methods: To identify features relevant to accurate CHD identification, traditional ML models were applied to a validated dataset of 779 patients; encounter level data, including ICD-9-CM and CPT codes, from 2011 to 2013 at four US sites were utilized. Five-fold cross-validation determined overlapping important features that best predicted TP CHD individuals. Median values and 95% confidence intervals (CIs) of area under the receiver operating curve, positive predictive value (PPV), negative predictive value, sensitivity, specificity, and F1-score were compared across four ML models: Logistic Regression, Gaussian Naive Bayes, Random Forest, and eXtreme Gradient Boosting (XGBoost).

Results: Baseline PPV was 76.5% from expert clinician validation of ICD-9-CM CHD-related codes. Feature selection for ML decreased 7138 features to 10 that best predicted TP CHD cases. During training and testing, XGBoost performed the best in median accuracy (F1-score) and PPV, 0.84 (95% CI: 0.76, 0.91) and 0.94 (95% CI: 0.91, 0.96), respectively. When applied to the entire dataset, XGBoost revealed a median PPV of 0.94 (95% CI: 0.94, 0.95).

Conclusions: Applying ML algorithms improved the accuracy of identifying TP CHD cases in comparison to ICD codes alone. Use of this technique to identify CHD cases would improve generalizability of results obtained from large datasets to the CHD patient population, enhancing public health surveillance efforts.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10841295PMC
http://dx.doi.org/10.1002/bdr2.2245DOI Listing

Publication Analysis

Top Keywords

chd cases
12
chd
10
machine learning
8
congenital heart
8
heart defects
8
administrative data
8
icd codes
8
chd individuals
8
accurate chd
8
chd identification
8

Similar Publications

Introduction: Undiagnosed chronic disease has serious health consequences, and variation in rates of underdiagnosis between populations can contribute to health inequalities. We aimed to estimate the level of undiagnosed disease of 11 common conditions and its variation across sociodemographic characteristics and regions in England.

Methods: We used linked primary care, hospital and mortality data on approximately 1.

View Article and Find Full Text PDF

Fetal echocardiography (FE) is recommended for parents with congenital heart disease (pCHD) due to a 3-6% recurrence risk of congenital heart disease (CHD). This study aimed to evaluate the cost of FE for detecting neonatal CHD in pCHD. FE data were collected between 12/2015 and 12/2022.

View Article and Find Full Text PDF

Objectives: This study aimed to assess the role of olfactory sulci (OS) in diagnosing CHARGE syndrome among fetuses with major congenital heart defects (CHDs).

Methods: We prospectively evaluated OS development in fetuses diagnosed with CHDs from 2017 to 2021. Neurosonography (NSG) was performed using transabdominal and transvaginal approaches after 30 weeks of gestation.

View Article and Find Full Text PDF

Eco-epidemiological Survey of Trypanosoma cruzi in Dogs from Mendoza, Argentina.

Ecohealth

January 2025

Laboratorio de Medicina y Endocrinología de la Fauna Silvestre, IMBECU, UNCuyo - CONICET, Av. Dr. Adrian Ruiz Leal s/n, Parque General San Martín, Mendoza, Argentina.

Urban domestic dog populations can provide important clues about the eco-epidemiological characteristics of Trypanosoma cruzi, the causative agent of Chagas disease (ChD). Given the limited data on ChD from the Metropolitan Area of Mendoza, Argentina, a seroprevalence survey of 327 dogs across an urban-rural gradient was conducted between April 2018 and May 2019. Seropositive cases were analyzed considering host, social, and environmental factors, subtypes (DTUs), and bloodstream parasite load.

View Article and Find Full Text PDF

Study Question: Is there an association between dydrogesterone exposure during early pregnancy and the reporting of birth defects?

Summary Answer: This observational analysis based on global safety data showed an increased reporting of birth defects, mainly hypospadias and congenital heart defects (CHD), in pregnancies exposed to dydrogesterone, especially when comparing to progesterone.

What Is Known Already: Intravaginal administration of progesterone is the standard of care to overcome luteal phase progesterone deficiency induced by ovarian stimulation in ART. In recent years, randomized controlled clinical trials demonstrated that oral dydrogesterone was non-inferior for pregnancy rate at 12 weeks of gestation and could be an alternative to micronized vaginal progesterone.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!