Objective: Unstructured data present in electronic health records (EHR) are a rich source of medical information; however, their abstraction is labor intensive. Automated EHR phenotyping (AEP) can reduce the need for manual chart review. We present an AEP model that is designed to automatically identify patients diagnosed with epilepsy.

Methods: The ground truth for model training and evaluation was captured from a combination of structured questionnaires filled out by physicians for a subset of patients and manual chart review using customized software. Modeling features included indicators of the presence of keywords and phrases in unstructured clinical notes, prescriptions for antiseizure medications (ASMs), International Classification of Diseases (ICD) codes for seizures and epilepsy, number of ASMs and epilepsy-related ICD codes, age, and sex. Data were randomly divided into training (70%) and hold-out testing (30%) sets, with distinct patients in each set. We trained regularized logistic regression and an extreme gradient boosting models. Model performance was measured using area under the receiver operating curve (AUROC) and area under the precision-recall curve (AUPRC), with 95% confidence intervals (CI) estimated via bootstrapping.

Results: Our study cohort included 3903 adults drawn from outpatient departments of nine hospitals between February 2015 and June 2022 (mean age = 47 ± 18 years, 57% women, 82% White, 84% non-Hispanic, 70% with epilepsy). The final models included 285 features, including 246 keywords and phrases captured from 8415 encounters. Both models achieved AUROC and AUPRC of 1 (95% CI = .99-1.00) in the hold-out testing set.

Significance: A machine learning-based AEP approach accurately identifies patients with epilepsy from notes, ICD codes, and ASMs. This model can enable large-scale epilepsy research using EHR databases.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10239346PMC
http://dx.doi.org/10.1111/epi.17589DOI Listing

Publication Analysis

Top Keywords

icd codes
12
patients epilepsy
8
electronic health
8
health records
8
manual chart
8
chart review
8
keywords phrases
8
hold-out testing
8
auprc 95%
8
epilepsy
5

Similar Publications

Introduction: Manipulation under anesthesia (MUA) is a standard and effective treatment to correct stiffness and improve range of motion (ROM) following total knee arthroplasty (TKA). Delayed MUA has been associated with increased rates of revision surgeries and infections. Early MUA has been shown to double the mean gain in flexion compared to delayed interventions.

View Article and Find Full Text PDF

Metformin intake and risk of metabolic acidosis after radical cystectomy with urinary diversion: A comparative study using data from the TriNetX research network.

Urol Oncol

January 2025

The James Buchanan Brady Urological Institute and Department of Urology, Johns Hopkins University School of Medicine, Baltimore, MD; Department of Oncology, The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD. Electronic address:

Purpose: To investigate the association of diabetes mellitus and metformin use with metabolic acidosis risk after radical cystectomy (RC) and urinary diversion for bladder cancer.

Materials And Methods: This retrospective cohort study used TriNetX Research Network data. Patients undergoing RC with continent diversion or ileal conduit for bladder cancer were identified using International Classification of Diseases, 10th Revision (ICD-10) and ICD-10 Procedure Coding System (ICD-10-PCS) codes.

View Article and Find Full Text PDF

Preconception GLP-1 Receptor Agonist Use Associated with Decreased Risk for Adverse Obstetric Outcomes.

Am J Obstet Gynecol

January 2025

Department of Obstetrics and Gynecology, University Hospitals Cleveland Medical Center, Cleveland OH; Department of Reproductive Biology, Case Western Reserve University, Cleveland, OH. Electronic address:

Background: The use of glucagon-like-peptide-1 receptor agonists (GLP-1RAs) has greatly increased in patients of reproductive age within the past four years. However, there is minimal research into the long-term impact of these medications on future pregnancies.

Objectives: We aimed to evaluate the association between adverse obstetric outcomes and antecedent GLP-1RA use using a nationally representative database.

View Article and Find Full Text PDF

Large language models vs human for classifying clinical documents.

Int J Med Inform

January 2025

College of Science and Engineering, James Cook University, Townsville, 4811, QLD, Australia. Electronic address:

Background: Accurate classification of medical records is crucial for clinical documentation, particularly when using the 10th revision of the International Classification of Diseases (ICD-10) coding system. The use of machine learning algorithms and Systematized Nomenclature of Medicine (SNOMED) mapping has shown promise in performing these classifications. However, challenges remain, particularly in reducing false negatives, where certain diagnoses are not correctly identified by either approach.

View Article and Find Full Text PDF

Purpose: This study aimed to investigate the incidence and characteristics of sudden sensorineural hearing loss (SSNHL) in pregnant and non-pregnant women using the Korean National Health Insurance Service customized cohort data.

Materials And Methods: We defined the delivery group as women aged 15-49 years with International Classification of Diseases 10th Revision codes O80-O84 indicating delivery between January 2013 and December 2019. The control group was selected from individuals in the same age range without a history of delivery during the same period.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!