Background: Prediction models have demonstrated a range of applications across medicine, including using electronic health record (EHR) data to identify hospital readmission and mortality risk. Large language models (LLMs) can transform unstructured EHR text into structured features, which can then be integrated into statistical prediction models, ensuring that the results are both clinically meaningful and interpretable.

Objective: This study aims to compare the classification decisions made by clinical experts with those generated by a state-of-the-art LLM, using terms extracted from a large EHR data set of individuals with mental health disorders seen in emergency departments (EDs).

Methods: Using a dataset from the EHR systems of more than 50 health care provider organizations in the United States from 2016 to 2021, we extracted all clinical terms that appeared in at least 1000 records of individuals admitted to the ED for a mental health-related problem from a source population of over 6 million ED episodes. Two experienced mental health clinicians (one medically trained psychiatrist and one clinical psychologist) reached consensus on the classification of EHR terms and diagnostic codes into categories. We evaluated an LLM's agreement with clinical judgment across three classification tasks as follows: (1) classify terms into "mental health" or "physical health", (2) classify mental health terms into 1 of 42 prespecified categories, and (3) classify physical health terms into 1 of 19 prespecified broad categories.

Results: There was high agreement between the LLM and clinical experts when categorizing 4553 terms as "mental health" or "physical health" (κ=0.77, 95% CI 0.75-0.80). However, there was still considerable variability in LLM-clinician agreement on the classification of mental health terms (κ=0.62, 95% CI 0.59-0.66) and physical health terms (κ=0.69, 95% CI 0.67-0.70).

Conclusions: The LLM displayed high agreement with clinical experts when classifying EHR terms into certain mental health or physical health term categories. However, agreement with clinical experts varied considerably within both sets of mental and physical health term categories. Importantly, the use of LLMs presents an alternative to manual human coding, presenting great potential to create interpretable features for prediction models.

Download full-text PDF

Source
http://dx.doi.org/10.2196/65454DOI Listing

Publication Analysis

Top Keywords

mental health
24
prediction models
16
clinical experts
16
health terms
16
physical health
16
health
13
agreement clinical
12
terms
10
electronic health
8
mental
8

Similar Publications

Background: Understanding based on up-to-date data on the burden of non-communicable diseases (NCDs) is limited, especially regarding how subtypes contribute to the overall NCD burden and the attributable risk factors across locations and subtypes. We aimed to report the global, regional, and national burden of NCDs, subtypes, and attributable risk factors in 2021, and trends from 1990 to 2021 by age, sex, and socio-demographic index (SDI).

Materials And Methods: We used data from the Global Burden of Disease Study 2021 to estimate the prevalence, deaths, and disability-adjusted life years (DALYs) for NCDs and subtypes, along with attributable risk factors.

View Article and Find Full Text PDF

The present study sought to examine the occurrence and correlates of depression, PTSD, and insomnia in a cohort of Palestinian refugees residing in camps located in Jordan during the outbreak of the War on Gaza on Oct.7th.This is a cross-sectional cohort study that employed the convenient sampling method to recruit Palestinian refugees residing in Irbid and Azmi Almufti camps for Palestinian refugees.

View Article and Find Full Text PDF

We aimed to compare sleep problems in autistic and non-autistic adults with co-occurring depression and anxiety. The primary research question was whether autism status influences sleep quality, after accounting for the effects of depression and anxiety. We hypothesized that autistic adults would report higher levels of depression, anxiety, and sleep problems compared to non-autistic adults, after controlling for these covariates.

View Article and Find Full Text PDF

Purpose: Prior research demonstrates that children with autism are more likely to experience unintentional injuries than the general population. Limited research exists on the symptoms or traits directly related to autism and this elevated injury rate, especially from the perspective of families with children with autism. This study used qualitative methodology to elucidate risk factors that may contribute to unintentional injuries in children with autism from the perspective of mothers raising children with autism.

View Article and Find Full Text PDF

Purpose: Individuals with metastatic breast cancer (MBC) may live with their disease for many years. We initiated the Johns Hopkins Hope at Hopkins Clinic to assess the needs and optimize the care of these patients.

Patients And Methods: Patients with MBC who agreed to participate in the Clinic in addition to usual care completed patient-reported outcome (PRO) surveys.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!