Importance: An emergency medicine (EM) handoff note generated by a large language model (LLM) has the potential to reduce physician documentation burden without compromising the safety of EM-to-inpatient (IP) handoffs.

Objective: To develop LLM-generated EM-to-IP handoff notes and evaluate their accuracy and safety compared with physician-written notes.

Design, Setting, And Participants: This cohort study used EM patient medical records with acute hospital admissions that occurred in 2023 at NewYork-Presbyterian/Weill Cornell Medical Center. A customized clinical LLM pipeline was trained, tested, and evaluated to generate templated EM-to-IP handoff notes. Using both conventional automated methods (ie, recall-oriented understudy for gisting evaluation [ROUGE], bidirectional encoder representations from transformers score [BERTScore], and source chunking approach for large-scale inconsistency evaluation [SCALE]) and a novel patient safety-focused framework, LLM-generated handoff notes vs physician-written notes were compared. Data were analyzed from October 2023 to March 2024.

Exposure: LLM-generated EM handoff notes.

Main Outcomes And Measures: LLM-generated handoff notes were evaluated for (1) lexical similarity with respect to physician-written notes using ROUGE and BERTScore; (2) fidelity with respect to source notes using SCALE; and (3) readability, completeness, curation, correctness, usefulness, and implications for patient safety using a novel framework.

Results: In this study of 1600 EM patient records (832 [52%] female and mean [SD] age of 59.9 [18.9] years), LLM-generated handoff notes, compared with physician-written ones, had higher ROUGE (0.322 vs 0.088), BERTScore (0.859 vs 0.796), and SCALE scores (0.691 vs 0.456), indicating the LLM-generated summaries exhibited greater similarity and more detail. As reviewed by 3 board-certified EM physicians, a subsample of 50 LLM-generated summaries had a mean (SD) usefulness score of 4.04 (0.86) out of 5 (compared with 4.36 [0.71] for physician-written) and mean (SD) patient safety scores of 4.06 (0.86) out of 5 (compared with 4.50 [0.56] for physician-written). None of the LLM-generated summaries were classified as a critical patient safety risk.

Conclusions And Relevance: In this cohort study of 1600 EM patient medical records, LLM-generated EM-to-IP handoff notes were determined superior compared with physician-written summaries via conventional automated evaluation methods, but marginally inferior in usefulness and safety via a novel evaluation framework. This study suggests the importance of a physician-in-loop implementation design for this model and demonstrates an effective strategy to measure preimplementation patient safety of LLM models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11615705PMC
http://dx.doi.org/10.1001/jamanetworkopen.2024.48723DOI Listing

Publication Analysis

Top Keywords

handoff notes
28
llm-generated handoff
16
patient safety
16
em-to-ip handoff
12
compared physician-written
12
llm-generated summaries
12
notes
10
handoff
9
llm-generated
9
large language
8

Similar Publications

Introduction: Effective handoff between pediatric residents is crucial to ensure continuity of care and patient safety. Omissions in information and communication breakdowns can be associated with uncertainty in clinical decision-making and adverse patient events. In our role as chief residents, we were notified of an increase in patient safety alerts due to communication failures and gaps during handoff.

View Article and Find Full Text PDF

Developing and Evaluating Large Language Model-Generated Emergency Medicine Handoff Notes.

JAMA Netw Open

December 2024

Department of Emergency Medicine, NewYork-Presbyterian/Weill Cornell Medicine, New York.

Importance: An emergency medicine (EM) handoff note generated by a large language model (LLM) has the potential to reduce physician documentation burden without compromising the safety of EM-to-inpatient (IP) handoffs.

Objective: To develop LLM-generated EM-to-IP handoff notes and evaluate their accuracy and safety compared with physician-written notes.

Design, Setting, And Participants: This cohort study used EM patient medical records with acute hospital admissions that occurred in 2023 at NewYork-Presbyterian/Weill Cornell Medical Center.

View Article and Find Full Text PDF
Article Synopsis
  • This study focused on understanding how common diagnostic uncertainty is when critically ill children are admitted to Pediatric Intensive Care Units (PICUs) and what factors contribute to it.
  • Researchers reviewed medical records from 882 pediatric patients across four hospitals to assess the presence of diagnostic uncertainty at admission and how it changed by the time of discharge.
  • Key findings indicated that 25.9% of patients showed diagnostic uncertainty upon PICU admission, with significant factors being the time of admission, illness severity, atypical symptoms, and discrepancies in diagnoses between different healthcare providers.
View Article and Find Full Text PDF
Article Synopsis
  • This study explored how critical care nurses in Egypt perceive patient safety culture and its link to adverse events affecting patient care.
  • Findings showed low positive responses in critical areas like staffing and teamwork, with many nurses having a moderate to high perception of overall patient safety.
  • A significant relationship was found between poor safety culture perceptions and higher incidences of adverse events, such as patient falls and drug-related issues, particularly among nurses with certain demographics and experience levels.
View Article and Find Full Text PDF

Interfacility patient transfers are fraught with issues such as missed or ineffective communication in Montana given wide geographic distance between facilities and variance in resources. Inaccurate, absent, or delayed patient details may negatively affect patient outcomes and further result in duplicative testing and medication errors. The objective of this study was to describe the process of patient information communication during interfacility transfers as perceived by nurses practicing in Montana.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!