Generalizable clinical note section identification with large language models.

JAMIA Open

Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02215, United States.

Published: October 2024

Objectives: Clinical note section identification helps locate relevant information and could be beneficial for downstream tasks such as named entity recognition. However, the traditional supervised methods suffer from transferability issues. This study proposes a new framework for using large language models (LLMs) for section identification to overcome the limitations.

Materials And Methods: We framed section identification as question-answering and provided the section definitions in free-text. We evaluated multiple LLMs off-the-shelf without any training. We also fine-tune our LLMs to investigate how the size and the specificity of the fine-tuning dataset impacts model performance.

Results: GPT4 achieved the highest 1 score of 0.77. The best open-source model (Tulu2-70b) achieved 0.64 and is on par with GPT3.5 (ChatGPT). GPT4 is also found to obtain 1 scores greater than 0.9 for 9 out of the 27 (33%) section types and greater than 0.8 for 15 out of 27 (56%) section types. For our fine-tuned models, we found they plateaued with an increasing size of the general domain dataset. We also found that adding a reasonable amount of section identification examples is beneficial.

Discussion: These results indicate that GPT4 is nearly production-ready for section identification, and seemingly contains both knowledge of note structure and the ability to follow complex instructions, and the best current open-source LLM is catching up.

Conclusion: Our study shows that LLMs are promising for generalizable clinical note section identification. They have the potential to be further improved by adding section identification examples to the fine-tuning dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11319784PMC
http://dx.doi.org/10.1093/jamiaopen/ooae075DOI Listing

Publication Analysis

Top Keywords

clinical note
12
note identification
12
generalizable clinical
8
identification
8
large language
8
language models
8
fine-tuning dataset
8
identification examples
8
note
4
identification large
4

Similar Publications

Drug Development.

Alzheimers Dement

December 2024

STEM Neurology & Neuropsychological0 Research Group Egypt (SNRGE), Port Said, Port Said, Egypt.

Background: Donepezil, an acetylcholinesterase inhibitor (AChEI), is an FDA-approved drug to treat these neurodegenerative diseases, e.g., Alzheimer's Disease (AD) and Mild Cognitive Impairment (MCI).

View Article and Find Full Text PDF

Dementia Care Research and Psychosocial Factors.

Alzheimers Dement

December 2024

NYU Langone Health, New York, NY, USA.

Background: Large language models (LLMs) provide powerful natural language processing capabilities in medical and clinical tasks. Evaluating LLM performance is crucial due to potential false results. In this study, we assessed ChatGPT and Llama2, two state-of-the-art LLMs, in extracting information from clinical notes, focusing on cognitive tests, specifically the Mini Mental State Exam (MMSE) and Cognitive Dementia Rating (CDR).

View Article and Find Full Text PDF

Background: HIV-associated neurocognitive disorders (HAND) are prevalent complications of ageing with treated HIV, disproportionally affecting sub-Saharan Africa. Causal HAND treatments are lacking worldwide; therefore, reversible factors are important to explore. Sleep duration and quality are frequently associated with risk of cognitive impairments.

View Article and Find Full Text PDF

Background: Cerebral amyloid angiopathy (CAA) is a significant contributor to hemorrhagic stroke, notably lobar intracerebral hemorrhage (ICH) and convexity subarachnoid hemorrhage (SAH). This study describes the natural occurrence of ICH and SAH events among veterans, including those with AD, within the United States Veterans Affairs Healthcare System (VAHS).

Method: The VAHS database was evaluated to identify ICD-10 codes for ICH (I61.

View Article and Find Full Text PDF

Background and objective Prostate cancer (PCA) is the most prevalent cancer among males. The National Institute for Health and Care Excellence (NICE) recommends referral to PCA diagnostic pathway based on two criteria: (1) abnormal digital rectal examination (DRE) and (2) elevated prostate-specific antigen (PSA). This study evaluates the diagnostic value of routine DRE in patients undergoing PCA assessment with pre-biopsy MRI.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!