Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.

Sumukh Vasisht Shankar Lovedeep S Dhingra Arya Aminorroaya Philip Adejumo Girish N Nadkarni Hua Xu Cynthia Brandt Evangelos K Oikonomou Aline F Pedroso Rohan Khera

medRxiv

Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA.

Published: October 2024

Rich cardiovascular data is often hidden in unstructured reports, making it difficult to use in real-time patient care and research due to manual abstraction requirements.
A two-step process was developed using generative and interpretative large language models (LLMs) to convert these unstructured reports into usable formats, specifically focusing on transthoracic echocardiograms (TTE).
The HeartDX-LM model demonstrated impressive accuracy, extracting 98.7% of values from unstructured reports across various datasets, proving its effectiveness in improving data accessibility for clinical analysis.

Background: Rich data in cardiovascular diagnostic testing are often sequestered in unstructured reports, with the necessity of manual abstraction limiting their use in real-time applications in patient care and research.

Methods: We developed a two-step process that sequentially deploys generative and interpretative large language models (LLMs; Llama2 70b and Llama2 13b). Using a Llama2 70b model, we generated varying formats of transthoracic echocardiogram (TTE) reports from 3,000 real-world echo reports with paired structured elements, leveraging temporal changes in reporting formats to define the variations. Subsequently, we fine-tuned Llama2 13b using sequentially larger batches of generated echo reports as inputs, to extract data from free-text narratives across 18 clinically relevant echocardiographic fields. This was set up as a prompt-based supervised training task. We evaluated the fine-tuned Llama2 13b model, HeartDx-LM, on several distinct echocardiographic datasets: (i) reports across the different time periods and formats at Yale New Haven Health System (YNHHS), (ii) the Medical Information Mart for Intensive Care (MIMIC) III dataset, and (iii) the MIMIC IV dataset. We used the accuracy of extracted fields and Cohen's Kappa as the metrics and have publicly released the HeartDX-LM model.

Results: The HeartDX-LM model was trained on randomly selected 2,000 synthetic echo reports with varying formats and paired structured labels, with a wide range of clinical findings. We identified a lower threshold of 500 annotated reports required for fine-tuning Llama2 13b to achieve stable and consistent performance. At YNHHS, the HeartDx-LM model accurately extracted 69,144 out of 70,032 values (98.7%) across 18 clinical fields from unstructured reports in the test set from contemporary records where paired structured data were also available. In older echo reports where only unstructured reports were available, the model achieved 87.1% accuracy against expert annotations for the same 18 fields for a random sample of 100 reports. Similarly, in expert-annotated external validation sets from MIMIC-IV and MIMIC-III, HeartDx-LM correctly extracted 201 out of 220 available values (91.3%) and 615 out of 707 available values (87.9%), respectively, from 100 randomly chosen and expert annotated echo reports from each set.

Conclusion: We developed a novel method using paired large and moderate-sized LLMs to automate the extraction of unstructured echocardiographic reports into tabular datasets. Our approach represents a scalable strategy that transforms unstructured reports into computable elements that can be leveraged to improve cardiovascular care quality and enable research.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11482995	PMC
http://dx.doi.org/10.1101/2024.10.08.24315035	DOI Listing

Publication Analysis

Top Keywords

echo reports

unstructured reports

llama2 13b

reports

paired structured

cardiovascular diagnostic

large language

language models

llama2 70b

varying formats

Similar Publications

Right ventricular stroke work index from echocardiography in patients with pulmonary arterial hypertension-the role in short-term follow-up assessment.

Eur Heart J Imaging Methods Pract

July 2024

Department of Clinical Sciences Lund, Clinical Physiology, Lund University, Skane University Hospital, Entrégatan, Lund 221 85, Sweden.

Raluca Jumatate Anna Werther-Evaldsson Annika Ingvarsson Göran Rådegran Carl Cronstedt Meurling

Aims: Right ventricular (RV) failure causes high mortality in patients with pulmonary arterial hypertension (PAH). RV stroke work index (RVSWi) poses as a potential predictor of outcome. We evaluated how RVSWi by echocardiography (ECHO) or right heart catheterization (RHC) is altered following PAH treatment and if RVSWi is an indicator of outcome in PAH.

View Article and Find Full Text PDF

Similar Publications

Coupled echosounder and Doppler profiler measurements in the Strait of Gibraltar.

Sci Rep

December 2024

Spanish Institute of Oceanography (IEO), Cádiz, Spain.

Simone Sammartino Jesús García-Lafuente Irene Nadal Ricardo F Sánchez-Leal

Long time series of velocity profiles collected by up-looking acoustic profilers in the westernmost sill of the Strait of Gibraltar show an unexpected pattern in the deepest ∼80 m of the water column, consisting in an appreciable diurnal weakening of the measured horizontal velocity. A harmonic analysis performed on long time series reveals a surprising magnitude of S constituent (exactly 1 cpd of frequency) in the horizontal velocity and echo amplitude, which prevails over the rest of diurnal constituents within this depth range, including K, despite being around 200 times smaller than it in the tide generating potential. High resolution echograms collected by a new instrument recently installed in the mooring line, point at the diel vertical migration of living acoustic scatterers (zooplankton) as the most reasonable cause.

View Article and Find Full Text PDF

Similar Publications

Bone Health ECHO Case Report: Significant Elevation in Bone Turnover Markers and Progression of Vertebral Fractures After Denosumab Discontinuation Followed by a PTH-Analog.

J Clin Densitom

December 2024

New Mexico Clinical Research & Osteoporosis Center, Albuquerque, NM, USA.

Yevgeniya Kushchayeva Sergiy Kushchayev Kimberly Dunn Iryna Pestun Micol S Rothman

Bone Health ECHO (Extension for Community Healthcare Outcomes) is a virtual community of practice, where healthcare professionals have met via videoconferencing weekly since 2015. This model of learning is focused on short didactics and the presentation of real but de-identified patient cases followed by highly interactive discussions. These are often clinical situations with diagnostic and therapeutic dilemmas that are not readily addressed by randomized placebo-controlled clinical trials and clinical practice guidelines.

View Article and Find Full Text PDF

Similar Publications

Diagnostic accuracy of texture analysis applied to T- and T-Relaxation maps for liver fibrosis classification via machine-learning algorithms with liver histology as reference standard.

Eur J Radiol

December 2024

Division of Endocrinology and Metabolism, Department of Medicine III, Medical University of Vienna, Austria.

Diana Sitarcikova Sarah Poetter-Lang Nina Bastati Sami Ba-Ssalamah Siegfried Trattnig

Objectives: To explore texture analysis' ability on T and T relaxation maps to classify liver fibrosis into no-to-mild liver fibrosis (nmF) versus severe fibrosis (sF) group using machine learning algorithms and histology as reference standard.

Materials And Methods: In this single-center study, patients undergoing 3 T MRI who also had histology examination were retrospectively enrolled. SNAPSHOT-FLASH sequence for T1 mapping, radial turbo-spin-echo sequence for T2 mapping and spin-echo echo-planar-imaging magnetic resonance elastography (MRE) sequences were analyzed.

View Article and Find Full Text PDF

Similar Publications

Online detection and evaluation of early fatigue cracks using sideband peak intensity technique with frequency-mismatched excitation pulse-echo method.

Ultrasonics

December 2024

Department of Civil and Architectural Engineering and Mechanics, University of Arizona, Tucson, AZ 85721, USA.

Fengling Wang Mingzhu Sun Shuzeng Zhang Guangdong Zhang Xiongbing Li

This work presents a nonlinear ultrasonic (NLU) technique called sideband peak intensity (SPI) combining an improved pulse-echo (PE) experimental method for online detection and evaluation of fatigue cracks at their early stages. Advantages of the proposed technique are that it enjoys the high sensitivity and ease of application of NLU SPI technique and easy implementation of the PE experimental method. The PE experimental method is improved by adopting frequency-mismatched excitations to enhance the sensitivity and robustness of the SPI technique.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!