AI Article Synopsis

  • Rich cardiovascular data is often hidden in unstructured reports, making it difficult to use in real-time patient care and research due to manual abstraction requirements.
  • A two-step process was developed using generative and interpretative large language models (LLMs) to convert these unstructured reports into usable formats, specifically focusing on transthoracic echocardiograms (TTE).
  • The HeartDX-LM model demonstrated impressive accuracy, extracting 98.7% of values from unstructured reports across various datasets, proving its effectiveness in improving data accessibility for clinical analysis.

Article Abstract

Background: Rich data in cardiovascular diagnostic testing are often sequestered in unstructured reports, with the necessity of manual abstraction limiting their use in real-time applications in patient care and research.

Methods: We developed a two-step process that sequentially deploys generative and interpretative large language models (LLMs; Llama2 70b and Llama2 13b). Using a Llama2 70b model, we generated varying formats of transthoracic echocardiogram (TTE) reports from 3,000 real-world echo reports with paired structured elements, leveraging temporal changes in reporting formats to define the variations. Subsequently, we fine-tuned Llama2 13b using sequentially larger batches of generated echo reports as inputs, to extract data from free-text narratives across 18 clinically relevant echocardiographic fields. This was set up as a prompt-based supervised training task. We evaluated the fine-tuned Llama2 13b model, HeartDx-LM, on several distinct echocardiographic datasets: (i) reports across the different time periods and formats at Yale New Haven Health System (YNHHS), (ii) the Medical Information Mart for Intensive Care (MIMIC) III dataset, and (iii) the MIMIC IV dataset. We used the accuracy of extracted fields and Cohen's Kappa as the metrics and have publicly released the HeartDX-LM model.

Results: The HeartDX-LM model was trained on randomly selected 2,000 synthetic echo reports with varying formats and paired structured labels, with a wide range of clinical findings. We identified a lower threshold of 500 annotated reports required for fine-tuning Llama2 13b to achieve stable and consistent performance. At YNHHS, the HeartDx-LM model accurately extracted 69,144 out of 70,032 values (98.7%) across 18 clinical fields from unstructured reports in the test set from contemporary records where paired structured data were also available. In older echo reports where only unstructured reports were available, the model achieved 87.1% accuracy against expert annotations for the same 18 fields for a random sample of 100 reports. Similarly, in expert-annotated external validation sets from MIMIC-IV and MIMIC-III, HeartDx-LM correctly extracted 201 out of 220 available values (91.3%) and 615 out of 707 available values (87.9%), respectively, from 100 randomly chosen and expert annotated echo reports from each set.

Conclusion: We developed a novel method using paired large and moderate-sized LLMs to automate the extraction of unstructured echocardiographic reports into tabular datasets. Our approach represents a scalable strategy that transforms unstructured reports into computable elements that can be leveraged to improve cardiovascular care quality and enable research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11482995PMC
http://dx.doi.org/10.1101/2024.10.08.24315035DOI Listing

Publication Analysis

Top Keywords

echo reports
20
unstructured reports
16
llama2 13b
16
reports
15
paired structured
12
cardiovascular diagnostic
8
large language
8
language models
8
llama2 70b
8
varying formats
8

Similar Publications

Aims: Right ventricular (RV) failure causes high mortality in patients with pulmonary arterial hypertension (PAH). RV stroke work index (RVSWi) poses as a potential predictor of outcome. We evaluated how RVSWi by echocardiography (ECHO) or right heart catheterization (RHC) is altered following PAH treatment and if RVSWi is an indicator of outcome in PAH.

View Article and Find Full Text PDF

Long time series of velocity profiles collected by up-looking acoustic profilers in the westernmost sill of the Strait of Gibraltar show an unexpected pattern in the deepest ∼80 m of the water column, consisting in an appreciable diurnal weakening of the measured horizontal velocity. A harmonic analysis performed on long time series reveals a surprising magnitude of S constituent (exactly 1 cpd of frequency) in the horizontal velocity and echo amplitude, which prevails over the rest of diurnal constituents within this depth range, including K, despite being around 200 times smaller than it in the tide generating potential. High resolution echograms collected by a new instrument recently installed in the mooring line, point at the diel vertical migration of living acoustic scatterers (zooplankton) as the most reasonable cause.

View Article and Find Full Text PDF

Bone Health ECHO (Extension for Community Healthcare Outcomes) is a virtual community of practice, where healthcare professionals have met via videoconferencing weekly since 2015. This model of learning is focused on short didactics and the presentation of real but de-identified patient cases followed by highly interactive discussions. These are often clinical situations with diagnostic and therapeutic dilemmas that are not readily addressed by randomized placebo-controlled clinical trials and clinical practice guidelines.

View Article and Find Full Text PDF

Objectives: To explore texture analysis' ability on T and T relaxation maps to classify liver fibrosis into no-to-mild liver fibrosis (nmF) versus severe fibrosis (sF) group using machine learning algorithms and histology as reference standard.

Materials And Methods: In this single-center study, patients undergoing 3 T MRI who also had histology examination were retrospectively enrolled. SNAPSHOT-FLASH sequence for T1 mapping, radial turbo-spin-echo sequence for T2 mapping and spin-echo echo-planar-imaging magnetic resonance elastography (MRE) sequences were analyzed.

View Article and Find Full Text PDF

This work presents a nonlinear ultrasonic (NLU) technique called sideband peak intensity (SPI) combining an improved pulse-echo (PE) experimental method for online detection and evaluation of fatigue cracks at their early stages. Advantages of the proposed technique are that it enjoys the high sensitivity and ease of application of NLU SPI technique and easy implementation of the PE experimental method. The PE experimental method is improved by adopting frequency-mismatched excitations to enhance the sensitivity and robustness of the SPI technique.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!