Background: Rich data in cardiovascular diagnostic testing are often sequestered in unstructured reports, with the necessity of manual abstraction limiting their use in real-time applications in patient care and research.
Methods: We developed a two-step process that sequentially deploys generative and interpretative large language models (LLMs; Llama2 70b and Llama2 13b). Using a Llama2 70b model, we generated varying formats of transthoracic echocardiogram (TTE) reports from 3,000 real-world echo reports with paired structured elements, leveraging temporal changes in reporting formats to define the variations. Subsequently, we fine-tuned Llama2 13b using sequentially larger batches of generated echo reports as inputs, to extract data from free-text narratives across 18 clinically relevant echocardiographic fields. This was set up as a prompt-based supervised training task. We evaluated the fine-tuned Llama2 13b model, HeartDx-LM, on several distinct echocardiographic datasets: (i) reports across the different time periods and formats at Yale New Haven Health System (YNHHS), (ii) the Medical Information Mart for Intensive Care (MIMIC) III dataset, and (iii) the MIMIC IV dataset. We used the accuracy of extracted fields and Cohen's Kappa as the metrics and have publicly released the HeartDX-LM model.
Results: The HeartDX-LM model was trained on randomly selected 2,000 synthetic echo reports with varying formats and paired structured labels, with a wide range of clinical findings. We identified a lower threshold of 500 annotated reports required for fine-tuning Llama2 13b to achieve stable and consistent performance. At YNHHS, the HeartDx-LM model accurately extracted 69,144 out of 70,032 values (98.7%) across 18 clinical fields from unstructured reports in the test set from contemporary records where paired structured data were also available. In older echo reports where only unstructured reports were available, the model achieved 87.1% accuracy against expert annotations for the same 18 fields for a random sample of 100 reports. Similarly, in expert-annotated external validation sets from MIMIC-IV and MIMIC-III, HeartDx-LM correctly extracted 201 out of 220 available values (91.3%) and 615 out of 707 available values (87.9%), respectively, from 100 randomly chosen and expert annotated echo reports from each set.
Conclusion: We developed a novel method using paired large and moderate-sized LLMs to automate the extraction of unstructured echocardiographic reports into tabular datasets. Our approach represents a scalable strategy that transforms unstructured reports into computable elements that can be leveraged to improve cardiovascular care quality and enable research.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11482995 | PMC |
http://dx.doi.org/10.1101/2024.10.08.24315035 | DOI Listing |
Eur Heart J Imaging Methods Pract
July 2024
Department of Clinical Sciences Lund, Clinical Physiology, Lund University, Skane University Hospital, Entrégatan, Lund 221 85, Sweden.
Aims: Right ventricular (RV) failure causes high mortality in patients with pulmonary arterial hypertension (PAH). RV stroke work index (RVSWi) poses as a potential predictor of outcome. We evaluated how RVSWi by echocardiography (ECHO) or right heart catheterization (RHC) is altered following PAH treatment and if RVSWi is an indicator of outcome in PAH.
View Article and Find Full Text PDFLong time series of velocity profiles collected by up-looking acoustic profilers in the westernmost sill of the Strait of Gibraltar show an unexpected pattern in the deepest ∼80 m of the water column, consisting in an appreciable diurnal weakening of the measured horizontal velocity. A harmonic analysis performed on long time series reveals a surprising magnitude of S constituent (exactly 1 cpd of frequency) in the horizontal velocity and echo amplitude, which prevails over the rest of diurnal constituents within this depth range, including K, despite being around 200 times smaller than it in the tide generating potential. High resolution echograms collected by a new instrument recently installed in the mooring line, point at the diel vertical migration of living acoustic scatterers (zooplankton) as the most reasonable cause.
View Article and Find Full Text PDFBone Health ECHO (Extension for Community Healthcare Outcomes) is a virtual community of practice, where healthcare professionals have met via videoconferencing weekly since 2015. This model of learning is focused on short didactics and the presentation of real but de-identified patient cases followed by highly interactive discussions. These are often clinical situations with diagnostic and therapeutic dilemmas that are not readily addressed by randomized placebo-controlled clinical trials and clinical practice guidelines.
View Article and Find Full Text PDFEur J Radiol
December 2024
Division of Endocrinology and Metabolism, Department of Medicine III, Medical University of Vienna, Austria.
Objectives: To explore texture analysis' ability on T and T relaxation maps to classify liver fibrosis into no-to-mild liver fibrosis (nmF) versus severe fibrosis (sF) group using machine learning algorithms and histology as reference standard.
Materials And Methods: In this single-center study, patients undergoing 3 T MRI who also had histology examination were retrospectively enrolled. SNAPSHOT-FLASH sequence for T1 mapping, radial turbo-spin-echo sequence for T2 mapping and spin-echo echo-planar-imaging magnetic resonance elastography (MRE) sequences were analyzed.
Ultrasonics
December 2024
Department of Civil and Architectural Engineering and Mechanics, University of Arizona, Tucson, AZ 85721, USA.
This work presents a nonlinear ultrasonic (NLU) technique called sideband peak intensity (SPI) combining an improved pulse-echo (PE) experimental method for online detection and evaluation of fatigue cracks at their early stages. Advantages of the proposed technique are that it enjoys the high sensitivity and ease of application of NLU SPI technique and easy implementation of the PE experimental method. The PE experimental method is improved by adopting frequency-mismatched excitations to enhance the sensitivity and robustness of the SPI technique.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!