Objective: Patients now have direct access to their radiology reports, which can include complex terminology and be difficult to understand. We assessed ChatGPT's ability to generate summarized MRI reports for patients with prostate cancer and evaluated physician satisfaction with the artificial intelligence (AI)-summarized report.

Methods: We used ChatGPT to summarize five full MRI reports for patients with prostate cancer performed at a single institution from 2021 to 2022. Three summarized reports were generated for each full MRI report. Full MRI and summarized reports were assessed for readability using Flesch-Kincaid Grade Level (FK) score. Radiation oncologists were asked to evaluate the AI-summarized reports via an anonymous questionnaire. Qualitative responses were given on a 1-5 Likert-type scale. Fifty newly diagnosed prostate cancer patient MRIs performed at a single institution were additionally assessed for physician online portal response rates.
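For readers unfamiliar with the readability metric, the following is a minimal sketch of how a Flesch-Kincaid Grade Level can be computed from word, sentence, and syllable counts. The abstract does not state which tool the authors used, so this is not their implementation. The formula coefficients are the standard published ones; the function names and the vowel-group syllable heuristic are illustrative assumptions, and dedicated readability tools count syllables more carefully.

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate (an assumption, not a dictionary lookup):
    count runs of consecutive vowels and drop one for a likely silent final 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith(("le", "ee")) and count > 1:
        count -= 1
    return max(count, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid Grade Level:
    0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59
    """
    # Naive sentence split on terminal punctuation; abbreviations are not handled.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59


if __name__ == "__main__":
    # Hypothetical example sentences, not taken from the study's reports.
    technical = "Multiparametric magnetic resonance imaging demonstrates extracapsular extension."
    plain = "The scan shows the cancer has grown past the edge of the prostate."
    print(round(flesch_kincaid_grade(technical), 1))
    print(round(flesch_kincaid_grade(plain), 1))
```

Lower scores correspond to text that is readable at a lower school grade level, which is the direction of change the study reports for the summarized reports.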

Results: Fifteen summarized reports were generated from five full MRI reports using ChatGPT. The median FK scores for the full MRI reports and the summarized reports were 9.6 and 5.0, respectively (p < 0.05). Twelve radiation oncologists responded to our questionnaire. The mean [SD] ratings for the summarized reports were: factual correctness (4.0 [0.6]), ease of understanding (4.0 [0.7]), completeness (4.1 [0.5]), potential for harm (3.5 [0.9]), overall quality (3.4 [0.9]), and likelihood to send to patient (3.1 [1.1]). Current physician online portal response rates were 14/50 (28%) at our institution.

Conclusions: We demonstrate a novel application of ChatGPT to summarize MRI reports at a reading level appropriate for patients. Physicians were likely to be satisfied with the summarized reports with respect to factual correctness, ease of understanding, and completeness. Physicians were less likely to be satisfied with respect to potential for harm, overall quality, and likelihood to send to patients. Further research is needed to optimize ChatGPT's ability to summarize radiology reports and to understand what factors influence physician trust in AI-summarized reports.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10734360
DOI: http://dx.doi.org/10.1177/20552076231221620
