Objectives: This study investigated the impact of human-large language model (LLM) collaboration on the accuracy and efficiency of brain MRI differential diagnosis.

Materials And Methods: In this retrospective study, forty brain MRI cases with a challenging but definitive diagnosis were randomized into two groups of twenty cases each. Six radiology residents with an average experience of 6.3 months in reading brain MRI exams evaluated one set of cases supported by conventional internet search (Conventional) and the other set utilizing an LLM-based search engine and hybrid chatbot. A cross-over design ensured that each case was examined with both workflows in equal frequency. For each case, readers were instructed to determine the three most likely differential diagnoses. LLM responses were analyzed by a panel of radiologists. Benefits and challenges in human-LLM interaction were derived from observations and participant feedback.

Results: LLM-assisted brain MRI differential diagnosis yielded superior accuracy (70/114; 61.4% (LLM-assisted) vs 53/114; 46.5% (conventional) correct diagnoses, p = 0.033, chi-square test). No difference in interpretation time or level of confidence was observed. An analysis of LLM responses revealed that correct LLM suggestions translated into correct reader responses in 82.1% of cases (60/73). Inaccurate case descriptions by readers (9.2% of cases), LLM hallucinations (11.5% of cases), and insufficient contextualization of LLM responses were identified as challenges related to human-LLM interaction.

Conclusion: Human-LLM collaboration has the potential to improve brain MRI differential diagnosis. Yet, several challenges must be addressed to ensure effective adoption and user acceptance.

Key Points: Question While large language models (LLM) have the potential to support radiological differential diagnosis, the role of human-LLM collaboration in this context remains underexplored. Findings LLM-assisted brain MRI differential diagnosis yielded superior accuracy over conventional internet search. Inaccurate case descriptions, LLM hallucinations, and insufficient contextualization were identified as potential challenges. Clinical relevance Our results highlight the potential of an LLM-assisted workflow to increase diagnostic accuracy but underline the necessity to study collaborative efforts between humans and LLMs over LLMs in isolation.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00330-025-11484-6DOI Listing

Publication Analysis

Top Keywords

brain mri
28
mri differential
20
differential diagnosis
20
llm responses
12
large language
8
llm
8
conventional internet
8
internet search
8
challenges human-llm
8
llm-assisted brain
8

Similar Publications

Denoising complex-valued diffusion MR images using a two-step, nonlocal principal component analysis approach.

Magn Reson Med

March 2025

Center for Magnetic Resonance Research, Radiology, Medical School, University of Minnesota, Minneapolis, Minnesota, USA.

Purpose: To propose a two-step, nonlocal principal component analysis (PCA) method and demonstrate its utility for denoising complex diffusion MR images with a few diffusion directions.

Methods: A two-step denoising pipeline was implemented to ensure accurate patch selection even with high noise levels and was coupled with data preprocessing for g-factor normalization and phase stabilization before data denoising with a nonlocal PCA algorithm. At the heart of our proposed pipeline was the use of a data-driven optimal shrinkage algorithm to manipulate the singular values in a way that would optimally estimate the noise-free signal.

View Article and Find Full Text PDF

Introduction: A better understanding of who will develop dementia can inform patient care. Although MRI offers prognostic insights, access is limited globally, whereas CT-imaging is readily available in acute stroke. We explored the prognostic utility of acute CT-imaging for predicting dementia.

View Article and Find Full Text PDF

The brain's complex functionality emerges from network interactions that go beyond dyadic connections, with higher-order interactions significantly contributing to this complexity. Homotopic functional connectivity (HoFC) is a key neurophysiological characteristic of the human brain, reflecting synchronized activity between corresponding regions in the brain's hemispheres. Using resting-state functional magnetic resonance imaging data from the Human Connectome Project, we evaluate dyadic and higher-order interactions of three functional connectivity (FC) parameterizations-bivariate correlation, partial correlation, and tangent space embedding-in their effectiveness at capturing HoFC through the inter-hemispheric analogy test.

View Article and Find Full Text PDF

Objectives: Naldemedine is a peripherally acting μ-opioid receptor antagonist used to treat opioid-induced constipation. As this drug does not cross the blood-brain barrier, it is believed that patients without brain metastases do not experience opioid withdrawal symptoms.

Methods: Here, we experienced a case in which a cancer patient without brain metastasis presented with anxiety and restlessness that was severe enough to interfere with daily life.

View Article and Find Full Text PDF

Introduction: A full understanding of how we see our world remains a fundamental research question in vision neuroscience. While topographic profiling has allowed us to identify different visual areas, the exact functional characteristics and organization of areas up in the visual hierarchy (beyond V1 & V2) is still debated. It is hypothesized that visual area V4 represents a vital intermediate stage of processing spatial and curvature information preceding object recognition.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!