Purpose: To investigate if GPT-4 improves the accuracy, consistency, and trustworthiness of a context-aware chatbot to provide personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing: In addition, we sought to enable auditability of the output by revealing the information source the decision relies on.
Material And Methods: We refined an existing chatbot that incorporated specialized knowledge of the ACR guidelines by upgrading GPT-3.5-Turbo to its successor GPT-4 by OpenAI, using the latest version of LlamaIndex, and improving the prompting strategy. This chatbot was compared to the previous version, generic GPT-3.5-Turbo and GPT-4, and general radiologists regarding the performance in applying the ACR appropriateness guidelines.
Results: The refined context-aware chatbot performed superior to the previous version using GPT-3.5-Turbo, generic chatbots GPT-3.5-Turbo and GPT-4, and general radiologists in providing "usually or may be appropriate" recommendations according to the ACR guidelines (all p < 0.001). It also outperformed GPT-3.5-Turbo and general radiologists in respect to "usually appropriate" recommendations (both p < 0.001). Moreover, the consistency in correct answers was higher with 78 % consistent correct "usually appropriate" answers and 94 % for "usually or may be appropriate" recommendations. In all cases, the same source documents were chosen, ensuring transparency.
Conclusion: Our study demonstrates the significance of context awareness in ensuring the use of appropriate knowledge and proposes a strategy to enhance trust in chatbot-based outputs to provide transparency. The improvements in accuracy, consistency, and source transparency address trust issues and enhance the clinical decision support process.
Abbreviations: ACR, American College of Radiology; accGPT, appropriateness criteria context aware GPT; accGPT-4, appropriateness criteria context aware GPT using GPT-4; GPT, generative pre-trained transformer; LLM, Large Language Model.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.ejrad.2024.111756 | DOI Listing |
BMC Musculoskelet Disord
January 2025
Pain Medicine Section, Anesthesiology Dept, Hospital Clinic de Barcelona, Barcelona, Spain.
Background: Multidisciplinary programs are the first recommendation for non-specific chronic low-back pain, but implementing this type of program is complicated to get up and running. The primary aim of this study was to assess the feasibility and appropriateness of the PAINDOC multidisciplinary program for subjects with chronic low-back pain. The secondary objectives were to evaluate the decrease in pain intensity, pain-related disability and pain catastrophizing, as well as the improvement in quality of life with this program.
View Article and Find Full Text PDFBMC Pediatr
January 2025
Department of Neonatology Nursing, West China Second University Hospital, Sichuan University, No. 20, Section 3, South Renmin Road, Chengdu, Sichuan Province, China.
Background: Current treatment of giant omphalocele in newborns is not standardized. The main treatments include one-time repair and staged surgery using synthetic and biologic mesh, or silos. However, surgery can lead to various postoperative complications.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Neurosurgery, University Hospital Tübingen, Tübingen, Germany.
To compare 1D (linear) tumor volume calculations and classification systems with 3D-segmented volumetric analysis (SVA), focusing specifically on their effectiveness in the evaluation and management of NF2-associated vestibular schwannomas (VS). VS were clinically followed every 6 months with cranial, thin-sliced (< 3 mm) MRI. We retrospectively reviewed and used T1-weighted post-contrast enhanced (gadolinium) images for both SVA and linear measurements.
View Article and Find Full Text PDFBMJ Open Ophthalmol
January 2025
Ophthalmology & Vision Sciences, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada.
Dual inhibition of the angiopoietin (Ang)/Tie and vascular endothelial growth factor (VEGF) signalling pathways in patients with retinal diseases, such as neovascular age-related macular degeneration (nAMD) and diabetic macular oedema (DME), may induce greater vascular stability and contribute to increased treatment efficacy and durability compared with treatments that only target the VEGF pathway. Faricimab, a bispecific intravitreal agent that inhibits both VEGF and Ang-2, is the first injectable ophthalmic drug to achieve treatment intervals of up to 16 weeks in Phase 3 studies for nAMD and DME while exhibiting improvements in visual acuity and retinal thickness. Data from real-world studies have supported the safety, visual and anatomic benefits and durability of faricimab, even in patients who were previously treated with other intravitreal agents.
View Article and Find Full Text PDFJ Cardiovasc Magn Reson
January 2025
Department of Cardiology, Angiology and Intensive Care Medicine, Deutsches Herzzentrum der Charité, Augustenburger Platz 1, 13353 Berlin, Germany; Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
Background And Aims: Heart failure (HF) is an imminent global health problem. Yet established screening algorithms for asymptomatic pre-HF, allowing for early and effective preventive interventions, are largely lacking. The HERZCHECK trial, conducted in structurally underserved rural regions of North-Eastern Germany, aims to close this gap by evaluating the feasibility, diagnostic efficacy, and cost-effectiveness of a fully mobile, telemedically-supervised screening approach, combining cardiac magnetic resonance imaging (CMR) and laboratory testing as central elements.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!