Assessing Laterality Errors in Radiology: Comparing Generative Artificial Intelligence and Natural Language Processing.

J Am Coll Radiol

Research Fellow, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts; Chief Data Science Officer, Mass General Brigham AI, Boston, Massachusetts; ACR DSI Chief Science Officer; Chief Imaging Information, Mass General Brigham; Vice Chairman of Radiology-Informatics, Massachusetts General Hospital and Brigham and Women's Hospital; and Co-Chair, Mass General Brigham AI Imaging AI Governance Committee.

Published: October 2024

Purpose: We compared the performance of generative artificial intelligence (AI) (Augmented Transformer Assisted Radiology Intelligence [ATARI, Microsoft Nuance, Microsoft Corporation, Redmond, Washington]) and natural language processing (NLP) tools for identifying laterality errors in radiology reports and images.

Methods: We used an NLP-based tool (mPower, Microsoft Nuance) to identify radiology reports flagged for laterality errors in its Quality Assurance Dashboard. The NLP model detects and highlights laterality mismatches in radiology reports. From an initial pool of 1,124 radiology reports flagged by the NLP tool for laterality errors, we selected and evaluated 898 reports spanning radiography, CT, MRI, and ultrasound to ensure comprehensive coverage. A radiologist reviewed each radiology report to assess whether the flagged laterality error was present (reporting error, true-positive) or absent (NLP error, false-positive). Next, we applied ATARI to 237 radiology reports and images with consecutive NLP true-positive (118 reports) and false-positive (119 reports) laterality errors. We estimated the accuracy of the NLP and generative AI tools in identifying laterality errors overall and by modality.
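To make the idea of a flagged laterality mismatch concrete, the following is a minimal rule-based sketch in Python. It is not the mPower algorithm, which the abstract does not describe; the function names and the keyword rule are hypothetical and chosen only for illustration. It assumes a mismatch means that the exam description names exactly one side while the report body mentions the opposite side.

```python
import re

# Hypothetical toy rule, NOT the mPower method: match the words "left"/"right".
SIDE_PATTERN = re.compile(r"\b(left|right)\b", re.IGNORECASE)


def sides_mentioned(text):
    """Return the set of sides ('left', 'right') named in a block of text."""
    return {match.group(1).lower() for match in SIDE_PATTERN.finditer(text)}


def flag_laterality_mismatch(exam_description, report_body):
    """Flag a report when the exam names one side and the body names the other.

    Returns False when the exam description names no side or both sides,
    because there is then no single reference side to compare against.
    """
    exam_sides = sides_mentioned(exam_description)
    if len(exam_sides) != 1:
        return False
    (exam_side,) = exam_sides
    opposite = "right" if exam_side == "left" else "left"
    return opposite in sides_mentioned(report_body)


if __name__ == "__main__":
    # A mismatch that a reviewer would likely call a reporting error.
    print(flag_laterality_mismatch("XR left knee", "No fracture of the right knee."))
    # A flag raised only because a comparison study on the other side is cited.
    print(flag_laterality_mismatch(
        "XR left knee",
        "Left knee effusion. Comparison: right knee radiograph.",
    ))
```

Both calls in the usage example are flagged, although only the first describes a likely reporting error. A keyword rule of this kind cannot tell the two apart (and would also match "left" used as a verb), which illustrates why each flagged report in the study was reviewed by a radiologist.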

Results: Among the 898 NLP-flagged laterality errors, 64% (574 of 898) were NLP errors (false-positives) and 36% (324 of 898) were reporting errors (true-positives). The text query ATARI feature correctly identified the absence of laterality mismatch (NLP false-positives) with a 97.4% accuracy (115 of 118 reports; 95% confidence interval [CI] = 96.5%-98.3%). Combined vision and text query resulted in 98.3% accuracy (116 of 118 reports or images; 95% CI = 97.6%-99.0%), and vision query alone had a 98.3% accuracy (116 of 118 images; 95% CI = 97.6%-99.0%).
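The accuracies above are simple proportions (correct reports divided by reports assessed). The sketch below recomputes them from the reported counts and attaches a Wilson score interval. The Wilson method is an assumption made here for illustration: the abstract does not state how its confidence intervals were derived, so the output need not reproduce the published intervals. The function names are hypothetical.

```python
import math


def accuracy(correct, total):
    """Proportion of correctly classified reports."""
    if total <= 0:
        raise ValueError("total must be positive")
    return correct / total


def wilson_interval(correct, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for about 95%).

    Assumption: the study's interval method is not given in the abstract;
    Wilson is used here only as a common choice for proportions near 100%.
    """
    p = accuracy(correct, total)
    z2 = z * z
    denom = 1 + z2 / total
    centre = (p + z2 / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z2 / (4 * total**2)) / denom
    return centre - half_width, centre + half_width


# Counts as reported in the Results paragraph.
reported = {
    "text query": (115, 118),
    "combined vision and text query": (116, 118),
    "query alone (images)": (116, 118),
}

if __name__ == "__main__":
    for name, (correct, total) in reported.items():
        low, high = wilson_interval(correct, total)
        print(f"{name}: {accuracy(correct, total):.2%} "
              f"(Wilson 95% CI {low:.1%} to {high:.1%})")
    # Share of NLP flags judged to be NLP errors vs. reporting errors.
    print(f"NLP errors: {accuracy(574, 898):.1%}, "
          f"reporting errors: {accuracy(324, 898):.1%}")
```

The Wilson interval is chosen over the simpler normal approximation because the latter can extend past 100% when the observed proportion is close to 1 and the sample is small.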

Conclusion: The generative AI-empowered ATARI prototype outperformed the assessed NLP tool for determining true and false laterality errors in radiology reports while enabling an image-based laterality determination. Errors in the ATARI text query on complex radiology reports underscore the need for further improvement of the technology.

Source
http://dx.doi.org/10.1016/j.jacr.2024.06.014
