Objectives: To evaluate an artificial intelligence (AI)-assisted double reading system for detecting clinically relevant missed findings on routinely reported chest radiographs.

Methods: A retrospective study was performed in two institutions, a secondary care hospital and tertiary referral oncology centre. Commercially available AI software performed a comparative analysis of chest radiographs and radiologists' authorised reports using a deep learning and natural language processing algorithm, respectively. The AI-detected discrepant findings between images and reports were assessed for clinical relevance by an external radiologist, as part of the commercial service provided by the AI vendor. The selected missed findings were subsequently returned to the institution's radiologist for final review.

Results: In total, 25,104 chest radiographs of 21,039 patients (mean age 61.1 years ± 16.2 [SD]; 10,436 men) were included. The AI software detected discrepancies between imaging and reports in 21.1% (5289 of 25,104). After review by the external radiologist, 0.9% (47 of 5289) of cases were deemed to contain clinically relevant missed findings. The institution's radiologists confirmed 35 of 47 missed findings (74.5%) as clinically relevant (0.1% of all cases). Missed findings consisted of lung nodules (71.4%, 25 of 35), pneumothoraces (17.1%, 6 of 35) and consolidations (11.4%, 4 of 35).

Conclusion: The AI-assisted double reading system was able to identify missed findings on chest radiographs after report authorisation. The approach required an external radiologist to review the AI-detected discrepancies. The number of clinically relevant missed findings by radiologists was very low.

Clinical Relevance Statement: The AI-assisted double reader workflow was shown to detect diagnostic errors and could be applied as a quality assurance tool. Although clinically relevant missed findings were rare, there is potential impact given the common use of chest radiography.

Key Points: • A commercially available double reading system supported by artificial intelligence was evaluated to detect reporting errors in chest radiographs (n=25,104) from two institutions. • Clinically relevant missed findings were found in 0.1% of chest radiographs and consisted of unreported lung nodules, pneumothoraces and consolidations. • Applying AI software as a secondary reader after report authorisation can assist in reducing diagnostic errors without interrupting the radiologist's reading workflow. However, the number of AI-detected discrepancies was considerable and required review by a radiologist to assess their relevance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11364654PMC
http://dx.doi.org/10.1007/s00330-024-10676-wDOI Listing

Publication Analysis

Top Keywords

missed findings
40
clinically relevant
28
chest radiographs
24
relevant missed
24
double reading
16
ai-assisted double
12
reading system
12
external radiologist
12
findings
11
missed
10

Similar Publications

Background/purpose: Although metabolic dysfunction-associated steatotic liver disease (MASLD) has been proposed to replace the diagnosis of non-alcoholic fatty liver disease (NAFLD) with new diagnostic criteria since 2023, the genetic predisposition of MASLD remains to be explored.

Methods: Participants with data of genome-wide association studies (GWAS) in the Taiwan Biobank database were collected. Patients with missing data, positive for HBsAg, anti-HCV, and alcohol drinking history were excluded.

View Article and Find Full Text PDF

Prediction of pulmonary embolism by an explainable machine learning approach in the real world.

Sci Rep

January 2025

Department of Respiratory and Critical Care Medicine, Changhai Hospital, The Second Military Medical University, Shanghai, People's Republic of China.

In recent years, large amounts of researches showed that pulmonary embolism (PE) has become a common disease, and PE remains a clinical challenge because of its high mortality, high disability, high missed and high misdiagnosed rates. To address this, we employed an artificial intelligence-based machine learning algorithm (MLA) to construct a robust predictive model for PE. We retrospectively analyzed 1480 suspected PE patients hospitalized in West China Hospital of Sichuan University between May 2015 and April 2020.

View Article and Find Full Text PDF

Blood-based diagnosis of pediatric tuberculosis: a prospective cohort study in South Africa and Dominican Republic.

J Infect

January 2025

Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, New Orleans, LA, USA. Electronic address:

Objectives: Pediatric tuberculosis (TB) diagnosis is complicated by challenges in obtaining invasive respiratory specimens that frequently contain few Mycobacterium tuberculosis (Mtb) bacilli. We report the diagnostic performance of an Mtb antigen-derived peptide (MAP-TB) assay and its ability to monitor TB treatment response.

Methods: Study cohorts enrolled children who presented with presumptive TB at two hospitals in South Africa from 2012 to 2017 (157 children aged <13 years) and at community-based clinics in the Dominican Republic from 2019 to 2023 (101 children aged <18 years).

View Article and Find Full Text PDF

Introduction: A significant proportion of newly diagnosed prostate cancer (PCa) cases are slow growing with a low risk of metastatic progression. There is a lack of data concerning the optimal biopsy regimen for improving diagnosis yield in PI-RADS3 lesions. This study aimed to assess the diagnostic value of current biopsy regimens in PI-RADS 3 lesions and identify clinical predictors to improve clinically significant PCa (csPCa) detection.

View Article and Find Full Text PDF

Background: Ethnic diversity in cancer clinical trials is essential to ensure that therapeutic advances are equitable and broadly applicable in multicultural societies. Yet, missing consensus on the documentation of ethnic origin, partially based on the complexity of the terminology and fear of discrimination, leads to suboptimal patient management of minority populations. Additionally, eligibility criteria, such as stringent laboratory cut-offs, often fail to account for variations across ethnic groups, potentially excluding patients without evidence-based justification.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!