Artificial intelligence (AI) algorithms have been retrospectively evaluated as replacement for one radiologist in screening mammography double-reading; however, methods for resolving discordance between radiologists and AI in the absence of 'real-world' arbitration may underestimate cancer detection rate (CDR) and recall. In 108,970 consecutive screens from a population screening program (BreastScreen WA, Western Australia), 20,120 were radiologist/AI discordant without real-world arbitration. Recall probabilities were randomly assigned for these screens in 1000 simulations. Recall thresholds for screen-detected and interval cancers (sensitivity) and no cancer (false-positive proportion, FPP) were varied to calculate mean CDR and recall rate for the entire cohort. Assuming 100% sensitivity, the maximum CDR was 7.30 per 1000 screens. To achieve >95% probability that the mean CDR exceeded the screening program CDR (6.97 per 1000), interval cancer sensitivities ≥63% (at 100% screen-detected sensitivity) and ≥91% (at 80% screen-detected sensitivity) were required. Mean recall rate was relatively constant across sensitivity assumptions, but varied by FPP. FPP > 6.5% resulted in recall rates that exceeded the program estimate (3.38%). CDR improvements depend on a majority of interval cancers being detected in radiologist/AI discordant screens. Such improvements are likely to increase recall, requiring careful monitoring where AI is deployed for screen-reading.

Download full-text PDF

Source
http://dx.doi.org/10.1177/09691413241262960DOI Listing

Publication Analysis

Top Keywords

discordance radiologists
8
artificial intelligence
8
cdr recall
8
screening program
8
radiologist/ai discordant
8
interval cancers
8
recall rate
8
screen-detected sensitivity
8
recall
7
cdr
6

Similar Publications

Importance: Radiotherapy (RT) plan quality is an established predictive factor associated with cancer recurrence and survival outcomes. The addition of radiologists to the peer review (PR) process may increase RT plan quality.

Objective: To determine the rate of changes to the RT plan with and without radiology involvement in PR of radiation targets.

View Article and Find Full Text PDF

Purpose: To evaluate correlation between terminal ileal (TI) stricture diagnosis at MR enterography (MRE) and ileocolonoscopy (IC) in patients with Crohn's disease (CD).

Methods: One hundred and four patients with CD (51% females; 41 ± 15 years) underwent IC and MRE within 3 months in this retrospective case-control study. Positive cases had TI strictures diagnosed by endoscopy (n = 35); or MRE (threshold small bowel dilation ≥ 3cm; n = 34).

View Article and Find Full Text PDF

Nonmass Lesions on Breast Ultrasound: Interreader Agreement and Associations With Malignancy.

AJR Am J Roentgenol

December 2024

Department of Radiology, Research Institute of Radiology, Asan Medical Center, College of Medicine, University of Ulsan, Seoul, Republic of Korea.

Nonmass lesions (NMLs) on breast ultrasound lack clear definition and encompass a broad range of benign and malignant entities. Given anticipated inclusion of NMLs in the BI-RADS 6th edition, thorough understanding of these lesions will be critical for optimal management. To evaluate interreader agreement for classification of lesions on breast ultrasound as NMLs and to identify imaging features associated with malignancy in these lesions.

View Article and Find Full Text PDF

The purpose of this study was to determine the necessity and cost-effectiveness of radiologists' interpretation of plain hand radiographs for diagnosing and managing different hand pathologies in the plastic surgery outpatient clinic setting. Through a retrospective cohort study, we identified new patient encounters from January 2021 to December 2022 in an outpatient hand clinic. We included patients with radiology reports that were submitted subsequent to the surgeon's consult note in clinic.

View Article and Find Full Text PDF

Objectives: We aim to evaluate the accuracy of radiologists and radiology residents in the detection of paediatric appendicular fractures with and without the help of a commercially available fracture detection artificial intelligence (AI) solution in the hopes of showing potential clinical benefits in a general hospital setting.

Methods: This was a retrospective study involving three associate consultants (AC) and three senior residents (SR) in radiology, who acted as readers. One reader from each human group interpreted the radiographs with the aid of AI.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!