AI Article Synopsis

  • The study aimed to evaluate and compare the effectiveness of a deep learning algorithm (DLA) in detecting prostate lesions and classifying them using the PI-RADS system against clinical reports and radiologists with varying experience levels.
  • A total of 121 patients undergoing MRI and biopsy were reviewed by different groups of radiologists, alongside the DLA results, with the assessment of performance based on AUROC values, sensitivity, and specificity.
  • The DLA displayed moderate diagnostic accuracy, performing better than less experienced radiologists but not as well as expert radiologists, and its effectiveness was comparable to clinical reports.

Article Abstract

Purpose: To compare the performance of lesion detection and Prostate Imaging-Reporting and Data System (PI-RADS) classification between a deep learning-based algorithm (DLA), clinical reports and radiologists with different levels of experience in prostate MRI.

Methods: This retrospective study included 121 patients who underwent prebiopsy MRI and prostate biopsy. More than five radiologists (Reader groups 1, 2: residents; Readers 3, 4: less-experienced radiologists; Reader 5: expert) independently reviewed biparametric MRI (bpMRI). The DLA results were obtained using bpMRI. The reference standard was based on pathologic reports. The diagnostic performance of the PI-RADS classification of DLA, clinical reports, and radiologists was analyzed using AUROC. Dichotomous analysis (PI-RADS cutoff value ≥ 3 or 4) was performed, and the sensitivities and specificities were compared using McNemar's test.

Results: Clinically significant cancer [CSC, Gleason score ≥ 7] was confirmed in 43 patients (35.5%). The AUROC of the DLA (0.828) for diagnosing CSC was significantly higher than that of Reader 1 (AUROC, 0.706; p = 0.011), significantly lower than that of Reader 5 (AUROC, 0.914; p = 0.013), and similar to clinical reports and other readers (p = 0.060-0.661). The sensitivity of DLA (76.7%) was comparable to those of all readers and the clinical reports at a PI-RADS cutoff value ≥ 4. The specificity of the DLA (85.9%) was significantly higher than those of clinical reports and Readers 2-3 and comparable to all others at a PI-RADS cutoff value ≥ 4.

Conclusions: The DLA showed moderate diagnostic performance at a level between those of residents and an expert in detecting and classifying according to PI-RADS. The performance of DLA was similar to that of clinical reports from various radiologists in clinical practice.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ejrad.2021.109894DOI Listing

Publication Analysis

Top Keywords

clinical reports
24
pi-rads classification
12
dla clinical
12
reports radiologists
12
pi-rads cutoff
12
dla
9
deep learning-based
8
learning-based algorithm
8
algorithm dla
8
radiologists levels
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!