A Semiautonomous Deep Learning System to Reduce False Positives in Screening Mammography.

Radiol Artif Intell

From Whiterabbit.ai, 3930 Freedom Cir, Santa Clara, CA 95054 (S.P., T.T., B.M., Y.N.T.V., T.M., R.M.H., M.S., N.G., N.Z.D., J.S.); Onsite Women's Health, Westfield, Mass (S.H.); SSM Health, St Louis, Mo (C.M.A.); and Mallinckrodt Institute of Radiology, Washington University School of Medicine, St Louis, Mo (R.L.W.).

Published: May 2024

Purpose To evaluate the ability of a semiautonomous artificial intelligence (AI) model to identify screening mammograms not suspicious for breast cancer and reduce the number of false-positive examinations. Materials and Methods The deep learning algorithm was trained using 123 248 two-dimensional digital mammograms (6161 cancers) and a retrospective study was performed on three nonoverlapping datasets of 14 831 screening mammography examinations (1026 cancers) from two U.S. institutions and one U.K. institution (2008-2017). The stand-alone performance of humans and AI was compared. Human plus AI performance was simulated to examine reductions in the cancer detection rate, number of examinations, false-positive callbacks, and benign biopsies. Metrics were adjusted to mimic the natural distribution of a screening population, and bootstrapped CIs and values were calculated. Results Retrospective evaluation on all datasets showed minimal changes to the cancer detection rate with use of the AI device (noninferiority margin of 0.25 cancers per 1000 examinations: U.S. dataset 1, = .02; U.S. dataset 2, < .001; U.K. dataset, < .001). On U.S. dataset 1 (11 592 mammograms; 101 cancers; 3810 female patients; mean age, 57.3 years ± 10.0 [SD]), the device reduced screening examinations requiring radiologist interpretation by 41.6% (95% CI: 40.6%, 42.4%; < .001), diagnostic examinations callbacks by 31.1% (95% CI: 28.7%, 33.4%; < .001), and benign needle biopsies by 7.4% (95% CI: 4.1%, 12.4%; < .001). U.S. dataset 2 (1362 mammograms; 330 cancers; 1293 female patients; mean age, 55.4 years ± 10.5) was reduced by 19.5% (95% CI: 16.9%, 22.1%; < .001), 11.9% (95% CI: 8.6%, 15.7%; < .001), and 6.5% (95% CI: 0.0%, 19.0%; = .08), respectively. The U.K. dataset (1877 mammograms; 595 cancers; 1491 female patients; mean age, 63.5 years ± 7.1) was reduced by 36.8% (95% CI: 34.4%, 39.7%; < .001), 17.1% (95% CI: 5.9%, 30.1%: < .001), and 5.9% (95% CI: 2.9%, 11.5%; < .001), respectively. Conclusion This work demonstrates the potential of a semiautonomous breast cancer screening system to reduce false positives, unnecessary procedures, patient anxiety, and medical expenses. Artificial Intelligence, Semiautonomous Deep Learning, Breast Cancer, Screening Mammography Published under a CC BY 4.0 license.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11140506PMC
http://dx.doi.org/10.1148/ryai.230033DOI Listing

Publication Analysis

Top Keywords

deep learning
12
screening mammography
12
breast cancer
12
001 dataset
12
female patients
12
patients age
12
001
10
95%
9
semiautonomous deep
8
system reduce
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!