Purpose: To understand the impact of deep learning diabetic retinopathy (DR) algorithms on physician readers in computer-assisted settings.

Design: Evaluation of diagnostic technology.

Participants: One thousand seven hundred ninety-six retinal fundus images from 1612 diabetic patients.

Methods: Ten ophthalmologists (5 general ophthalmologists, 4 retina specialists, 1 retina fellow) read images for DR severity based on the International Clinical Diabetic Retinopathy disease severity scale in each of 3 conditions: unassisted, grades only, or grades plus heatmap. Grades-only assistance comprised a histogram of DR predictions (grades) from a trained deep-learning model. For grades plus heatmap, we additionally showed explanatory heatmaps.

Main Outcome Measures: For each experiment arm, we computed sensitivity and specificity of each reader and the algorithm for different levels of DR severity against an adjudicated reference standard. We also measured accuracy (exact 5-class level agreement and Cohen's quadratically weighted κ), reader-reported confidence (5-point Likert scale), and grading time.

Results: Readers graded more accurately with model assistance than without for the grades-only condition (P < 0.001). Grades plus heatmaps improved accuracy for patients with DR (P < 0.001), but reduced accuracy for patients without DR (P = 0.006). Both forms of assistance increased readers' sensitivity moderate-or-worse DR: unassisted: mean, 79.4% [95% confidence interval (CI), 72.3%-86.5%]; grades only: mean, 87.5% [95% CI, 85.1%-89.9%]; grades plus heatmap: mean, 88.7% [95% CI, 84.9%-92.5%] without a corresponding drop in specificity (unassisted: mean, 96.6% [95% CI, 95.9%-97.4%]; grades only: mean, 96.1% [95% CI, 95.5%-96.7%]; grades plus heatmap: mean, 95.5% [95% CI, 94.8%-96.1%]). Algorithmic assistance increased the accuracy of retina specialists above that of the unassisted reader or model alone; and increased grading confidence and grading time across all readers. For most cases, grades plus heatmap was only as effective as grades only. Over the course of the experiment, grading time decreased across all conditions, although most sharply for grades plus heatmap.

Conclusions: Deep learning algorithms can improve the accuracy of, and confidence in, DR diagnosis in an assisted read setting. They also may increase grading time, although these effects may be ameliorated with experience.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ophtha.2018.11.016DOI Listing

Publication Analysis

Top Keywords

grades heatmap
20
deep learning
12
diabetic retinopathy
12
grades
12
grading time
12
retina specialists
8
accuracy patients
8
assistance increased
8
grading
6
[95%
6

Similar Publications

Historical redlining and clustering of present-day breast cancer factors.

Cancer Causes Control

January 2025

Department of Epidemiology and Environmental Health, School of Public Health and Health Professions, State University of New York at Buffalo, 265 Farber Hall, Buffalo, NY, 14214, USA.

Purpose: Historical redlining, a 1930s-era form of residential segregation and proxy of structural racism, has been associated with breast cancer risk, stage, and survival, but research is lacking on how known present-day breast cancer risk factors are related to historical redlining. We aimed to describe the clustering of present-day neighborhood-level breast cancer risk factors with historical redlining and evaluate geographic patterning across the US.

Methods: This ecologic study included US neighborhoods (census tracts) with Home Owners' Loan Corporation (HOLC) grades, defined as having a score in the Historic Redlining Score dataset; 2019 Population Level Analysis and Community EStimates (PLACES) data; and 2014-2016 Environmental Justice Index (EJI) data.

View Article and Find Full Text PDF

Weakly supervised deep learning-based classification for histopathology of gliomas: a single center experience.

Sci Rep

January 2025

Department of Neurosurgery, West China Hospital, Sichuan University, 37 Guoxue Avenue, Chengdu, 610041, People's Republic of China.

Multiple artificial intelligence systems have been created to facilitate accurate and prompt histopathological diagnosis of tumors using hematoxylin-eosin-stained slides. We aimed to investigate whether weakly supervised deep learning can aid in glioma diagnosis. We analyzed 472 whole slide images (WSIs) from 226 patients in West China Hospital (WCH) and 1604 WSIs from 880 patients in The Cancer Genome Atlas (TCGA).

View Article and Find Full Text PDF

Sepsis, characterized by a widespread and dysregulated immune response to infection leading to organ dysfunction, presents significant challenges in diagnosis and treatment. In this study, we investigated 203 coagulation-related genes in sepsis patients to explore their roles in the disease. Through differential gene expression analysis, we identified 20 genes with altered expression patterns.

View Article and Find Full Text PDF
Article Synopsis
  • Identifying early-stage mycosis fungoides (MF), a type of skin cancer, is hard because it looks a lot like harmless skin conditions.
  • Researchers are using deep learning (DL), a type of computer technology, to help doctors tell the difference between MF and these benign conditions by looking at images from skin biopsies.
  • The study showed that this DL method can get pretty close to the accuracy of expert doctors, which is promising for improving cancer diagnoses in the future.
View Article and Find Full Text PDF

Quickly diagnosing Bietti crystalline dystrophy with deep learning.

iScience

September 2024

Beijing Tongren Eye Center, Beijing Key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology & Visual Sciences Key Lab, Medical Artificial Intelligence Research and Verification Key Laboratory of the Ministry of Industry and Information Technology, Beijing Tongren Hospital, Capital Medical University, Beijing, China.

Article Synopsis
  • Bietti crystalline dystrophy (BCD) is a challenging inherited retinal disease that requires early diagnosis, which this study aims to improve through deep learning techniques.
  • The research involves labeling ultra-wide-field color fundus photographs to classify images as BCD, retinitis pigmentosa, or normal, and further categorizing BCD patients into three clinical stages.
  • Three deep learning models (ResNeXt, Wide ResNet, and ResNeSt) were evaluated for their diagnostic accuracy and effectiveness, resulting in the creation of a significant BCD database for the Chinese population and a promising automated diagnosis method for future clinical use.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!