Human Versus Machine: Comparing a Deep Learning Algorithm to Human Gradings for Detecting Glaucoma on Fundus Photographs.

Am J Ophthalmol

Vision, Imaging and Performance Laboratory (VIP), Duke Eye Center and Department of Ophthalmology, Duke University, Durham, North Carolina, USA.

Published: March 2020

Purpose: To compare the diagnostic performance of human gradings vs predictions provided by a machine-to-machine (M2M) deep learning (DL) algorithm trained to quantify retinal nerve fiber layer (RNFL) damage on fundus photographs.

Design: Evaluation of a machine learning algorithm.

Methods: An M2M DL algorithm trained with RNFL thickness parameters from spectral-domain optical coherence tomography was applied to a subset of 490 fundus photos of 490 eyes of 370 subjects graded by 2 glaucoma specialists for the probability of glaucomatous optic neuropathy (GON) and for estimates of cup-to-disc (C/D) ratios. Spearman correlations with standard automated perimetry (SAP) global indices were compared between the human gradings and the M2M DL-predicted RNFL thickness values. The area under the receiver operating characteristic curve (AUC) and the partial AUC for the region of clinically meaningful specificity (85%-100%) were used to compare the ability of each output to discriminate eyes with repeatable glaucomatous SAP defects from eyes with normal fields.
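The two statistics named in the Methods can be illustrated with a minimal Python sketch. This is not the authors' code: the function names (`average_ranks`, `spearman_rho`, `roc_curve`, `partial_auc`) are hypothetical, and it assumes that a higher score means more likely diseased, so a predicted RNFL thickness would need to be negated before use. The abstract does not say whether or how the partial AUC was normalized; dividing by the width of the false-positive-rate range (0.15 for 85%-100% specificity) is an assumption made here.

```python
from typing import List, Sequence, Tuple


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5


def roc_curve(scores: Sequence[float], labels: Sequence[int]) -> List[Tuple[float, float]]:
    """Return (FPR, TPR) points; assumes higher score = more likely diseased."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Consume all cases tied at this threshold before emitting a point.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points


def partial_auc(scores: Sequence[float], labels: Sequence[int],
                min_specificity: float = 0.85, normalize: bool = True) -> float:
    """Trapezoidal area under the ROC curve for specificity >= min_specificity.

    With normalize=True the area is divided by the FPR range width
    (an assumed convention, not taken from the abstract).
    """
    max_fpr = 1.0 - min_specificity
    area = 0.0
    pts = roc_curve(scores, labels)
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x0 >= max_fpr:
            break
        if x1 > max_fpr:
            # Linearly interpolate the curve at the FPR cut-off.
            y1 = y0 + (y1 - y0) * (max_fpr - x0) / (x1 - x0)
            x1 = max_fpr
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area / max_fpr if normalize else area


# Made-up toy data: 1 = repeatable SAP defect, 0 = normal field.
labels = [1, 1, 1, 0, 0, 0]
predicted_rnfl_um = [62.0, 70.0, 88.0, 85.0, 97.0, 101.0]
disease_score = [-t for t in predicted_rnfl_um]  # thinner RNFL = higher score
print(partial_auc(disease_score, labels))
```

Setting `min_specificity=0.0` integrates over the whole curve and returns the ordinary AUC, which gives a quick sanity check on the toy data.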

Results: The M2M DL-predicted RNFL thickness had a significantly stronger absolute correlation with SAP mean deviation (rho = 0.54) than the probability of GON given by human graders (rho = 0.48; P < .001). The partial AUC for the M2M DL algorithm was significantly higher than that for the probability of GON by human graders (partial AUC = 0.529 vs 0.411, respectively; P = .016).

Conclusion: An M2M DL algorithm performed as well as, if not better than, human graders at detecting eyes with repeatable glaucomatous visual field loss. This DL algorithm could potentially replace human graders in population screening efforts for glaucoma.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7073295
DOI: http://dx.doi.org/10.1016/j.ajo.2019.11.006
