A multi-label transformer-based deep learning approach to predict focal visual field progression.

Graefes Arch Clin Exp Ophthalmol

Department of Ophthalmology, Tri-Service General Hospital, National Defense Medical Center, No.325, Sec.2, Chenggong Rd., Neihu District, Taipei, Taiwan.

Published: July 2024

Purpose: Tracking functional changes in visual fields (VFs) through standard automated perimetry remains a clinical standard for glaucoma diagnosis. This study aims to develop and evaluate a deep learning (DL) model to predict regional VF progression, which has not been explored in prior studies.

Methods: The study included 2430 eyes of 1283 patients with four or more consecutive VF examinations from the baseline. A multi-label transformer-based network (MTN) using longitudinal VF data was developed to predict progression in six VF regions mapped to the optic disc. Progression was defined using the mean deviation (MD) slope and calculated for all six VF regions, referred to as clusters. Separate MTN models, trained for focal progression detection and forecasting on various numbers of VFs as model input, were tested on a held-out test set.

Results: The MTNs overall demonstrated excellent macro-average AUCs above 0.884 in detecting focal VF progression given five or more VFs. With a minimum of 6 VFs, the model demonstrated superior and more stable overall and per-cluster performance, compared to 5 VFs. The MTN given 6 VFs achieved a macro-average AUC of 0.848 for forecasting progression across 8 VF tests. The MTN also achieved excellent performance (AUCs ≥ 0.86, 1.0 sensitivity, and specificity ≥ 0.70) in four out of six clusters for the eyes already with severe VF loss (baseline MD ≤  - 12 dB).

Conclusion: The high prediction accuracy suggested that multi-label DL networks trained with longitudinal VF results may assist in identifying and forecasting progression in VF regions.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00417-024-06393-1DOI Listing

Publication Analysis

Top Keywords

multi-label transformer-based
8
deep learning
8
progression
8
progression regions
8
focal progression
8
vfs model
8
forecasting progression
8
vfs
6
transformer-based deep
4
learning approach
4

Similar Publications

Multi-label text classification via secondary use of large clinical real-world data sets.

Sci Rep

November 2024

Research Unit for Digital Surgery, Division of Plastic, Aesthetic and Reconstructive Surgery, Department of Surgery, Medical University of Graz, Auenbruggerplatz 29/4, 8036, Graz, Austria.

Article Synopsis
  • Procedural coding is a significant challenge for clinicians, but advancements in natural language processing (NLP) can help create applications that ease their administrative tasks by predicting procedure codes from operative notes.
  • This study focused on adapting existing German medical BERT models to perform multi-label classification for predicting procedure codes, using approximately 350,000 coded surgery notes to train various models, including support vector machines and fastText.
  • The results indicated that support vector machines generally outperformed other models for longer reports, while fastText showed comparable performance to BERT models and was easier to explain, highlighting the potential for effective coding support applications.
View Article and Find Full Text PDF
Article Synopsis
  • Search engines have become the go-to resource for people seeking information on various topics, with "How To" searches being particularly popular for finding step-by-step solutions to problems.
  • This study focuses on "How To" articles from wikiHow, using a dataset of over 11,000 entries to identify multiple instructional categories through advanced deep learning models like XLNet and BERT.
  • The XLNet architecture achieved remarkable results, with an accuracy of 97.30% and high macro average scores, highlighting its effectiveness in classifying multi-label instructions within their proposed 'InstructNet' framework.
View Article and Find Full Text PDF

Deep learning approaches for multi-label Chest X-ray (CXR) images classification usually require large-scale datasets. However, acquiring such datasets with full annotations is costly, time-consuming, and prone to noisy labels. Therefore, we introduce a weakly supervised learning problem called Single Positive Multi-label Learning (SPML) into CXR images classification (abbreviated as SPML-CXR), in which only one positive label is annotated per image.

View Article and Find Full Text PDF

Multi-modality multi-label ocular abnormalities detection with transformer-based semantic dictionary learning.

Med Biol Eng Comput

November 2024

Institut de Chimie Moléculaire de l'Université de Bourgogne, ICMUB UMR CNRS 6302, Université de Bourgogne, Dijon, France.

Blindness is preventable by early detection of ocular abnormalities. Computer-aided diagnosis for ocular abnormalities is built by analyzing retinal imaging modalities, for instance, Color Fundus Photography (CFP). This research aims to propose a multi-label detection of 28 ocular abnormalities consisting of frequent and rare abnormalities from a single CFP by using transformer-based semantic dictionary learning.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!