Precision Segmentation of Subretinal Fluids in OCT Using Multiscale Attention-Based U-Net Architecture.

Bioengineering (Basel)

Department of Electrical, Computer, and Software Engineering, The University of Auckland, Auckland 1010, New Zealand.

Published: October 2024

AI Article Synopsis

  • The paper introduces a deep-learning architecture aimed at segmenting retinal fluids in patients with Diabetic Macular Oedema (DME) and Age-related Macular Degeneration (AMD), addressing challenges in accuracy faced by existing techniques.
  • The proposed model features an encoder-decoder network inspired by U-Net, utilizing enhanced OCT images with edge maps and advanced components like Residual and Inception modules alongside a multiscale attention mechanism for improved feature extraction.
  • Results show the network achieves high F1 Scores on multiple datasets, indicating enhanced segmentation accuracy and clinical potential for diagnosing and managing retinal diseases.

Article Abstract

This paper presents a deep-learning architecture for segmenting retinal fluids in patients with Diabetic Macular Oedema (DME) and Age-related Macular Degeneration (AMD). Accurate segmentation of multiple fluid types is critical for diagnosis and treatment planning, but existing techniques often struggle with precision. We propose an encoder-decoder network inspired by U-Net, processing enhanced OCT images and their edge maps. The encoder incorporates Residual and Inception modules with an autoencoder-based multiscale attention mechanism to extract detailed features. Our method shows superior performance across several datasets. On the RETOUCH dataset, the network achieved F1 Scores of 0.82 for intraretinal fluid (IRF), 0.93 for subretinal fluid (SRF), and 0.94 for pigment epithelial detachment (PED). The model also performed well on the OPTIMA and DUKE datasets, demonstrating high precision, recall, and F1 Scores. This architecture significantly enhances segmentation accuracy and edge precision, offering a valuable tool for diagnosing and managing retinal diseases. Its integration of dual-input processing, multiscale attention, and advanced encoder modules highlights its potential to improve clinical outcomes and advance retinal disease treatment.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11504175PMC
http://dx.doi.org/10.3390/bioengineering11101032DOI Listing

Publication Analysis

Top Keywords

multiscale attention
8
precision
4
precision segmentation
4
segmentation subretinal
4
subretinal fluids
4
fluids oct
4
oct multiscale
4
multiscale attention-based
4
attention-based u-net
4
u-net architecture
4

Similar Publications

Background: AD prevention and early interventions require tools for evaluation of people during aging for diagnosis and prognosis of AD conversion. Since AD is a complicated continuum of neurodegenerative processes, developing of such tools have been difficult because it needs longitudinal and multimodal data which are often complicated and incomplete. To address this challenge, we are developing AI4AD framework using ADNI data.

View Article and Find Full Text PDF

As combination therapy becomes more common in clinical applications, predicting adverse effects of combination medications is a challenging task. However, there are three limitations of the existing prediction models. First, they rely on a single view of the drug and cannot fully utilize multiview information, resulting in limited performance when capturing complex structures.

View Article and Find Full Text PDF

Microscopic imaging aids disease diagnosis by describing quantitative cell morphology and tissue size. However, the high spatial resolution of these images poses significant challenges for manual quantitative evaluation. This project proposes using computer-aided analysis methods to address these challenges, enabling rapid and precise clinical diagnosis, course analysis, and prognostic prediction.

View Article and Find Full Text PDF

Semantic segmentation of high-resolution images from remote sensing is crucial across various sectors. However, due to limitations in computational resources and the complexity of network architectures, many sophisticated semantic segmentation models struggle with efficiency in real-world applications, leading to an interest in developing lightweight model like borders. These models often employ a dual-branch structure, which balances processing speed and performance effectively.

View Article and Find Full Text PDF

To achieve real-time monitoring and intelligent maintenance of transformers, a framework based on deep vision and digital twin has been developed. An enhanced visual detection model, DETR + X, is proposed, implementing multidimensional sample data augmentation through Swin2SR and GAN networks. This model converts one-dimensional DGA data into three-dimensional feature images based on Gram angle fields, facilitating the transformation and fusion of heterogeneous modal information.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!