Typically, deep learning models for image segmentation tasks are trained using large datasets of images annotated at the pixel level, which can be expensive and highly time-consuming. A way to reduce the amount of annotated images required for training is to adopt a semi-supervised approach. In this regard, generative deep learning models, concretely Generative Adversarial Networks (GANs), have been adapted to semi-supervised training of segmentation tasks. This work proposes MaskGDM, a deep learning architecture combining some ideas from EditGAN, a GAN that jointly models images and their segmentations, together with a generative diffusion model. With careful integration, we find that using a generative diffusion model can improve EditGAN performance results in multiple segmentation datasets, both multi-class and with binary labels. According to the quantitative results obtained, the proposed model improves multi-class image segmentation when compared to the EditGAN and DatasetGAN models, respectively, by [Formula: see text] and [Formula: see text]. Moreover, using the ISIC dataset, our proposal improves the results from other models by up to [Formula: see text] for the binary image segmentation approach.

Download full-text PDF

Source
http://dx.doi.org/10.1142/S0129065724500576DOI Listing

Publication Analysis

Top Keywords

image segmentation
16
deep learning
12
[formula text]
12
generative adversarial
8
adversarial networks
8
learning models
8
segmentation tasks
8
generative diffusion
8
diffusion model
8
models [formula
8

Similar Publications

Background: Segmentation models for clinical data experience severe performance degradation when trained on a single client from one domain and distributed to other clients from different domain. Federated Learning (FL) provides a solution by enabling multi-party collaborative learning without compromising the confidentiality of clients' private data.

Methods: In this paper, we propose a cross-domain FL method for Weakly Supervised Semantic Segmentation (FL-W3S) of white blood cells in microscopic images.

View Article and Find Full Text PDF

Traumatic injury remains a leading cause of death worldwide, with traumatic bleeding being one of its most critical and fatal consequences. The use of whole-body computed tomography (WBCT) in trauma management has rapidly expanded. However, interpreting WBCT images within the limited time available before treatment is particularly challenging for acute care physicians.

View Article and Find Full Text PDF

Purpose: The integration of artificial intelligence (AI), particularly deep learning (DL), with optical coherence tomography (OCT) offers significant opportunities in the diagnosis and management of glaucoma. This article explores the application of various DL models in enhancing OCT capabilities and addresses the challenges associated with their clinical implementation.

Methods: A review of articles utilizing DL models was conducted, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), generative adversarial networks (GANs), autoencoders, and large language models (LLMs).

View Article and Find Full Text PDF

Preferential Sites of Retinal Capillary Occlusion in Sickle Cell Disease.

Invest Ophthalmol Vis Sci

January 2025

Department of Ophthalmology, New York Eye and Ear Infirmary of Mount Sinai, New York, New York, United States.

Purpose: To assess the preferential sites of retinal capillary occlusion at the parafovea in patients with sickle cell disease (SCD) using optical coherence tomography angiography (OCT-A).

Methods: OCT-A scans from 107 patients with SCD and 51 race-matched unaffected controls were obtained using a commercial spectral domain-OCT system. At least eight sequential 3 × 3 mm scans centered at the fovea were acquired and averaged for image analysis.

View Article and Find Full Text PDF

The liver segmentation method proposed by Couinaud is widely accepted by surgeons because of its convenience and practicality. However, this conventional eight-segment classification does not reflect realistic details of the liver and thus requires further adjustments to promote improvements in surgical strategies. This study aimed to explore the ramification patterns of the hepatic vasculature comprehensively.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!