Using diffusion models to generate synthetic labeled data for medical image segmentation.

Int J Comput Assist Radiol Surg

Department of Medical Imaging, University of Toronto, 263 McCaul Street, Toronto, M5T 1W7, ON, Canada.

Published: August 2024

Purpose: Medical image analysis has become a prominent application area for machine learning. However, high-quality, publicly available data are limited, owing either to patient privacy laws or to the time and cost required for experts to annotate images. In this retrospective study, we designed and evaluated a pipeline that generates synthetic labeled polyp images to augment the training data of medical image segmentation models, with the aim of reducing this data scarcity.

Methods: We trained diffusion models on the HyperKvasir dataset, comprising 1000 images of polyps in the human gastrointestinal (GI) tract from 2008 to 2016. The generated images were evaluated using qualitative expert review, Fréchet Inception Distance (FID), and Multi-Scale Structural Similarity (MS-SSIM). Additionally, various segmentation models were trained with the generated data and evaluated using Dice score (DS) and Intersection over Union (IoU).
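The two segmentation metrics named above can be illustrated with a minimal sketch. It assumes binary masks stored as nested lists of 0/1 values; the function names (`dice_score`, `iou_score`) and the convention of returning 1.0 when both masks are empty are illustrative assumptions, not details taken from the study.

```python
from typing import Sequence, Tuple

Mask = Sequence[Sequence[int]]  # hypothetical mask type: rows of 0/1 pixels


def _overlap_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted area, target area) for two binary masks."""
    if len(pred) != len(target) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, target)
    ):
        raise ValueError("masks must have the same shape")
    inter = pred_area = target_area = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p, t = bool(p), bool(t)
            inter += p and t
            pred_area += p
            target_area += t
    return inter, pred_area, target_area


def dice_score(pred: Mask, target: Mask) -> float:
    """Dice score: 2 * |A intersect B| / (|A| + |B|)."""
    inter, pred_area, target_area = _overlap_counts(pred, target)
    total = pred_area + target_area
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * inter / total


def iou_score(pred: Mask, target: Mask) -> float:
    """Intersection over Union: |A intersect B| / |A union B|."""
    inter, pred_area, target_area = _overlap_counts(pred, target)
    union = pred_area + target_area - inter
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


if __name__ == "__main__":
    predicted = [[0, 1, 1],
                 [0, 1, 0]]
    reference = [[0, 1, 0],
                 [0, 1, 1]]
    print(dice_score(predicted, reference), iou_score(predicted, reference))
```

For a single pair of masks the two metrics are linked by DS = 2 IoU / (1 + IoU), so they rank individual predictions identically; averages over a dataset can still differ in ordering.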

Results: Our pipeline produced images more akin to real polyp images, as judged by FID scores. Segmentation model performance also improved over GAN methods when the models were trained entirely or partially with synthetic data, despite requiring less compute for training. Moreover, the improvement persisted when the models were tested on different datasets, demonstrating the transferability of the generated images.

Conclusions: The proposed pipeline produced realistic image and mask pairs which could reduce the need for manual data annotation when performing a machine learning task. We support this use case by showing that the methods proposed in this study enhanced segmentation model performance, as measured by Dice and IoU scores, when trained fully or partially on synthetic data.

Source
http://dx.doi.org/10.1007/s11548-024-03213-z
