Using histopathology latent diffusion models as privacy-preserving dataset augmenters improves downstream classification performance.

Comput Biol Med

Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany; Pathology & Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom; Department of Medicine I, University Hospital Dresden, Dresden, Germany; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany. Electronic address:

Published: June 2024

Latent diffusion models (LDMs) have emerged as a state-of-the-art image generation method, outperforming previous Generative Adversarial Networks (GANs) in terms of training stability and image quality. In computational pathology, generative models are valuable for data sharing and data augmentation. However, the impact of LDM-generated images on histopathology tasks compared to traditional GANs has not been systematically studied. We trained three LDMs and a styleGAN2 model on histology tiles from nine colorectal cancer (CRC) tissue classes. The LDMs include 1) a fine-tuned version of stable diffusion v1.4, 2) a Kullback-Leibler (KL)-autoencoder (KLF8-DM), and 3) a vector quantized (VQ)-autoencoder deploying LDM (VQF8-DM). We assessed image quality through expert ratings, dimensional reduction methods, distribution similarity measures, and their impact on training a multiclass tissue classifier. Additionally, we investigated image memorization in the KLF8-DM and styleGAN2 models. All models provided a high image quality, with the KLF8-DM achieving the best Frechet Inception Distance (FID) and expert rating scores for complex tissue classes. For simpler classes, the VQF8-DM and styleGAN2 models performed better. Image memorization was negligible for both styleGAN2 and KLF8-DM models. Classifiers trained on a mix of KLF8-DM generated and real images achieved a 4% improvement in overall classification accuracy, highlighting the usefulness of these images for dataset augmentation. Our systematic study of generative methods showed that KLF8-DM produces the highest quality images with negligible image memorization. The higher classifier performance in the generatively augmented dataset suggests that this augmentation technique can be employed to enhance histopathology classifiers for various tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2024.108410DOI Listing

Publication Analysis

Top Keywords

image quality
12
image memorization
12
latent diffusion
8
diffusion models
8
tissue classes
8
stylegan2 models
8
models
7
image
7
klf8-dm
6
histopathology latent
4

Similar Publications

Background: Locoregional external beam radiotherapy (EBRT) is selectively used in thyroid cancer patients to induce locoregional control. However, despite technological advances, EBRT remains associated with toxicities. We evaluated thyroid-cancer specific toxicities and long-term Quality of Life (QoL) post-EBRT.

View Article and Find Full Text PDF

Colony-stimulating factor 1 receptor (CSF1R) is almost exclusively expressed on microglia in the human brain and thus, has promise as a biomarker for imaging microglia density as a proxy for neuroinflammation. [C]CPPC is a radiotracer with selective affinity to CSF1R, and has been evaluated for in-human microglia PET imaging. The flourine-18 labeled CPPC derivative, 5-cyano-N-(4-(4-(2-[F]fluoroethyl)piperazin-1-yl)-2-(piperidin-1-yl)phenyl)furan-2-carboxamide ([F]FCPPC), was previously synthesized, however, with a low radiochemical yield using manual radiosynthesis.

View Article and Find Full Text PDF

Background: Bedside thoracic ultrasound (US) offers numerous advantages over chest X-ray (CXR) for identification of recurrent pneumothoraces (PTX) after tube thoracostomy (TT) removal. Technologic advancements have led to the development of hand-held devices capable of producing high-quality images termed ultra-portable US (UPUS). We hypothesized that UPUS would be as successful as CXR in detecting post-TT removal PTX and would be preferred by patients.

View Article and Find Full Text PDF

In the field of agriculture, particularly within the context of machine learning applications, quality datasets are essential for advancing research and development. To address the challenges of identifying different mango leaf types and recognizing the diverse and unique characteristics of mango varieties in Bangladesh, a comprehensive and publicly accessible dataset titled "BDMANGO" has been created. This dataset includes images essential for research, featuring six mango varieties: Amrapali, Banana, Chaunsa, Fazli, Haribhanga, and Himsagar, which were collected from different locations.

View Article and Find Full Text PDF

Aberrant anatomical variation of the vertebral artery (VA) from an internal carotid artery (ICA) is considered a rare finding. The incidence of this phenomenon can lead to patients suffering from posterior circulation neurological deficit if the ICA becomes significantly diseased. VA atypical anatomical origin is considered one of the rare pathologies, not only precipitating neurovascular incidents but equally leading to severe difficulty in VA dissection and surgical exposure, especially in carotid artery procedures.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!