Generating and evaluating synthetic data in digital pathology through diffusion models.

Sci Rep

Data Science for Health Unit, Fondazione Bruno Kessler, Via Sommarive 18, Povo, Trento, 38123, Italy.

Published: November 2024

Synthetic data is becoming a valuable tool for computational pathologists, aiding in tasks like data augmentation and addressing data scarcity and privacy. However, its use necessitates careful planning and evaluation to prevent the creation of clinically irrelevant artifacts. This manuscript introduces a comprehensive pipeline for generating and evaluating synthetic pathology data using a diffusion model. The pipeline features a multifaceted evaluation strategy with an integrated explainability procedure, addressing two key aspects of synthetic data use in the medical domain. The evaluation of the generated data employs an ensemble-like approach. The first step assesses the similarity between real and synthetic data using established metrics. The second step evaluates the usability of the generated images in deep learning models, accompanied by explainable AI methods. The final step verifies their histopathological realism through questionnaires answered by professional pathologists. We show that each of these evaluation steps is necessary, as they provide complementary information on the generated data's quality. The pipeline is demonstrated on the public GTEx dataset of 650 Whole Slide Images (WSIs), including five different tissues. An equal number of tiles from each tissue is generated, and their reliability is assessed using the proposed evaluation pipeline, yielding promising results. In summary, the proposed workflow offers a comprehensive solution for generative AI in digital pathology, potentially aiding the community in its transition towards digitalization and data-driven modeling.
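
The abstract does not name the "established metrics" used in the first evaluation step. Purely as an illustration, and not as the authors' method, the sketch below computes a Fréchet distance between two sets of feature vectors, which is one common way to compare real and synthetic image distributions. The function name `frechet_distance`, the random stand-in features, and the feature dimension are all assumptions made for this example; in practice the features would come from a pretrained image encoder applied to real and generated tiles.

```python
import numpy as np

def frechet_distance(real_feats, synth_feats):
    """Fréchet distance between Gaussians fitted to two feature sets (rows = samples)."""
    mu_r, mu_s = real_feats.mean(axis=0), synth_feats.mean(axis=0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_s = np.cov(synth_feats, rowvar=False)
    # Tr(sqrt(cov_r @ cov_s)) equals the sum of the square roots of the
    # eigenvalues of the product, which are real and non-negative in theory;
    # clip small negative values caused by floating-point error.
    eigvals = np.linalg.eigvals(cov_r @ cov_s)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    mean_term = np.sum((mu_r - mu_s) ** 2)
    return float(mean_term + np.trace(cov_r) + np.trace(cov_s) - 2.0 * tr_sqrt)

# Hypothetical stand-in features: 500 samples, 8 dimensions each.
rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(500, 8))
close = rng.normal(0.0, 1.0, size=(500, 8))    # same distribution as `real`
shifted = rng.normal(2.0, 1.0, size=(500, 8))  # mean-shifted distribution

d_close = frechet_distance(real, close)
d_shift = frechet_distance(real, shifted)
print(f"same distribution: {d_close:.3f}, shifted distribution: {d_shift:.3f}")
```

A lower value indicates that the two feature distributions are closer. A distributional score of this kind says nothing about downstream usability or histopathological realism, which is consistent with the abstract's point that the three evaluation steps provide complementary information.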

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574254
DOI: http://dx.doi.org/10.1038/s41598-024-79602-w
