Large language models and synthetic health data: progress and prospects.

Daniel Smolyak Margrét V Bjarnadóttir Kenyon Crowley Ritu Agarwal

JAMIA Open

Center for Digital Health and Artificial Intelligence, Carey Business School, Johns Hopkins University, Baltimore, MD 21202, United States.

Published: December 2024

Objectives: Given substantial obstacles surrounding health data acquisition, high-quality synthetic health data are needed to meet a growing demand for the application of advanced analytics for clinical discovery, prediction, and operational excellence. We highlight how recent advances in large language models (LLMs) present new opportunities for progress, as well as new risks, in synthetic health data generation (SHDG).

Materials And Methods: We synthesized systematic scoping reviews in the SHDG domain, recent LLM methods for SHDG, and papers investigating the capabilities and limits of LLMs.

Results: We summarize the current landscape of generative machine learning models (eg, Generative Adversarial Networks) for SHDG, describe remaining challenges and limitations, and identify how recent LLM approaches can potentially help mitigate them.

Discussion: Six research directions are outlined for further investigation of LLMs for SHDG: evaluation metrics, LLM adoption, data efficiency, generalization, health equity, and regulatory challenges.

Conclusion: LLMs have already demonstrated both high potential and risks in the health domain, and it is important to study their advantages and disadvantages for SHDG.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512648	PMC
http://dx.doi.org/10.1093/jamiaopen/ooae114	DOI Listing

Publication Analysis

Top Keywords

health data

synthetic health

large language

language models

health

data

shdg

models synthetic

data progress

progress prospects

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!