Synthetic data in machine learning for medicine and healthcare.

Nat Biomed Eng

Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.

Published: June 2021

The proliferation of synthetic data in artificial intelligence for medicine and healthcare raises concerns about the vulnerabilities of the software and the challenges of current policy.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353344PMC
http://dx.doi.org/10.1038/s41551-021-00751-8DOI Listing

Publication Analysis

Top Keywords

synthetic data
8
medicine healthcare
8
data machine
4
machine learning
4
learning medicine
4
healthcare proliferation
4
proliferation synthetic
4
data artificial
4
artificial intelligence
4
intelligence medicine
4

Similar Publications

Tabular data, spreadsheets organized in rows and columns, are ubiquitous across scientific fields, from biomedicine to particle physics to economics and climate science. The fundamental prediction task of filling in missing values of a label column based on the rest of the columns is essential for various applications as diverse as biomedical risk models, drug discovery and materials science. Although deep learning has revolutionized learning from raw data and led to numerous high-profile success stories, gradient-boosted decision trees have dominated tabular data for the past 20 years.

View Article and Find Full Text PDF

Poplar transformation with variable explant sources to maximize transformation efficiency.

Sci Rep

January 2025

Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA.

For decades, Agrobacterium tumefaciens-mediated plant transformation has played an integral role in advancing fundamental and applied plant biology. The recent omnipresent emergence of synthetic biology, which relies on plant transformation to manipulate plant DNA and gene expression for novel product biosynthesis, has further propelled basic as well as applied interests in plant transformation technologies. The strong demand for a faster design-build-test-learn cycle, the essence of synthetic biology, is, however, still ill-matched with the long-standing issues of high tissue culture recalcitrance and low transformation efficiency of a wide range of plant species especially food, fiber and energy crops.

View Article and Find Full Text PDF

Biopsy location and tumor-associated macrophages in predicting malignant glioma recurrence using an in-silico model.

NPJ Syst Biol Appl

January 2025

Center for Interdisciplinary Digital Sciences (CIDS), Department Information Services and High-Performance Computing (ZIH), Dresden University of Technology, 01062, Dresden, Germany.

Predicting the biological behavior and time to recurrence (TTR) of high-grade diffuse gliomas (HGG) after maximum safe neurosurgical resection and combined radiation and chemotherapy plays a pivotal role in planning clinical follow-up, selecting potentially necessary second-line treatment and improving the quality of life for patients diagnosed with a malignant brain tumor. The current standard-of-care (SoC) for HGG includes follow-up neuroradiological imaging to detect recurrence as early as possible and relies on several clinical, neuropathological, and radiological prognostic factors, which have limited accuracy in predicting TTR. In this study, using an in-silico analysis, we aim to improve predictive power for TTR by considering the role of (i) prognostically relevant information available through diagnostics used in the current SoC, (ii) advanced image-based information not currently part of the standard diagnostic workup, such as tumor-normal tissue interface (edge) features and quantitative data specific to biopsy positions within the tumor, and (iii) information on tumor-associated macrophages.

View Article and Find Full Text PDF

The increasing number of contaminants released into the environment necessitates innovative strategies for their detection and identification, particularly in complex environmental matrices like hospital wastewater. Hospital effluents contain both natural and synthetic hormones that might significantly contribute to endocrine disruption in aquatic ecosystems. In this study, HT-EDA has been implemented to identify the main effect-drivers (testosterone, androsterone and norgestrel) from hospital effluent using microplate fractionation, the AR-CALUX bioassay and an efficient data processing workflow.

View Article and Find Full Text PDF

Purpose: The purpose of this study is to present a correlative microscopy-tomography approach in conjunction with machine learning-based image segmentation techniques, with the goal of enabling quantitative structural and compositional elucidation of real-world pharmaceutical tablets.

Methods: Specifically, the approach involves three sequential steps: 1) user-oriented tablet constituent identification and characterization using correlative mosaic field-of-view SEM and energy dispersive X-ray spectroscopy techniques, 2) phase contrast synchrotron X-ray micro-computed tomography (SyncCT) characterization of a large, representative volume of the tablet, and 3) constituent segmentation and quantification of the imaging data through user-guided, iterative supervised machine learning and deep learning.

Results: This approach was implemented on a real-world tablet containing 15% API and multiple common excipients.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!