The Problem of Fairness in Synthetic Healthcare Data.

Entropy (Basel)

Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.

Published: September 2021

Access to healthcare data such as electronic health records (EHR) is often restricted by laws established to protect patient privacy. These restrictions hinder the reproducibility of existing results based on private healthcare data and also limit new research. Synthetically generated healthcare data address this problem by preserving privacy while enabling researchers and policymakers to base decisions and methods on realistic data. Healthcare data can include information about multiple inpatient and outpatient visits per patient, making them time-series data that are often influenced by protected attributes such as age, gender, and race. The COVID-19 pandemic has exacerbated health inequities, with certain subgroups experiencing poorer outcomes and less access to healthcare. To combat these inequities, synthetic data must "fairly" represent diverse minority subgroups so that conclusions drawn from synthetic data are correct and the results generalize to real data. In this article, we develop two fairness metrics for synthetic data and apply them to all subgroups defined by protected attributes to analyze the bias in three published synthetic research datasets. These covariate-level disparity metrics revealed that synthetic data may not be representative at the univariate and multivariate subgroup levels; thus, fairness should be addressed when developing data generation methods. We discuss the need for measuring fairness in synthetic healthcare data to enable the development of robust machine learning models and more equitable synthetic healthcare datasets.
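
The abstract does not define its two fairness metrics, so the following is only a minimal sketch of the general idea of checking subgroup representation at the univariate and multivariate levels. It is not the authors' method. The function names (`subgroup_shares`, `representation_gaps`), the toy records, and the choice of a simple difference in subgroup shares are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def subgroup_shares(records, attrs):
    """Share of records in each subgroup defined by the given attributes."""
    counts = Counter(tuple(r[a] for a in attrs) for r in records)
    total = sum(counts.values())
    return {group: n / total for group, n in counts.items()}


def representation_gaps(real, synthetic, protected):
    """Synthetic share minus real share for every subgroup.

    Covers every non-empty combination of protected attributes, so both
    univariate (one attribute) and multivariate (several attributes)
    subgroups are included. This is an illustrative measure, not one of
    the metrics developed in the article.
    """
    gaps = {}
    for k in range(1, len(protected) + 1):
        for attrs in combinations(protected, k):
            real_p = subgroup_shares(real, attrs)
            syn_p = subgroup_shares(synthetic, attrs)
            for group in set(real_p) | set(syn_p):
                gaps[(attrs, group)] = syn_p.get(group, 0.0) - real_p.get(group, 0.0)
    return gaps


# Hypothetical toy data: four "real" and four "synthetic" patient records.
real = [
    {"gender": "F", "race": "A"},
    {"gender": "F", "race": "B"},
    {"gender": "M", "race": "A"},
    {"gender": "M", "race": "B"},
]
synthetic = [
    {"gender": "F", "race": "A"},
    {"gender": "M", "race": "A"},
    {"gender": "M", "race": "A"},
    {"gender": "M", "race": "B"},
]

gaps = representation_gaps(real, synthetic, ["gender", "race"])
for (attrs, group), gap in sorted(gaps.items()):
    print(attrs, group, round(gap, 3))
```

A negative gap means the subgroup is under-represented in the synthetic data relative to the real data. In this toy example a subgroup that exists in the real data is missing from the synthetic data entirely, which a check on single attributes alone would not show as clearly.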

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8468495
DOI: http://dx.doi.org/10.3390/e23091165

Publication Analysis

Top Keywords

healthcare data: 24
synthetic data: 16
data: 13
synthetic healthcare: 12
synthetic: 8
fairness synthetic: 8
healthcare: 8
access healthcare: 8
protected attributes: 8
problem fairness: 4

Similar Publications

Psychological Distress as a Mediator Between Work-Family Conflict and Nurse Managers' Professional and Organizational Turnover Intentions.

J Nurs Adm

December 2024

Author Affiliation: Assistant Professor, School of Nursing and Healthcare Leadership, University of Washington, Tacoma.

Objective: This study aimed to investigate the mediating role of psychological distress in the relationship between work-family conflict and nurse managers' (NMs') professional and organizational turnover intentions.

Background: Work-family conflict is prevalent among NMs. It can have a significant impact on their intent to leave their organization and the profession.

Background: Due to advances in treatment, HIV is now a chronic condition with near-normal life expectancy. However, people with HIV continue to have a higher burden of mental and physical health conditions and are impacted by wider socioeconomic issues. Positive Voices is a nationally representative series of surveys of people with HIV in the United Kingdom.

Background: Bangladesh and West Bengal, India, are 2 densely populated South Asian neighboring regions with many socioeconomic and cultural similarities. In dealing with breast cancer (BC)-related issues, statistics show that people from these regions are having similar problems and fates. According to the Global Cancer Statistics 2020 and 2012 reports, for BC (particularly female BC), the age-standardized incidence rate is approximately 22 to 25 per 100,000 people, and the age-standardized mortality rate is approximately 11 to 13 per 100,000 for these areas.

Objective: Cervical spondylotic myelopathy (CSM) shows varying levels of improvement after surgical treatment. While some patients improve soon after surgery, others may take months to years to show any signs of improvement. The goal of this study was to evaluate postoperative improvement, patient-reported outcomes, and patient satisfaction up to 2 years after surgical treatment for CSM, which will help optimize the current treatment strategies and effectively manage patient expectations.

Objective: Aneurysmal subarachnoid hemorrhage (SAH) is associated with high morbidity and mortality rates. In particular, functional outcomes of SAH caused by large or giant (≥ 10 mm) ruptured intracranial aneurysms are worsened by high procedure-related complication rates. However, studies describing the risk factors for poor functional outcomes specific to ruptured large/giant aneurysms are sparse.
