Evaluating and Improving the Performance and Racial Fairness of Algorithms for GFR Estimation.

Linying Zhang Lauren R Richter Tevin Kim George Hripcsak

medRxiv

Department of Biomedical Informatics Columbia University, New York, NY, USA.

Published: January 2024

Data-driven clinical prediction algorithms help clinicians, and understanding the factors that affect their performance and fairness is crucial for equitable healthcare.
This study focused on creating a prediction algorithm for estimating glomerular filtration rate (GFR) using electronic health records (EHR) and compared three approaches: CKD-EPI equations, epidemiological models, and EHR-based models with various machine learning techniques.
The findings revealed that while using more complex models and relevant clinical features reduced overall estimation error, the performance gap between African American and non-African American patients persisted, highlighting the need for better identification of objective patient characteristics to enhance algorithm fairness and effectiveness.

Data-driven clinical prediction algorithms are used widely by clinicians. Understanding what factors can impact the performance and fairness of data-driven algorithms is an important step towards achieving equitable healthcare. To investigate the impact of modeling choices on the algorithmic performance and fairness, we make use of a case study to build a prediction algorithm for estimating glomerular filtration rate (GFR) based on the patient's electronic health record (EHR). We compare three distinct approaches for estimating GFR: CKD-EPI equations, epidemiological models, and EHR-based models. For epidemiological models and EHR-based models, four machine learning models of varying computational complexity (i.e., linear regression, support vector machine, random forest regression, and neural network) were compared. Performance metrics included root mean squared error (RMSE), median difference, and the proportion of GFR estimates within 30% of the measured GFR value (P30). Differential performance between non-African American and African American group was used to assess algorithmic fairness with respect to race. Our study showed that the variable race had a negligible effect on error, accuracy, and differential performance. Furthermore, including more relevant clinical features (e.g., common comorbidities of chronic kidney disease) and using more complex machine learning models, namely random forest regression, significantly lowered the estimation error of GFR. However, the difference in performance between African American and non-African American patients did not decrease, where the estimation error for African American patients remained consistently higher than non-African American patients, indicating that more objective patient characteristics should be discovered and included to improve algorithm performance.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10802656	PMC
http://dx.doi.org/10.1101/2024.01.07.24300943	DOI Listing

Publication Analysis

Top Keywords

non-african american

african american

american patients

performance

performance fairness

epidemiological models

models ehr-based

ehr-based models

machine learning

learning models

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!