A data-driven framework for identifying patient subgroups on which an AI/machine learning model may underperform.

Adarsh Subbaswamy Berkman Sahiner Nicholas Petrick Vinay Pai Roy Adams Matthew C Diamond Suchi Saria

NPJ Digit Med

Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.

Published: November 2024

A fundamental goal of evaluating the performance of a clinical model is to ensure it performs well across a diverse intended patient population. A primary challenge is that the data used in model development and testing often consist of many overlapping, heterogeneous patient subgroups that may not be explicitly defined or labeled. While a model's average performance on a dataset may be high, the model can have significantly lower performance for certain subgroups, which may be hard to detect. We describe an algorithmic framework for identifying subgroups with potential performance disparities (AFISP), which produces a set of interpretable phenotypes corresponding to subgroups for which the model's performance may be relatively lower. This could allow model evaluators, including developers and users, to identify possible failure modes prior to wide-scale deployment. We illustrate the application of AFISP by applying it to a patient deterioration model to detect significant subgroup performance disparities, and show that AFISP is significantly more scalable than existing algorithmic approaches.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11582698	PMC
http://dx.doi.org/10.1038/s41746-024-01275-6	DOI Listing

Publication Analysis

Top Keywords

framework identifying

patient subgroups

performance disparities

disparities afisp

model

performance

subgroups

data-driven framework

patient

identifying patient

Similar Publications

Sources of moral distress in nursing professionals: A scoping review.

Nurs Ethics

January 2025

Pontifícia Universidade Católica do Paraná (PUCPR).

Murilo Karasinski Elena Lomba de Oliveira Victor Lucas de Souza Pousa Giovanna Carolina Massaneiro Dos Santos Carla Corradi Perini

This article presents a scoping review aimed at mapping the main sources of moral distress among nursing professionals. The review was conducted according to the Arksey and O'Malley methodology, using the SPIDER framework to guide the systematic search in the BVS, PubMed, PsycArticles, Scielo, and Scopus databases. Initially, 2320 publications were identified.

View Article and Find Full Text PDF

Similar Publications

On fostering employee participation in voluntary corporate-sponsored wellness programs.

Appl Psychol Health Well Being

February 2025

College of Business, James Madison University, Harrisonburg, Virginia, USA.

M Gabriela Sava Bogdan C Bichescu Kristin Scott Sara Krivacek

As organizations are increasingly turning to voluntary wellness programs to improve employee well-being, the majority of studies in literature have focused on corporate-level benefits of wellness programs, such as productivity. However, there is a scarcity of studies that examine the intrinsic motivators that influence employee participation in such programs. In this study, we use a unique secondary dataset from a voluntary corporate wellness program and propose a novel theoretical framework based on motivational and behavioral theories to examine and understand the participants' behavior.

View Article and Find Full Text PDF

Similar Publications

Tendon Anomaly Identification in Prestressed Concrete Beams Based on an Advanced Monitoring MEMS and Data-Driven Detection of Structural Damage.

Sensors (Basel)

January 2025

Department of Structures for Engineering and Architecture, University of Naples Federico II, Via Claudio 21, 80125 Naples, Italy.

Giorgio de Alteriis Giulio Mariniello Tommaso Pastore Alessia Teresa Silvestri Giuseppe Augugliaro

The growing importance of state assessments in civil engineering has led to intensive research into the development of damage identification methods based on vibrations. Natural frequencies and modal shapes have garnered great interest because modal parameters are invariant of structure. Moreover, thanks to the global nature of modal parameters, their variations are not limited to the location of the damage.

View Article and Find Full Text PDF

Similar Publications

Analysis of Autonomous Penetration Testing Through Reinforcement Learning and Recommender Systems.

Sensors (Basel)

January 2025

Group of Analysis, Security and Systems (GASS), Department of Software Engineering and Artificial Intelligence (DISIA), Faculty of Computer Science and Engineering, Office 431, Universidad Complutense de Madrid (UCM), Calle Profesor José García Santesmases, 9, Ciudad Universitaria, 28040 Madrid, Spain.

Ariadna Claudia Moreno Aldo Hernandez-Suarez Gabriel Sanchez-Perez Linda Karina Toscano-Medina Hector Perez-Meana

Conducting penetration testing (pentesting) in cybersecurity is a crucial turning point for identifying vulnerabilities within the framework of Information Technology (IT), where real malicious offensive behavior is simulated to identify potential weaknesses and strengthen preventive controls. Given the complexity of the tests, time constraints, and the specialized level of expertise required for pentesting, analysis and exploitation tools are commonly used. Although useful, these tools often introduce uncertainty in findings, resulting in high rates of false positives.

View Article and Find Full Text PDF

Similar Publications

Driving-Related Cognitive Abilities Prediction Based on Transformer's Multimodal Fusion Framework.

Sensors (Basel)

December 2024

Faculty of Information Science and Technology, Beijing University of Technology, Beijing 100124, China.

Yifan Li Bo Liu Wenli Zhang

With the increasing complexity of urban roads and rising traffic flow, traffic safety has become a critical societal concern. Current research primarily addresses drivers' attention, reaction speed, and perceptual abilities, but comprehensive assessments of cognitive abilities in complex traffic environments are lacking. This study, grounded in cognitive science and neuropsychology, identifies and quantitatively evaluates ten cognitive components related to driving decision-making, execution, and psychological states by analyzing video footage of drivers' actions.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!