A fundamental goal of evaluating the performance of a clinical model is to ensure it performs well across a diverse intended patient population. A primary challenge is that the data used in model development and testing often consist of many overlapping, heterogeneous patient subgroups that may not be explicitly defined or labeled. While a model's average performance on a dataset may be high, the model can have significantly lower performance for certain subgroups, which may be hard to detect. We describe an algorithmic framework for identifying subgroups with potential performance disparities (AFISP), which produces a set of interpretable phenotypes corresponding to subgroups for which the model's performance may be relatively lower. This could allow model evaluators, including developers and users, to identify possible failure modes prior to wide-scale deployment. We illustrate the application of AFISP by applying it to a patient deterioration model to detect significant subgroup performance disparities, and show that AFISP is significantly more scalable than existing algorithmic approaches.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11582698PMC
http://dx.doi.org/10.1038/s41746-024-01275-6DOI Listing

Publication Analysis

Top Keywords

framework identifying
8
patient subgroups
8
performance disparities
8
disparities afisp
8
model
6
performance
6
subgroups
5
data-driven framework
4
patient
4
identifying patient
4

Similar Publications

This article presents a scoping review aimed at mapping the main sources of moral distress among nursing professionals. The review was conducted according to the Arksey and O'Malley methodology, using the SPIDER framework to guide the systematic search in the BVS, PubMed, PsycArticles, Scielo, and Scopus databases. Initially, 2320 publications were identified.

View Article and Find Full Text PDF

As organizations are increasingly turning to voluntary wellness programs to improve employee well-being, the majority of studies in literature have focused on corporate-level benefits of wellness programs, such as productivity. However, there is a scarcity of studies that examine the intrinsic motivators that influence employee participation in such programs. In this study, we use a unique secondary dataset from a voluntary corporate wellness program and propose a novel theoretical framework based on motivational and behavioral theories to examine and understand the participants' behavior.

View Article and Find Full Text PDF

The growing importance of state assessments in civil engineering has led to intensive research into the development of damage identification methods based on vibrations. Natural frequencies and modal shapes have garnered great interest because modal parameters are invariant of structure. Moreover, thanks to the global nature of modal parameters, their variations are not limited to the location of the damage.

View Article and Find Full Text PDF

Analysis of Autonomous Penetration Testing Through Reinforcement Learning and Recommender Systems.

Sensors (Basel)

January 2025

Group of Analysis, Security and Systems (GASS), Department of Software Engineering and Artificial Intelligence (DISIA), Faculty of Computer Science and Engineering, Office 431, Universidad Complutense de Madrid (UCM), Calle Profesor José García Santesmases, 9, Ciudad Universitaria, 28040 Madrid, Spain.

Conducting penetration testing (pentesting) in cybersecurity is a crucial turning point for identifying vulnerabilities within the framework of Information Technology (IT), where real malicious offensive behavior is simulated to identify potential weaknesses and strengthen preventive controls. Given the complexity of the tests, time constraints, and the specialized level of expertise required for pentesting, analysis and exploitation tools are commonly used. Although useful, these tools often introduce uncertainty in findings, resulting in high rates of false positives.

View Article and Find Full Text PDF

Driving-Related Cognitive Abilities Prediction Based on Transformer's Multimodal Fusion Framework.

Sensors (Basel)

December 2024

Faculty of Information Science and Technology, Beijing University of Technology, Beijing 100124, China.

With the increasing complexity of urban roads and rising traffic flow, traffic safety has become a critical societal concern. Current research primarily addresses drivers' attention, reaction speed, and perceptual abilities, but comprehensive assessments of cognitive abilities in complex traffic environments are lacking. This study, grounded in cognitive science and neuropsychology, identifies and quantitatively evaluates ten cognitive components related to driving decision-making, execution, and psychological states by analyzing video footage of drivers' actions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!