A Unified Framework on Generalizability of Clinical Prediction Models.

Front Artif Intell

Malone Center for Engineering in Healthcare, Whiting School of Engineering, Baltimore, MD, United States.

Published: April 2022

To be useful, clinical prediction models (CPMs) must be generalizable to patients in new settings. Evaluating generalizability of CPMs helps identify spurious relationships in data, provides insights on when they fail, and thus, improves the explainability of the CPMs. There are discontinuities in concepts related to generalizability of CPMs in the clinical research and machine learning domains. Specifically, conventional statistical reasons to explain poor generalizability such as inadequate model development for the purposes of generalizability, differences in coding of predictors and outcome between development and external datasets, measurement error, inability to measure some predictors, and missing data, all have differing and often complementary treatments, in the two domains. Much of the current machine learning literature on generalizability of CPMs is in terms of dataset shift of which several types have been described. However, little research exists to synthesize concepts in the two domains. Bridging this conceptual discontinuity in the context of CPMs can facilitate systematic development of CPMs and evaluation of their sensitivity to factors that affect generalizability. We survey generalizability and dataset shift in CPMs from both the clinical research and machine learning perspectives, and describe a unifying framework to analyze generalizability of CPMs and to explain their sensitivity to factors affecting it. Our framework leads to a set of signaling statements that can be used to characterize differences between datasets in terms of factors that affect generalizability of the CPMs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9100692PMC
http://dx.doi.org/10.3389/frai.2022.872720DOI Listing

Publication Analysis

Top Keywords

generalizability cpms
20
machine learning
12
generalizability
10
cpms
10
clinical prediction
8
prediction models
8
cpms clinical
8
clinical machine
8
dataset shift
8
sensitivity factors
8

Similar Publications

Background: The limitations of the traditional TNM system have spurred interest in multivariable models for personalized prognostication in laryngeal and hypopharyngeal cancers (LSCC/HPSCC). However, the performance of these models depends on the quality of data and modelling methodology, affecting their potential for clinical adoption. This systematic review and meta-analysis (SR-MA) evaluated clinical predictive models (CPMs) for recurrence and survival in treated LSCC/HPSCC.

View Article and Find Full Text PDF

Selective oxidation of nitrogenous heterocyclic compounds by heat/peroxymonosulfate in phenol-rich wastewater.

Water Res

February 2025

The Key Laboratory of Water and Sediment Sciences (Ministry of Education), College of Environmental Sciences and Engineering, Peking University, Beijing 100871, China; Yellow River Laboratory of Shanxi Province, Shanxi University, Taiyuan 030006, China. Electronic address:

In phenol-rich wastewater, such as coking wastewater, due to the high reactivity of phenol to various reactive oxygen species, it is difficult to selectively oxidize pollutants having lower biodegradability and higher toxicity than phenol. As one kind of such pollutants in coking wastewater, some nitrogenous heterocyclic compounds (NHCs) are more difficult to be removed by SO or HO• than phenol, but this study found that NHCs (quinoline, isoquinoline, and pyridine) can be selectively removed by peroxymonosulfate (PMS) direct oxidation in the presence of 10 mM phenol under thermal condition. The selective oxidation of NHCs needs a suitable pH range (4 < constant pH < 9) because protonated state of NHCs (pH < 4) is unfavorable to their oxidation and high pH would improve the extra PMS consumption by phenol.

View Article and Find Full Text PDF

Detection limits (DLs), where a variable cannot be measured outside of a certain range, are common in research. DLs may vary across study sites or over time. Most approaches to handling DLs in response variables implicitly make strong parametric assumptions on the distribution of data outside DLs.

View Article and Find Full Text PDF

Before deploying a clinical prediction model (CPM) in clinical practice, its performance needs to be demonstrated in the population of intended use. This is also called "targeted validation." Many CPMs developed in tertiary settings may be most useful in secondary care, where the patient case mix is broad and practitioners need to triage patients efficiently.

View Article and Find Full Text PDF

Real-life implementation and evaluation of the e-referral system SIPILINK.

Int J Med Inform

February 2025

Université Paris Cité, Sorbonne Université, Inserm, Centre de Recherche des Cordeliers, F-75006 Paris, France; Inria, HeKA, PariSanté Campus Paris, France; Department of Medical Informatics, AP-HP, Hôpital Européen Georges-Pompidou, Hôpital Necker F-75015 Paris, France; AP-HP, Hôpital Européen Georges-Pompidou, Hôpital Necker F-75015 Paris, France. Electronic address:

Introduction: General Practitioners (GPs) play a key role of gatekeeper, as they coordinate patients' care. However, most of them reported having difficulty to refer patients to hospital, especially in semi-urgent context. To facilitate the referral of semi-urgent patients, we implemented an e-referral platform, named SIPILINK, within 4 wards from a large public French hospital (internal medicine, diabetology, gynaecological surgery and oncology wards).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!