Model selection and performance assessment for prediction models are important tasks in machine learning, e.g. for the development of medical diagnosis or prognosis rules based on complex data. A common approach is to select the best model via cross-validation and to evaluate this final model on an independent dataset. In this work, we propose to instead evaluate several models simultaneously. These may result from varied hyperparameters or completely different learning algorithms. Our main goal is to increase the probability to correctly identify a model that performs sufficiently well. In this case, adjusting for multiplicity is necessary in the evaluation stage to avoid an inflation of the family wise error rate. We apply the so-called maxT-approach which is based on the joint distribution of test statistics and suitable to (approximately) control the family-wise error rate for a wide variety of performance measures. We conclude that evaluating only a single final model is suboptimal. Instead, several promising models should be evaluated simultaneously, e.g. all models within one standard error of the best validation model. This strategy has proven to increase the probability to correctly identify a good model as well as the final model performance in extensive simulation studies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270727PMC
http://dx.doi.org/10.1177/0962280219854487DOI Listing

Publication Analysis

Top Keywords

final model
12
model
9
prediction models
8
model selection
8
selection performance
8
performance assessment
8
increase probability
8
probability correctly
8
correctly identify
8
error rate
8

Similar Publications

The breeding and exploitation of chickens at the backyard or commercial family level is an activity of great economic relevance for families in Ecuador. In addition to providing protein of high biological value for food security, it revalues local food resources that could provide productive benefits. With this objective, a study has been conducted in order to explore the effect of C.

View Article and Find Full Text PDF

The scope of this article is to analyze the correlation between alcohol consumption and abdominal obesity in participants of the ELSA-Brasil cohort after a follow-up period of nine years. A longitudinal analysis was performed with baseline and follow-up data from ELSA-Brasil. At baseline, 15,105 civil servants were enrolled.

View Article and Find Full Text PDF

Background: Venous thromboembolism (VTE) and atrial fibrillation (AF) disproportionately affect older adults, who are at increased risk of bleeding from treatment with anticoagulant therapy. The impact of bleeding on older adults' quality of life (QoL) is poorly understood due to the lack of a validated measure of their experience. This study's purpose is to describe the first evidence-based steps in developing a new condition-specific patient-reported outcome measure (PROM) for the effect of anticoagulant-related bleeding on older adults' QoL.

View Article and Find Full Text PDF

Computational Methods for Modeling Lipid-Mediated Active Pharmaceutical Ingredient Delivery.

Mol Pharm

January 2025

Regional Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic.

Lipid-mediated delivery of active pharmaceutical ingredients (API) opened new possibilities in advanced therapies. By encapsulating an API into a lipid nanocarrier (LNC), one can safely deliver APIs not soluble in water, those with otherwise strong adverse effects, or very fragile ones such as nucleic acids. However, for the rational design of LNCs, a detailed understanding of the composition-structure-function relationships is missing.

View Article and Find Full Text PDF

Background: Stereotactic radiosurgery (SRS) is widely used for managing brain metastases (BMs), but an adverse effect, radionecrosis, complicates post-SRS management. Differentiating radionecrosis from tumor recurrence non-invasively remains a major clinical challenge, as conventional imaging techniques often necessitate surgical biopsy for accurate diagnosis. Machine learning and deep learning models have shown potential in distinguishing radionecrosis from tumor recurrence.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!