Model selection and performance assessment for prediction models are important tasks in machine learning, e.g. for the development of medical diagnosis or prognosis rules based on complex data. A common approach is to select the best model via cross-validation and to evaluate this final model on an independent dataset. In this work, we propose to instead evaluate several models simultaneously. These may result from varied hyperparameters or completely different learning algorithms. Our main goal is to increase the probability to correctly identify a model that performs sufficiently well. In this case, adjusting for multiplicity is necessary in the evaluation stage to avoid an inflation of the family wise error rate. We apply the so-called maxT-approach which is based on the joint distribution of test statistics and suitable to (approximately) control the family-wise error rate for a wide variety of performance measures. We conclude that evaluating only a single final model is suboptimal. Instead, several promising models should be evaluated simultaneously, e.g. all models within one standard error of the best validation model. This strategy has proven to increase the probability to correctly identify a good model as well as the final model performance in extensive simulation studies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270727 | PMC |
http://dx.doi.org/10.1177/0962280219854487 | DOI Listing |
Braz J Biol
January 2025
Escuela Superior Politécnica de Chimborazo - ESPOCH, El Coca, Ecuador.
The breeding and exploitation of chickens at the backyard or commercial family level is an activity of great economic relevance for families in Ecuador. In addition to providing protein of high biological value for food security, it revalues local food resources that could provide productive benefits. With this objective, a study has been conducted in order to explore the effect of C.
View Article and Find Full Text PDFCien Saude Colet
January 2025
Universidade Federal do Espírito Santo. Vitória ES Brasil.
The scope of this article is to analyze the correlation between alcohol consumption and abdominal obesity in participants of the ELSA-Brasil cohort after a follow-up period of nine years. A longitudinal analysis was performed with baseline and follow-up data from ELSA-Brasil. At baseline, 15,105 civil servants were enrolled.
View Article and Find Full Text PDFPLoS One
January 2025
Department of Pharmacotherapy, University of Utah, Salt Lake City, Utah, United States of America.
Background: Venous thromboembolism (VTE) and atrial fibrillation (AF) disproportionately affect older adults, who are at increased risk of bleeding from treatment with anticoagulant therapy. The impact of bleeding on older adults' quality of life (QoL) is poorly understood due to the lack of a validated measure of their experience. This study's purpose is to describe the first evidence-based steps in developing a new condition-specific patient-reported outcome measure (PROM) for the effect of anticoagulant-related bleeding on older adults' QoL.
View Article and Find Full Text PDFMol Pharm
January 2025
Regional Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic.
Lipid-mediated delivery of active pharmaceutical ingredients (API) opened new possibilities in advanced therapies. By encapsulating an API into a lipid nanocarrier (LNC), one can safely deliver APIs not soluble in water, those with otherwise strong adverse effects, or very fragile ones such as nucleic acids. However, for the rational design of LNCs, a detailed understanding of the composition-structure-function relationships is missing.
View Article and Find Full Text PDFMed Phys
January 2025
Deparment of Radiation Oncology, Duke University, Durham, North Carolina, USA.
Background: Stereotactic radiosurgery (SRS) is widely used for managing brain metastases (BMs), but an adverse effect, radionecrosis, complicates post-SRS management. Differentiating radionecrosis from tumor recurrence non-invasively remains a major clinical challenge, as conventional imaging techniques often necessitate surgical biopsy for accurate diagnosis. Machine learning and deep learning models have shown potential in distinguishing radionecrosis from tumor recurrence.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!