A tutorial on variable selection for clinical prediction models: feature selection methods in data mining could improve the results.

J Clin Epidemiol

Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Velenjak, 1985717413 Tehran, Iran; Department of Biostatistics and Epidemiology, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Velenjak, 1985717413 Tehran, Iran. Electronic address:

Published: March 2016

Objectives: Identifying an appropriate set of predictors for the outcome of interest is a major challenge in clinical prediction research. The aim of this study was to show the application of some variable selection methods, usually used in data mining, for an epidemiological study. We introduce here a systematic approach.

Study Design And Setting: The P-value-based method, usually used in epidemiological studies, and several filter and wrapper methods were implemented to select the predictors of diabetes among 55 variables in 803 prediabetic females, aged ≥ 20 years, followed for 10-12 years. To develop a logistic model, variables were selected from a train data set and evaluated on the test data set. The measures of Akaike information criterion (AIC) and area under the curve (AUC) were used as performance criteria. We also implemented a full model with all 55 variables.

Results: We found that the worst and the best models were the full model and models based on the wrappers, respectively. Among filter methods, symmetrical uncertainty gave both the best AUC and AIC.

Conclusion: Our experiment showed that the variable selection methods used in data mining could improve the performance of clinical prediction models. An R program was developed to make these methods more feasible and visualize the results.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jclinepi.2015.10.002DOI Listing

Publication Analysis

Top Keywords

variable selection
12
clinical prediction
12
selection methods
12
methods data
12
data mining
12
prediction models
8
mining improve
8
data set
8
full model
8
methods
6

Similar Publications

[How to report the end-of-life decisions in the clinical record? Proposal of an "ABCD".].

Recenti Prog Med

January 2025

Uoc Anestesia e rianimazione, AO San Camillo-Forlanini, Roma.

Coping with the end of life decision making process in ICU, its complexity adds a challenge for the healthcare team: how to report in the medical record the events and reasoning that led to withholding or withdrawing treatments shifting from intensive to palliative care. Each healthcare team must select the best approach for managing the decision-making process and the necessary rules to ensure a correct clinical history narrative, indicating who must write and what has to be written. Taking into account the team organization, the report may be written not necessarily by the ICU director, but also by a staff physician as a spokesperson in the individual case.

View Article and Find Full Text PDF

Background: Correct information is an essential tool to guide thoughts, attitudes, daily choices or more important decisions such as those regarding health. Today, a huge amount of information sources and media is available. Increasing possibilities of obtaining data also require understanding and positioning skills, particularly the ability to navigate the ocean of information and to choose what is best without becoming overwhelmed.

View Article and Find Full Text PDF

With the increasing maturity of genetic profiling, an essential and routine task in cancer research is to model disease outcomes/phenotypes using genetic variables. Many methods have been successfully developed. However, oftentimes, empirical performance is unsatisfactory because of a "lack of information.

View Article and Find Full Text PDF

Oral health-related quality of life status and risk factors in patients with mental disorders.

Hua Xi Kou Qiang Yi Xue Za Zhi

February 2025

State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Dept. of Orthognathic and Temporomandibular Joint Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China.

Objectives: This study aims to explore the current status and risk factors of oral health-related quality of life OHRQoL in patients with mental disorders and provide evidence for effective intervention measures.

Methods: A total of 397 patients diagnosed with mental illness were selected by convenience sampling, and investigation was carried out using general data questionnaire, health literacy in dentistry-14 (HeLD-14), oral health impact profile-14 (OHIP-14), and oral health status checklist.

Results: The total score of OHIP-14 in patients with mental disorders was 8(2, 14).

View Article and Find Full Text PDF

Introduction In the realm of Carotid Artery Stenting (CAS), various access methods such as Transfemoral access (TFA), Transradial Artery access (TRA), and Transbrachial access (TBA) have been employed. While TFA is widely established, TRA and TBA offer alternative options. TBA lacks comprehensive studies, and there is a notable lack of comprehensive evidence systematically evaluating its outcomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!