Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods.

Seong Gyu Choi, Minsuk Oh, Dong-Hyuk Park, Byeongchan Lee, Yong-Ho Lee, Sun Ha Jee, Justin Y Jeon

Sci Rep

Department of Sports Industry Studies, Yonsei University, Seoul, Republic of Korea.

Published: August 2023

We compared the performance of machine learning-based models for predicting undiagnosed diabetes with that of traditional statistics-based models. We used data from the 2014-2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The 2014-2018 KNHANES data served as the training and internal validation sets, and the 2019-2020 data served as the external validation set. Prediction performance was compared using the area under the receiver operating characteristic curve (AUC). With sex, age, resting heart rate, and waist circumference as features, the machine learning-based model achieved a higher AUC than the traditional statistics-based model (0.788 vs. 0.740). With sex, age, waist circumference, family history of diabetes, hypertension, alcohol consumption, and smoking status as features, the machine learning-based model again achieved a higher AUC (0.802 vs. 0.759). When the features were chosen to maximize prediction performance, the machine learning-based model also achieved a higher AUC (0.819 vs. 0.765). Machine learning-based prediction models using anthropometric and lifestyle measurements may outperform traditional statistics-based prediction models in predicting undiagnosed diabetes.
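The AUC comparison described in the abstract can be illustrated with a minimal sketch. It assumes binary labels (1 = undiagnosed diabetes) and one risk score per person from each model. The function name `roc_auc` and the toy labels and scores are hypothetical and are not taken from the study or from KNHANES. The AUC is computed here from its rank interpretation: the probability that a randomly chosen positive case scores higher than a randomly chosen negative case.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Counts, over all (positive, negative) pairs, how often the positive
    case has the higher score; tied scores count as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: the same eight people scored by two models.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
scores_a = [0.1, 0.2, 0.3, 0.6, 0.4, 0.7, 0.8, 0.9]  # illustrative "model A"
scores_b = [0.1, 0.5, 0.6, 0.7, 0.2, 0.4, 0.8, 0.9]  # illustrative "model B"

auc_a = roc_auc(labels, scores_a)
auc_b = roc_auc(labels, scores_b)
print(f"model A AUC = {auc_a:.3f}, model B AUC = {auc_b:.3f}")
```

As in the study design, both models should be scored on the same held-out set (there, the 2019-2020 external validation data) so that the two AUC values are directly comparable.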

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10421881	PMC
http://dx.doi.org/10.1038/s41598-023-40170-0	DOI Listing

