Predicting dental caries outcomes in young adults using machine learning approach.

BMC Oral Health

Department of Preventive and Community Dentistry, The University of Iowa College of Dentistry, 801 Newton Rd, Iowa City, IA, 52242, US.

Published: May 2024

Objectives: To predict the dental caries outcomes in young adults from a set of longitudinally-obtained predictor variables and identify the most important predictors using machine learning techniques.

Methods: This study was conducted using the Iowa Fluoride Study dataset. The predictor variables - sex, mother's education, family income, composite socio-economic status (SES), caries experience at ages 9, 13, and 17, and the cumulative estimates of risk and protective factors, including fluoride, dietary, and behavioral variables from ages 5-9, 9-13, 13-17, and 17-23 were used to predict the age 23 DMFS count. The following machine learning models (LASSO regression, generalized boosting machines (GBM), negative binomial (NegGLM), and extreme gradient boosting models (XGBOOST)) were compared under 5-fold cross validation with nested resampling techniques.

Results: The prevalence of cavitated level caries experience at age 23 (mean DMFS count) was 4.75. The predictive analysis found LASSO to be the best performing model (compared to GBM, NegGLM, and XGBOOST), with a root mean square error (RMSE) of 0.70, and coefficient of determination (R) of 0.44. After dichotomization of the predicted and observed values of the LASSO regression, the classification results showed accuracy, precision, recall, and ROC AUC of 83.7%, 85.9%, 93.1%, 68.2%, respectively. Previous caries experience at age 13 and age 17 and sugar-sweetened beverages intakes at age 13 and age 17 were found to be the four most important predictors of cavitated caries count at age 23.

Conclusion: Our machine learning model showed high accuracy and precision in the prediction of caries in young adults from a longitudinally-obtained predictor variables. Our model could, in the future, after further development and validation with other diverse population data, be used by public health specialists and policy-makers as a screening tool to identify the risk of caries in young adults and apply more targeted interventions. However, data from a more diverse population are needed to improve the quality and generalizability of caries prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11069237PMC
http://dx.doi.org/10.1186/s12903-024-04294-7DOI Listing

Publication Analysis

Top Keywords

young adults
16
machine learning
16
predictor variables
12
caries experience
12
caries
9
dental caries
8
caries outcomes
8
outcomes young
8
longitudinally-obtained predictor
8
age dmfs
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!