Risk prediction model of metabolic syndrome in perimenopausal women based on machine learning.

Int J Med Inform

Department of Obstetrics and Gynecology, Peking University Ninth School of Clinical Medicine, Beijing Shijitan Hospital, Beijing 100038, China. Electronic address:

Published: August 2024

AI Article Synopsis

  • Metabolic syndrome (MetS) is a key concern for cardio-metabolic health, especially in postmenopausal women, making the perimenopausal period crucial for prevention efforts.
  • Researchers developed a risk prediction model using four machine learning methods—XGBoost, Random Forest, Multilayer Perceptron, and Logistic Regression—to forecast the likelihood of developing MetS among women aged 45-55 based on health examination data.
  • Results indicated that Random Forest and XGBoost performed the best in predicting MetS, with key risk factors identified including waist circumference, fasting blood glucose, and cholesterol levels, emphasizing the potential for early intervention in high-risk individuals.

Article Abstract

Introduction: Metabolic syndrome (MetS) is considered to be an important parameter of cardio-metabolic health and contributing to the development of atherosclerosis, type 2 diabetes. The incidence of MetS significantly increases in postmenopausal women, therefore, the perimenopausal period is considered a critical phase for prevention. We aimed to use four machine learning methods to predict whether perimenopausal women will develop MetS within 2 years.

Methods: Women aged 45-55 years who underwent 2 consecutive years of physical examinations in Ninth Clinical College of Peking University between January 2021 and December 2022 were included. We extracted 26 features from physical examinations, and used backward selection method to select top 10 features with the largest area under the receiver operating characteristic curve (AUC). Extreme gradient boosting (XGBoost), Random forest (RF), Multilayer perceptron (MLP) and Logistic regression (LR) were used to establish the model. Those performance were measured by AUC, accuracy, precision, recall and F1 score. SHapley Additive exPlanation (SHAP) value was used to identify risk factors affecting perimenopausal MetS.

Results: A total of 8700 women had physical examination records, and 2,254 women finally met the inclusion criteria. For predicting MetS events, RF and XGBoost had the highest AUC (0.96, 0.95, respectively). XGBoost has the highest F1 value (F1 = 0.77), followed by RF, LR and MLP. SHAP value suggested that the top 5 variables affecting MetS in this study were Waist circumference, Fasting blood glucose, High-density lipoprotein cholesterol, Triglycerides and Diastolic blood pressure, respectively.

Conclusion: We've developed a targeted MetS risk prediction model for perimenopausal women, using health examination data. This model enables early identification of high MetS risk in this group, offering significant benefits for individual health management and wider socio-economic health initiatives.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ijmedinf.2024.105480DOI Listing

Publication Analysis

Top Keywords

perimenopausal women
12
risk prediction
8
prediction model
8
metabolic syndrome
8
machine learning
8
physical examinations
8
xgboost highest
8
mets risk
8
women
7
mets
7

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!