AI Article Synopsis

  • Diabetes is a metabolic disorder that progresses through three stages: health, pre-diabetes, and type 2 diabetes, with early diagnosis crucial for prevention and control.
  • A study using data from the Beijing Physical Examination Center analyzed over 1.8 million samples, categorizing them into normal fasting plasma glucose (NFG), impaired fasting glucose (IFG), and type 2 diabetes (T2DM) based on WHO standards.
  • Machine learning models, particularly eXtreme Gradient Boosting, were used to effectively distinguish between these categories, highlighting key risk factors associated with diabetes.

Article Abstract

Diabetes is a metabolic disorder caused by insufficient insulin secretion and insulin secretion disorders. From health to diabetes, there are generally three stages: health, pre-diabetes and type 2 diabetes. Early diagnosis of diabetes is the most effective way to prevent and control diabetes and its complications. In this work, we collected the physical examination data from Beijing Physical Examination Center from January 2006 to December 2017, and divided the population into three groups according to the WHO (1999) Diabetes Diagnostic Standards: normal fasting plasma glucose (NFG) (FPG < 6.1 mmol/L), mildly impaired fasting plasma glucose (IFG) (6.1 mmol/L ≤ FPG < 7.0 mmol/L) and type 2 diabetes (T2DM) (FPG > 7.0 mmol/L). Finally, we obtained1,221,598 NFG samples, 285,965 IFG samples and 387,076 T2DM samples, with a total of 15 physical examination indexes. Furthermore, taking eXtreme Gradient Boosting (XGBoost), random forest (RF), Logistic Regression (LR), and Fully connected neural network (FCN) as classifiers, four models were constructed to distinguish NFG, IFG and T2DM. The comparison results show that XGBoost has the best performance, with AUC (macro) of 0.7874 and AUC (micro) of 0.8633. In addition, based on the XGBoost classifier, three binary classification models were also established to discriminate NFG from IFG, NFG from T2DM, IFG from T2DM. On the independent dataset, the AUCs were 0.7808, 0.8687, 0.7067, respectively. Finally, we analyzed the importance of the features and identified the risk factors associated with diabetes.

Download full-text PDF

Source
http://dx.doi.org/10.3934/mbe.2022166DOI Listing

Publication Analysis

Top Keywords

physical examination
16
fpg mmol/l
12
diabetes
9
examination data
8
insulin secretion
8
type diabetes
8
fasting plasma
8
plasma glucose
8
nfg ifg
8
ifg t2dm
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!