Diabetes is a growing health concern in developing countries, causing considerable mortality rates. While machine learning (ML) approaches have been widely used to improve early detection and treatment, several studies have shown low classification accuracies due to overfitting, underfitting, and data noise. This research employs parallel and sequential ensemble ML approaches paired with feature selection techniques to boost classification accuracy. The Pima India Diabetes Data from the UCI ML Repository served as the dataset. Data preprocessing included cleaning the dataset by replacing missing values with column means and selecting highly correlated features using forward and backward selection methods. The dataset was split into two parts: training (70%), and testing (30%). Python was used for classification in Jupyter Notebook, and there were two design phases. The first phase utilized J48, Classification and Regression Tree (CART), and Decision Stump (DS) to create a random forest model. The second phase employed the same algorithms alongside sequential ensemble methods-XG Boost, AdaBoostM1, and Gradient Boosting-using an average voting algorithm for binary classification. Evaluation revealed that XG Boost, AdaBoostM1, and Gradient Boosting achieved classification accuracies of 100%, with performance metrics including F1 score, MCC, Precision, Recall, AUC-ROC, and AUC-PR all equal to 1.00, indicating reliable predictions of diabetes presence. Researchers and practitioners can leverage the predictive model developed in this work to make quick predictions of diabetes mellitus, which could save many lives.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41598-025-87767-1DOI Listing

Publication Analysis

Top Keywords

diabetes mellitus
8
classification accuracies
8
sequential ensemble
8
boost adaboostm1
8
adaboostm1 gradient
8
predictions diabetes
8
classification
6
diabetes
5
efficient diagnosis
4
diagnosis diabetes
4

Similar Publications

Clinical Relevance: Although laser refractive surgeries and multifocal intraocular lens implantation are generally avoided in patients with diabetic retinopathy, a substantial proportion of well-glycaemic-controlled type 2 diabetes mellitus patients are considered for these procedures. Pupil dynamics play a significant role in determining postoperative satisfaction in these patients.

Background: To evaluate pupillary dynamics in patients with and without diabetes following uneventful phacoemulsification surgery.

View Article and Find Full Text PDF

Background: Gestational diabetes mellitus is hyperglycemia in special populations (pregnant women), however gestational diabetes mellitus (GDM) not only affects maternal health, but also has profound effects on offspring health. The prevalence of gestational diabetes in my country is gradually increasing.

Objective: To study the application effect of self-transcendence nursing model in GDM patients.

View Article and Find Full Text PDF

Background: Atrial fibrillation (AF) is the most prevalent arrhythmia encountered in clinical practice. Triglyceride glucose index (Tyg), a convenient evaluation variable for insulin resistance, has shown associations with adverse cardiovascular outcomes. However, studies on the Tyg index's predictive value for adverse prognosis in patients with AF without diabetes are lacking.

View Article and Find Full Text PDF

Objective: Understanding healthcare-seeking propensity is crucial for optimizing healthcare utilization, especially for patients with chronic conditions like hypertension or diabetes, given their substantial burden on healthcare systems globally. This study aims to evaluate hypertensive or diabetic patients' healthcare-seeking propensity based on the severity of symptoms, categorizing symptoms as either major or minor. It also explores factors influencing healthcare-seeking propensity and examines whether healthcare-seeking propensity affects healthcare utilization and preventable hospitalizations.

View Article and Find Full Text PDF

Background: The Weight-adjusted-waist index (WWI) has emerged as a predictive factor for a range of metabolic disorders. To date, the predictive value of the WWI in relation to sarcopenia in individuals with diabetics has not been extensively explored. This study aims to investigate the impact of the WWI on the prevalence of sarcopenia among patients with type 2 diabetes mellitus (T2DM).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!