A robust and generalized framework in diabetes classification across heterogeneous environments.

Comput Biol Med

School of Information Technology, Deakin University, Melbourne, Victoria, Australia. Electronic address:

Published: January 2025

Diabetes mellitus (DM) represents a major global health challenge, affecting a diverse range of demographic populations across all age groups. It has particular implications for women during pregnancy and the postpartum period. The contemporary prevalence of sedentary lifestyle patterns and suboptimal dietary practices has substantially contributed to the escalating incidence of this metabolic disorder. The timely identification of diabetes mellitus (DM) in the female population is crucial for preventing related complications and facilitating the implementation of effective therapeutic interventions. However, conventional predictive models frequently demonstrate limited external validity when applied across heterogeneous datasets, potentially compromising clinical utility. This study proposes a robust machine learning (ML) framework for diabetes prediction across diverse populations using two distinct datasets: the PIMA and BD datasets. The framework employs intra-dataset, inter-dataset, and partial fusion dataset validation techniques to comprehensively assess the generalizability and performance of various models. In intra-dataset validation, the Extreme Gradient Boosting (XGBoost) model achieved the highest accuracy on the PIMA dataset with 79%. In contrast, the Random Forest (RF) and Gradient Boosting (GB) models demonstrated accuracy close to 99% on the BD dataset. For inter-dataset validation, where models were trained on one dataset and tested on the other, the ensemble model outperformed others with 88% accuracy when trained on PIMA and tested on BD. However, model performance declined when trained on BD and tested on PIMA (74%), reflecting the challenges of inter-dataset generalization ability. Finally, during partial fusion data validation, the deep learning (DL) model achieved 74% accuracy when trained on the BD dataset augmented with 30% of the PIMA dataset. This accuracy increased to 98% when training on the PIMA dataset combined with 30% of the BD data. These findings emphasize the importance of dataset diversity and the partial fusion dataset that can significantly enhance the model's robustness and generalizability. This framework offers valuable insights into the complexities of diabetes prediction across heterogeneous environments.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2025.109720DOI Listing

Publication Analysis

Top Keywords

partial fusion
12
pima dataset
12
dataset
9
framework diabetes
8
heterogeneous environments
8
diabetes mellitus
8
diabetes prediction
8
fusion dataset
8
gradient boosting
8
model achieved
8

Similar Publications

Background: Acute myeloid leukemia (AML) with RAM immunophenotype is a newly recognized high-risk AML immunophenotypic subcategory characterized by blasts with bright expression of CD56 and weak to absent expression of CD45, HLA-DR, and CD38, as first described by the Children's Oncology Group (COG). The relationship between AML-RAM and other CD56-positive acute leukemias is unclear. The goal of this study is to characterize the clinicopathological characteristics of AML with RAM phenotype and compare them with other CD56 co-expressing acute leukemias.

View Article and Find Full Text PDF

Study Design: Exploratory prospective observational case-control study.

Objectives: Aim of this study was to compare clinical and radiologic outcome, as well as peri-operative complications, of anterior lumbar interbody fusion (ALIF) and full-endoscopic/percutaneous trans-Kambin transforaminal lumbar interbody fusion (pTLIF) with a large-footprint interbody cage.

Methods: Patients that underwent elective ALIF and pTLIF with a large-footprint interbody cage were prospectively evaluated.

View Article and Find Full Text PDF

A robust and generalized framework in diabetes classification across heterogeneous environments.

Comput Biol Med

January 2025

School of Information Technology, Deakin University, Melbourne, Victoria, Australia. Electronic address:

Diabetes mellitus (DM) represents a major global health challenge, affecting a diverse range of demographic populations across all age groups. It has particular implications for women during pregnancy and the postpartum period. The contemporary prevalence of sedentary lifestyle patterns and suboptimal dietary practices has substantially contributed to the escalating incidence of this metabolic disorder.

View Article and Find Full Text PDF

The issue of obstacle avoidance and safety for visually impaired individuals has been a major topic of research. However, complex street environments still pose significant challenges for blind obstacle detection systems. Existing solutions often fail to provide real-time, accurate obstacle avoidance decisions.

View Article and Find Full Text PDF

Rare dual MYH9-ROS1 fusion variants in a patient with lung adenocarcinoma: A case report.

Medicine (Baltimore)

January 2025

Department of Respiratory and Critical Care Medicine, Zhongshan City People's Hospital, Zhongshan, Guangdong Province, China.

Rationale: ROS proto-oncogene 1 (ROS1) fusion is a rare but important driver mutation in non-small cell lung cancer, which usually shows significant sensitivity to small molecule tyrosine kinase inhibitors. With the widespread application of next-generation sequencing (NGS), more fusions and co-mutations of ROS1 have been discovered. Non-muscle myosin heavy chain 9 (MYH9) is a rare fusion partner of ROS1 gene as reported.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!