Introduction: In the evolving landscape of healthcare and medicine, the merging of extensive medical datasets with the powerful capabilities of machine learning (ML) models presents a significant opportunity for transforming diagnostics, treatments, and patient care.

Methods: This research paper delves into the realm of data-driven healthcare, placing a special focus on identifying the most effective ML models for diabetes prediction and uncovering the critical features that aid in this prediction. The prediction performance is analyzed using a variety of ML models, such as Random Forest (RF), XG Boost (XGB), Linear Regression (LR), Gradient Boosting (GB), and Support VectorMachine (SVM), across numerousmedical datasets. The study of feature importance is conducted using methods including Filter-based, Wrapper-based techniques, and Explainable Artificial Intelligence (Explainable AI). By utilizing Explainable AI techniques, specifically Local Interpretable Model-agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP), the decision-making process of the models is ensured to be transparent, thereby bolstering trust in AI-driven decisions.

Results: Features identified by RF in Wrapper-based techniques and the Chi-square in Filter-based techniques have been shown to enhance prediction performance. A notable precision and recall values, reaching up to 0.9 is achieved in predicting diabetes.

Discussion: Both approaches are found to assign considerable importance to features like age, family history of diabetes, polyuria, polydipsia, and high blood pressure, which are strongly associated with diabetes. In this age of data-driven healthcare, the research presented here aspires to substantially improve healthcare outcomes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371799PMC
http://dx.doi.org/10.3389/frai.2024.1421751DOI Listing

Publication Analysis

Top Keywords

diabetes prediction
8
data-driven healthcare
8
prediction performance
8
wrapper-based techniques
8
diabetes
5
prediction
5
analyzing classification
4
classification feature
4
feature selection
4
selection strategies
4

Similar Publications

Objective: We derive and validate D-RISK, an electronic health record (EHR)-driven risk score to optimize and facilitate screening for undiagnosed dysglycemia (prediabetes + diabetes) in clinical practice.

Research Design And Methods: We used retrospective EHR data (derivation sample) and a prospective diabetes screening study (validation sample) to develop D-RISK. Logistic regression with backward selection was used to predict dysglycemia (HbA1c ≥5.

View Article and Find Full Text PDF

Hypoparathyroidism (hypoPTH), sensorineural deafness, and renal dysplasia (HDR) syndrome is a rare autosomal dominant condition with approximately 200 cases published. HDR syndrome is caused by variants of GATA binding protein 3 gene (), which encodes a transcription factor, with multiple types of variants reported. We present the case of a 76-year-old woman who was diagnosed with hypoPTH when she was aged 40 years and transferred care to our institution.

View Article and Find Full Text PDF

Objective: To investigate the association between the basic and clinical characteristics of patients with type 2 diabetes mellitus (T2DM) and their susceptibility to Klebsiella pneumoniae colonization (KPC). Additionally, a clinical prediction model was developed to identify high-risk patients for KPC.

Methods: Data from 486 T2DM patients who visited Shanghai Fifth People's Hospital from December 2020 to December 2022 were retrospectively collected.

View Article and Find Full Text PDF

In this study, we integrated transcriptomic and metabolomic analyses to achieve a comprehensive understanding of the underlying mechanisms of diabetic cardiomyopathy (DCM) in a diabetic rat model. Functional and molecular characterizations revealed significant cardiac injury, dysfunction, and ventricular remodeling in DCM. A thorough analysis of global changes in genes and metabolites showed that amino acid metabolism, especially the breakdown of branched-chain amino acids (BCAAs) such as valine, leucine, and isoleucine, is highly dysregulated.

View Article and Find Full Text PDF

Using Social Media for More Engaged Users and Enhanced Health Communication in Diabetes Care.

Am J Lifestyle Med

January 2025

College of Agricultural Sciences and Natural Resources, Texas Tech University System, Lubbock, TX, USA.

Background: Recent literature identified social media message features predictive of user engagement. Desired information from a patient perspective and use of social media information from a provider perspective in diabetes care is less clear.

Purpose: Our study analyzed diabetes patients' desired information from social media and how such information could be used in conjunction with doctor-patient communication to enhance compliance with recommended care.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!