Background: Diabetes, as a significant disease affecting public health, requires early detection for effective management and intervention. However, imbalanced datasets pose a challenge to accurate diabetes prediction. This imbalance often results in models performing poorly in predicting minority classes, affecting overall diagnostic performance.
Objectives: To address this issue, this study employs a combination of Synthetic Minority Over-sampling Technique (SMOTE) and Random Under-Sampling (RUS) for data balancing and uses Optuna for hyperparameter optimization of machine learning models. This approach aims to fill the gap in current research concerning data balancing and model optimization, thereby improving prediction accuracy and computational efficiency.
Methods: First, the study uses SMOTE and RUS methods to process the imbalanced diabetes dataset, balancing the data distribution. Then, Optuna is utilized to optimize the hyperparameters of the LightGBM model to enhance its performance. During the experiment, the effectiveness of the proposed methods is evaluated by comparing the training results of the dataset before and after balancing.
Results: The experimental results show that the enhanced LightGBM-Optuna model improves the accuracy from 97.07% to 97.11%, and the precision from 97.17% to 98.99%. The time required for a single search is only 2.5 seconds. These results demonstrate the superiority of the proposed method in handling imbalanced datasets and optimizing model performance.
Conclusions: The study indicates that combining SMOTE and RUS data balancing algorithms with Optuna for hyperparameter optimization can effectively enhance machine learning models, especially in dealing with imbalanced datasets for diabetes prediction.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11324958 | PMC |
http://dx.doi.org/10.1038/s41387-024-00324-z | DOI Listing |
Adv Clin Chem
January 2025
Department of Genetics, College of Basic Medical Sciences, Jilin University, Changchun, China. Electronic address:
Visceral adipose tissue, a type of abdominal adipose tissue, is highly involved in lipolysis. Because increased visceral adiposity is strongly associated with the metabolic complications related with obesity, such as type 2 diabetes and cardiovascular disease, there is a need for precise, targeted, personalized and site-specific measures clinically. Existing studies showed that ectopic fat accumulation may be characterized differently among different populations due to complex genetic architecture and non-genetic or epigenetic components, ie, Asians have more and Africans have less visceral fat vs Europeans.
View Article and Find Full Text PDFEndocr Pract
January 2025
Department of Endocrinology and Metabolism, Kyoto Prefectural University of Medicine, Graduate School of Medical Science, 465 Kajii-cho, Kawaramachi-Hirokoji, Kamigyo-ku, Kyoto, 602-8566, Japan.
Objectives: There is a relationship between insulin resistance and metabolic dysfunction-associated steatotic liver disease (MASLD) and the estimated glucose disposal rate (eGDR) has been reported as a surrogate marker of insulin resistance. This study aimed to investigate the association between eGDR and the incident MASLD, and compare the ability to predict incident MASLD with other insulin resistance markers.
Methods: Retrospective cohort data from a health check-up program were analyzed.
Ann Endocrinol (Paris)
January 2025
Department of Endocrinology, Diabetes and Nutrition, Nancy Regional University Hospital, Nancy, France.
Purpose: Pituitary neuroendocrine tumor (PitNET), excluding prolactinoma, often requires endoscopic endonasal surgery (EES). Identifying predictive factors for complications, and particularly rare ones such as hypogonadotropic hypogonadism (HH) that may affect fertility, is challenging. This study investigated de-novo postoperative HH and its potential impact on fertility.
View Article and Find Full Text PDFJ Vasc Access
January 2025
College of Nursing, Xuzhou Medical University, Xuzhou, Jiangsu, China.
Objective: To develop and validate a nomogram model for predicting central venous catheter-related infections (CRI) in patients with maintenance hemodialysis (MHD).
Methods: MHD patients with central venous catheters (CVCs) visiting the outpatient hemodialysis (HD) center of Xuzhou Medical University Affiliated Hospital from January 2020 to December 2023 were retrospectively selected through a HD monitoring system. Patient data were collected, and the patients were divided into training and validation sets in a 7:3 ratio.
Clin Interv Aging
January 2025
Department of Neurology, the Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, Jiangsu, People's Republic of China.
Purpose: Research suggests that insulin resistance (IR) is associated with acute ischemic stroke (AIS) and depression. The use of insulin-based IR assessments is complicated. Therefore, we explored the relationship between four non-insulin-based IR indices and post-stroke depression (PSD).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!