Objectives: Osteoporosis, prevalent among the elderly population, is primarily diagnosed through bone mineral density (BMD) testing, which has limitations in early detection. This study aims to develop and validate a machine learning approach for osteoporosis identification by integrating demographic, laboratory, and questionnaire data, offering a more practical and effective screening alternative.
Methods: Data from the National Health and Nutrition Examination Survey were analyzed to explore factors linked to osteoporosis. After cleaning, 8766 participants with 223 variables were studied. Minimum Redundancy Maximum Relevance and SelectKBest were employed to select the important features. Four machine learning algorithms (RF, NN, LightGBM, and XGBoost) were applied to identify osteoporosis, and their performance was compared. Data balancing was done using SMOTE, and metrics such as F1 score and AUC were evaluated for each algorithm.
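The data-balancing step can be illustrated with a minimal sketch of the interpolation idea behind SMOTE. It is not the study's code: the function name `smote_oversample`, the toy minority points, and the parameters `n_new`, `k`, and `seed` are assumptions made for illustration, and a real analysis would more likely rely on an established implementation.

```python
import random
from math import dist


def smote_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by SMOTE-style interpolation.

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical two-feature minority class (values are made up, not study data)
minority = [(1.0, 2.0), (1.5, 1.8), (1.2, 2.4), (0.9, 2.1)]
new_points = smote_oversample(minority, n_new=6, k=3)
```

Oversampling of this kind is usually applied to the training split only, so that synthetic points do not leak into the evaluation data.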
Results: The LightGBM model outperformed others with an F1 score of 0.914, an MCC of 0.831, and an AUC of 0.970 on the training set. On the test set, it achieved an F1 score of 0.912, an MCC of 0.826, and an AUC of 0.972. Top predictors for osteoporosis were height, age, and sex.
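For readers less familiar with the reported metrics, the sketch below shows how the F1 score and the Matthews correlation coefficient (MCC) follow from a binary confusion matrix. The helper name `f1_and_mcc` and the counts in the example are hypothetical and are not taken from the study.

```python
from math import sqrt


def f1_and_mcc(tp, fp, fn, tn):
    """Return (F1, MCC) computed from binary confusion-matrix counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if precision + recall
        else 0.0
    )
    # MCC uses all four cells, so it stays informative under class imbalance
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return f1, mcc


# Made-up counts for illustration only
f1, mcc = f1_and_mcc(tp=90, fp=10, fn=10, tn=90)
```

With these illustrative counts, precision and recall are both 0.9, giving an F1 of 0.9 and an MCC of 0.8.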
Conclusions: This study demonstrates the potential of machine learning models in assessing an individual's risk of developing osteoporosis, a condition that significantly impacts quality of life and imposes substantial healthcare costs. The superior performance of the LightGBM model suggests a promising tool for early detection and personalized prevention strategies. Importantly, identifying height, age, and sex as top predictors offers critical insights into the demographic and physiological factors that clinicians should consider when evaluating patients' risk profiles.
DOI: http://dx.doi.org/10.1186/s13104-025-07089-3
JMIR Med Inform
March 2025
LynxCare Inc, Leuven, Belgium.
Background: Processing data from electronic health records (EHRs) to build research-grade databases is a lengthy and expensive process. Modern arthroplasty practice commonly uses multiple sites of care, including clinics and ambulatory care centers. However, most private data systems prevent obtaining usable insights for clinical practice.
JMIR Res Protoc
March 2025
Institute for Data Science and Informatics, University of Missouri, Columbia, MO, United States.
Background: Amyotrophic lateral sclerosis (ALS) leads to rapid physiological and functional decline before causing untimely death. Current best-practice approaches to interdisciplinary care are unable to provide adequate monitoring of patients' health. Passive in-home sensor systems enable 24×7 health monitoring.
J Med Chem
March 2025
State Key Laboratory of Discovery and Utilization of Functional Components in Traditional Chinese Medicine; Shanghai Frontiers Science Center of TCM Chemical Biology; Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China.
The anticancer agent irinotecan often induces severe delayed-onset diarrhea; inhibiting human carboxylesterase 2A (hCES2A) can significantly alleviate irinotecan-triggered gut toxicity (ITGT). This work presents an efficient workflow for designing and developing novel efficacious hCES2A inhibitors. A well-trained machine learning model identified as a lead compound, while compound was developed as a novel time-dependent hCES2A inhibitor (IC = 0.
Mol Inform
March 2025
Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Vietnam.
Over the past decade, graph neural networks (GNNs) have emerged as a powerful neural architecture for various graph-structured data modelling and task-driven representation learning problems. Recent studies have highlighted the remarkable capabilities of GNNs in handling complex graph representation learning tasks, achieving state-of-the-art results in node/graph classification, regression, and generation. However, most traditional GNN-based architectures such as GCN and GraphSAGE still face several challenges in preserving multi-scale topological structures.
J AOAC Int
March 2025
Department of Chemistry, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, K1S 5B6 Canada.
Background: Plant-based milk alternatives (PBMA) are increasingly popular due to rising lactose intolerance and environmental concerns over traditional dairy products. However, limited efforts have been made to develop rapid authentication methods to verify their biological origin.
Objective: In this study, we developed a rapid, on-site analytical method for the authentication and identification of PBMA made from six different plant species, using a portable Raman spectrometer coupled with machine learning.