Background: Osteoporosis is becoming more common worldwide, imposing a substantial burden on individuals and society. The onset of osteoporosis is subtle, early detection is challenging, and population-wide screening is infeasible. Thus, there is a need to develop a method to identify those at high risk for osteoporosis.

Objective: This study aimed to develop a machine learning algorithm to effectively identify people with low bone density, using readily available demographic and blood biochemical data.

Methods: Using NHANES 2017-2020 data, participants over 50 years old with complete femoral neck BMD data were selected. This cohort was randomly divided into training (70%) and test (30%) sets. Lasso regression was used to select the variables for six machine learning models built on the training data: logistic regression (LR), support vector machine (SVM), gradient boosting machine (GBM), naive Bayes (NB), artificial neural network (ANN) and random forest (RF). NHANES data from the 2013-2014 cycle served as an external validation set to verify the models' generalizability. Model discrimination was assessed via AUC, accuracy, sensitivity, specificity, precision and F1 score. Calibration curves were used to evaluate goodness of fit, and decision curves to assess clinical utility. The SHAP framework was used to analyze variable importance.
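The discrimination metrics named in the Methods can be made concrete with a minimal sketch. This is not the authors' code: the function names (`auc`, `classification_metrics`), the 0.5 decision threshold, and the toy labels and scores are assumptions for illustration only. Here a label of 1 stands for low bone density, and AUC is computed as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half.

```python
def auc(y_true, y_score):
    """Rank-based AUC: share of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    # A tie between a positive and a negative score counts as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def _ratio(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def classification_metrics(y_true, y_score, threshold=0.5):
    """Threshold the scores and derive the confusion-matrix metrics."""
    y_pred = [1 if s >= threshold else 0 for s in y_score]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    sensitivity = _ratio(tp, tp + fn)   # recall for the low-BMD class
    specificity = _ratio(tn, tn + fp)
    precision = _ratio(tp, tp + fp)
    return {
        "accuracy": _ratio(tp + tn, len(pairs)),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": _ratio(2 * precision * sensitivity, precision + sensitivity),
    }


# Toy data (hypothetical, not NHANES): 1 = low bone density, 0 = normal.
y_true = [1, 1, 0, 0, 1, 0]
y_score = [0.9, 0.6, 0.4, 0.2, 0.3, 0.7]

print("AUC:", round(auc(y_true, y_score), 3))
print(classification_metrics(y_true, y_score))
```

AUC does not depend on a threshold, whereas the other five metrics do, so their values shift with the cut-off chosen for labelling a participant as low bone density.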

Results: A total of 3,545 participants were included in the internal validation set of this study, of whom 1,870 had normal bone density and 1,675 had low bone density. Lasso regression selected 19 variables. In the test set, AUC was 0.785 (LR), 0.780 (SVM), 0.775 (GBM), 0.729 (NB), 0.771 (ANN), and 0.768 (RF). The LR model had the best discrimination, a better calibration curve fit, and the best clinical net benefit on the decision curve, and it also showed good predictive power in the external validation dataset. The top variables in the LR model were age, BMI, gender, creatine phosphokinase, total cholesterol and alkaline phosphatase.
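The net benefit plotted on a decision curve is conventionally defined, at a threshold probability p_t, as TP/N - (FP/N) * p_t / (1 - p_t). The sketch below assumes that standard definition; the abstract does not state which formula or software the authors used. The function names and the toy data are hypothetical.

```python
def net_benefit(y_true, y_score, threshold):
    """Net benefit of acting on everyone whose score is >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    if n == 0:
        raise ValueError("y_true must not be empty")
    tp = sum(1 for t, s in zip(y_true, y_score) if t == 1 and s >= threshold)
    fp = sum(1 for t, s in zip(y_true, y_score) if t == 0 and s >= threshold)
    # Odds of the threshold: how many false positives one true positive is worth.
    weight = threshold / (1.0 - threshold)
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: classify every participant as positive."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Toy data (hypothetical, not NHANES): 1 = low bone density, 0 = normal.
y_true = [1, 1, 0, 0, 1, 0]
y_score = [0.9, 0.6, 0.4, 0.2, 0.3, 0.7]

for pt in (0.25, 0.5):
    print(pt, net_benefit(y_true, y_score, pt), net_benefit_treat_all(y_true, pt))
```

A model adds clinical value at a given threshold only when its net benefit exceeds both the treat-all reference and the treat-none reference, which is zero by definition.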

Conclusion: The machine learning model demonstrated effective classification of low BMD using blood biomarkers. This could aid clinical decision making for osteoporosis prevention and management.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11080984
DOI: http://dx.doi.org/10.3389/fpubh.2024.1347219
