Background: Dyslipidemia has emerged as a significant clinical risk, with its associated complications, including atherosclerosis and ischemic cerebrovascular disease, presenting a grave threat to human well-being. Hence, it holds paramount importance to precisely predict the onset of dyslipidemia. This study aims to use ensemble technology to establish a machine learning model for the prediction of dyslipidemia.

Methods: This study included three consecutive years of physical examination data of 2,479 participants, and used the physical examination data of the first two years to predict whether the participants would develop dyslipidemia in the third year. Feature selection was conducted through statistical methods and the analysis of mutual information between features. Five machine learning models, including support vector machine (SVM), logistic regression (LR), random forest (RF), K nearest neighbor (KNN) and extreme gradient boosting (XGBoost), were utilized as base learners to construct the ensemble model. Area under the receiver operating characteristic curve (AUC), calibration curves, and decision curve analysis (DCA) were used to evaluate the model.

Results: Experimental results show that the ensemble model achieves superior performance across several metrics, achieving an AUC of 0.88 ± 0.01 ( < 0.001), surpassing the base learners by margins of 0.04 to 0.20. Calibration curves and DCA exhibited good predictive performance as well. Furthermore, this study explores the minimal necessary feature set for accurate prediction, finding that just the top 12 features were required for dependable outcomes. Among them, HbA1c and CEA are key indicators for model construction.

Conclusions: Our results suggest that the proposed ensemble model has good predictive performance and has the potential to become an effective tool for personal health management.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11540663PMC
http://dx.doi.org/10.3389/fphys.2024.1464744DOI Listing

Publication Analysis

Top Keywords

ensemble model
16
physical examination
12
examination data
12
machine learning
8
base learners
8
calibration curves
8
good predictive
8
predictive performance
8
ensemble
5
model
5

Similar Publications

The Future of HIV: Challenges in meeting the 2030 Ending the HIV Epidemic in the U.S. (EHE) reduction goal.

AIDS

January 2025

Center for Biomedical Modeling, Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine, University of California, Los Angeles, CA.

Objectives: To predict the burden of HIV in the United States (US) nationally and by region, transmission type, and race/ethnicity through 2030.

Methods: Using publicly available data from the CDC NCHHSTP AtlasPlus dashboard, we generated 11-year prospective forecasts of incident HIV diagnoses nationally and by region (South, non-South), race/ethnicity (White, Hispanic/Latino, Black/African American), and transmission type (Injection-Drug Use, Male-to-Male Sexual Contact (MMSC), and Heterosexual Contact (HSC)). We employed weighted (W) and unweighted (UW) n-sub-epidemic ensemble models, calibrated using 12 years of historical data (2008-2019), and forecasted trends for 2020-2030.

View Article and Find Full Text PDF

Background: The prompt and accurate identification of mild cognitive impairment (MCI) is crucial for preventing its progression into more severe neurodegenerative diseases. However, current diagnostic solutions, such as biomarkers and cognitive screening tests, prove costly, time-consuming, and invasive, hindering patient compliance and the accessibility of these tests. Therefore, exploring a more cost-effective, efficient, and noninvasive method to aid clinicians in detecting MCI is necessary.

View Article and Find Full Text PDF

This study aimed to develop and evaluate an efficient method to automatically segment T1- and T2-weighted brain magnetic resonance imaging (MRI) images. We specifically compared the segmentation performance of individual convolutional neural network (CNN) models against an ensemble approach to advance the accuracy of MRI-guided radiotherapy (RT) planning..

View Article and Find Full Text PDF

Efficiently obtaining atomic-scale thermodynamic parameters characterizing crystallization from solution is key to developing the modeling strategies needed in the quest for digital design strategies for industrial crystallization processes. Based on the thermodynamics of crystal nucleation in confined solutions, we develop a simulation framework to efficiently estimate the solubility and surface tension of organic crystals in solution from a few unbiased molecular dynamics simulations at a reference temperature. We then show that such a result can be extended with minimal computational overhead to capture the solubility curve.

View Article and Find Full Text PDF

The consequences of climate change, accelerated by anthropogenic activities, have different effects on different ecosystems, and the severity of these effects is predicted to increase in the near future. The number of studies investigating how forest ecosystems respond to these changes is increasing. However, there remains a significant gap in research concerning how saproxylic organisms-one of the key contributors to the healthy functioning of these fragile ecosystems-will respond to the consequences of climate change.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!