Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population.

Jongseok Lee Jae-Sung Lim Younggi Chu Chang Hee Lee Ohk-Hyun Ryu Hyun Hee Choi Yong Soon Park Chulho Kim

J Pers Med

Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea.

Published: August 2020

Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data.

Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve.

Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10-1.15, < 0.001), male sex (OR, 2.91; 95% CI, 1.57-5.38, < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00-1.03, = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99-1.00, = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant ( for AUROC comparison < 0.001).

Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565334	PMC
http://dx.doi.org/10.3390/jpm10030096	DOI Listing

Publication Analysis

Top Keywords

high cacs

cacs

coronary artery

artery calcium

calcium score

machine learning

healthy population

reliable predictor

predict cacs

cacs healthy

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!