Background: Machine learning (ML) classifiers are increasingly used for predicting cardiovascular disease (CVD) and related risk factors using omics data, although these outcomes often exhibit categorical nature and class imbalances. However, little is known about which ML classifier, omics data, or upstream dimension reduction strategy has the strongest influence on prediction quality in such settings. Our study aimed to illustrate and compare different machine learning strategies to predict CVD risk factors under different scenarios.

Methods: We compared the use of six ML classifiers in predicting CVD risk factors using blood-derived metabolomics, epigenetics and transcriptomics data. Upstream omic dimension reduction was performed using either unsupervised or semi-supervised autoencoders, whose downstream ML classifier performance we compared. CVD risk factors included systolic and diastolic blood pressure measurements and ultrasound-based biomarkers of left ventricular diastolic dysfunction (LVDD; E/e' ratio, E/A ratio, LAVI) collected from 1,249 Finnish participants, of which 80% were used for model fitting. We predicted individuals with low, high or average levels of CVD risk factors, the latter class being the most common. We constructed multi-omic predictions using a meta-learner that weighted single-omic predictions. Model performance comparisons were based on the F1 score. Finally, we investigated whether learned omic representations from pre-trained semi-supervised autoencoders could improve outcome prediction in an external cohort using transfer learning.

Results: Depending on the ML classifier or omic used, the quality of single-omic predictions varied. Multi-omics predictions outperformed single-omics predictions in most cases, particularly in the prediction of individuals with high or low CVD risk factor levels. Semi-supervised autoencoders improved downstream predictions compared to the use of unsupervised autoencoders. In addition, median gains in Area Under the Curve by transfer learning compared to modelling from scratch ranged from 0.09 to 0.14 and 0.07 to 0.11 units for transcriptomic and metabolomic data, respectively.

Conclusions: By illustrating the use of different machine learning strategies in different scenarios, our study provides a platform for researchers to evaluate how the choice of omics, ML classifiers, and dimension reduction can influence the quality of CVD risk factor predictions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11064347PMC
http://dx.doi.org/10.1186/s12911-024-02521-3DOI Listing

Publication Analysis

Top Keywords

cvd risk
28
risk factors
24
machine learning
16
learning strategies
12
dimension reduction
12
semi-supervised autoencoders
12
risk
8
omics data
8
data upstream
8
single-omic predictions
8

Similar Publications

Background: It is estimated that 61% of deaths caused by Cardiovascular Diseases (CVDs) globally are attributed to lifestyle-related risk factors including tobacco use, alcohol abuse, poor diet, and inadequate physical activity. Meanwhile, inadequate knowledge and misperceptions about CVDs are disproportionately increasing the prevalence of CVDs in Africa. Moreover, pre-diagnosis awareness/knowledge about CVDs among patients is essential in shaping the extent and scope of education to be provided by healthcare workers.

View Article and Find Full Text PDF

Supine Blood Pressure and Risk of Cardiovascular Disease and Mortality.

JAMA Cardiol

January 2025

Department of Medicine, Division of General Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts.

Importance: Nocturnal hypertension while asleep is associated with substantial increases in risk of cardiovascular disease (CVD) and death. Whether hypertension while supine is a risk factor associated with CVD independent of seated hypertension remains unknown.

Objective: To investigate the association between supine hypertension and CVD outcomes and by hypertension treatment status.

View Article and Find Full Text PDF

Purpose: This study analyzed longitudinal data to examine whether occupational sitting time is associated with increases in body mass index (BMI) and five-year cardiovascular disease (CVD) risk.

Methods: We included 2,000 employed men and women (aged 31-60) from the German Study on Mental Health at Work (S-MGA) for a BMI analysis and 1,635 participants free of CVD at baseline (2011/2012) for a CVD analysis. Occupational sitting time was categorized into five groups (< 5, 5 to < 15, 15 to < 25, 25 to < 35, and ≥ 35 h per week).

View Article and Find Full Text PDF

Background: The risk of cardiovascular disease (CVD) in patients with chronic kidney disease (CKD) is estimated to be far greater than that in the general population. Adropin regulates endothelial function and may play a role in the pathogenesis of CVD. Angiotensin-converting enzyme inhibitor (ACEI) treatment was reported to have a protective effect on both renal and cardiovascular function.

View Article and Find Full Text PDF

Background: Due to the low contrast between the vascular lumen and vessel wall, conventional computed tomography (CT) is not an effective method for visualizing the vessel wall. The purpose of this study was to assess the feasibility of vessel wall visualization using contrast-enhanced dual-energy CT (DECT)-derived water-calcium material decomposition (WMD) and subtraction-based dark-blood imaging (DBI). An additional objective of this study was to determine the association of descending aorta wall thickness (WT) and wall area (WA) with cardiovascular disease (CVD) risk factors and to ascertain the potential of DECT-derived WT and WA as image markers for identifying individuals at high risk for future CVD.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!