Various attempts have been made to predict the individual disease risk based on genotype data from genome-wide association studies (GWAS). However, most studies only investigated one or two classification algorithms and feature encoding schemes. In this study, we applied seven different classification algorithms on GWAS case-control data sets for seven different diseases to create models for disease risk prediction. Further, we used three different encoding schemes for the genotypes of single nucleotide polymorphisms (SNPs) and investigated their influence on the predictive performance of these models. Our study suggests that an additive encoding of the SNP data should be the preferred encoding scheme, as it proved to yield the best predictive performances for all algorithms and data sets. Furthermore, our results showed that the differences between most state-of-the-art classification algorithms are not statistically significant. Consequently, we recommend to prefer algorithms with simple models like the linear support vector machine (SVM) as they allow for better subsequent interpretation without significant loss of accuracy.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4540285PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0135832PLOS

Publication Analysis

Top Keywords

disease risk
12
classification algorithms
12
feature encoding
8
risk prediction
8
genome-wide association
8
association studies
8
encoding schemes
8
data sets
8
encoding
5
algorithms
5

Similar Publications

Advances in Diagnosis, Treatment and Prognostic in Aortoiliac Occlusive Disease - A Narrative Review.

Port J Card Thorac Vasc Surg

January 2025

Department of Biomedicine - Unit of Anatomy, Faculty of Medicine, University of Porto; RISE@Health, Porto, Portugal.

Background: Aortoiliac disease (AID) is a variant of peripheral artery disease involving the infrarenal aorta and iliac arteries. Similar to other arterial diseases, aortoiliac disease obstructs blood flow through narrowed lumens or by embolization of plaques. AID, when symptomatic, may present with a triad of claudication, impotence, and absence of femoral pulses, a triad also referred as Leriche Syndrome (LS).

View Article and Find Full Text PDF

Introduction: The present study aimed to explore the epidemiologic threats and factors associated with the coronavirus disease 2019 (COVID-19)-associated mucormycosis (CAM) epidemic that emerged in Egypt during the second COVID-19 wave. The study also aimed to explore the diagnostic features and the role of surgical interventions of CAM on the outcome of the disease in a central referral hospital.

Methodology: The study included 64 CAM patients from a referral hospital for CAM and a similar number of matched controls from COVID-19 patients who did not develop CAM.

View Article and Find Full Text PDF

Introduction: Acute kidney injury involves inflammation and intrinsic renal damage, and is a common complication of severe coronavirus disease 2019 (COVID-19). Baseline chronic kidney disease (CKD) confers an increased mortality risk. We determined the renal long-term outcomes of COVID-19 in patients with baseline CKD, and the risk factors prompting renal replacement therapy (RRT) initiation and mortality.

View Article and Find Full Text PDF

Persistent COVID-19 symptoms and associated factors in a tertiary hospital in Thailand.

J Infect Dev Ctries

December 2024

Division of Pulmonary and Critical Care Medicine, Department of Medicine, Faculty of Medicine, Thammasat University, Pathumthani 12120, Thailand.

Introduction: Coronavirus disease 2019 (COVID-19) is associated with long-term symptoms, but the spectrum of these symptoms remains unclear. We aimed to identify the prevalence and factors associated with persistent symptoms in patients at the post-COVID-19 outpatient clinic.

Methodology: This cross-sectional, observational study included hospitalized severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infected patients followed-up at a post-COVID-19 clinic between September 2021 and January 2022.

View Article and Find Full Text PDF

Introduction: We assessed the prevalence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection and associated socio-occupational factors among delivery riders from a Brazilian city at two time points during the pandemic.

Methodology: Surveys for antibody and viral RNA testing were conducted from November 2020 to January 2021, and from March to May 2021 in a group of 117 delivery riders. A questionnaire on socio-occupational characteristics and coronavirus disease 2019 (COVID-19) preventive measures was completed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!