In this paper, we give an overview of methodological issues related to the use of statistical learning approaches when analyzing high-dimensional genetic data. The focus is set on regression models and machine learning algorithms taking genetic variables as input and returning a classification or a prediction for the target variable of interest; for example, the present or future disease status, or the future course of a disease. After briefly explaining the basic motivation and principle of these methods, we review different procedures that can be used to evaluate the accuracy of the obtained models and discuss common flaws that may lead to over-optimistic conclusions with respect to their prediction performance and usefulness.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00439-019-01996-9DOI Listing

Publication Analysis

Top Keywords

statistical learning
8
learning approaches
8
approaches genetic
4
genetic epidemiology
4
epidemiology complex
4
complex diseases
4
diseases paper
4
paper overview
4
overview methodological
4
methodological issues
4

Similar Publications

Background: The impact of aortic arch (AA) morphology on the management of the procedural details and the clinical outcomes of the transfemoral artery (TF)-transcatheter aortic valve replacement (TAVR) has not been evaluated. The goal of this study was to evaluate the AA morphology of patients who had TF-TAVR using an artificial intelligence algorithm and then to evaluate its predictive value for clinical outcomes.

Materials And Methods: A total of 1480 consecutive patients undergoing TF-TAVR using a new-generation transcatheter heart valve at 12 institutes were included in this retrospective study.

View Article and Find Full Text PDF

Competitive fitness is a fundamental concept in evolutionary biology that captures the ability of organisms to survive, reproduce, and compete for resources in their environment. Competitive fitness is typically assessed in the lab by growing two or more competitors together and measuring the frequency of each at multiple time points. Traditional microbial competitive fitness assays are labor intensive and involve plating on solid medium and counting colonies.

View Article and Find Full Text PDF

Objectives: Placebo effects can relieve acute and chronic pain in both research and clinical treatments by learning mechanisms. However, the application of placebo-based treatment strategies in routine medical care is questioned. The current study investigated the opinions of patients with fibromyalgia and healthy controls regarding learning of placebo effects and their practical applications.

View Article and Find Full Text PDF

Background: The spinal column is a frequent site for metastases, affecting over 30% of solid tumor patients. Identifying the primary tumor is essential for guiding clinical decisions but often requires resource-intensive diagnostics.

Purpose: To develop and validate artificial intelligence (AI) models using noncontrast MRI to identify primary sites of spinal metastases, aiming to enhance diagnostic efficiency.

View Article and Find Full Text PDF

Quantitative measurements produced by mass spectrometry proteomics experiments offer a direct way to explore the role of proteins in molecular mechanisms. However, analysis of such data is challenging due to the large proportion of missing values. A common strategy to address this issue is to utilize an imputed dataset, which often introduces systematic bias into down-stream analyses if the imputation errors are ignored.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!