In GWAS of imaging phenotypes (e.g., by the ENIGMA and CHARGE consortia), the growing number of phenotypes considered presents a statistical challenge that other fields are not experiencing (e.g. psychiatry and the Psychiatric Genetics Consortium). However, the multivariate nature of MRI measurements may also be an advantage as many of the MRI phenotypes are correlated and multivariate methods could be considered. Here, we compared the statistical power of a multivariate GWAS versus the current univariate approach, which consists of multiple univariate analyses. To do so, we used results from twin models to estimate pertinent vectors of SNP effect sizes on brain imaging phenotypes, as well as the residual correlation matrices, necessary to estimate analytically the statistical power. We showed that for subcortical structure volumes and hippocampal subfields, a multivariate GWAS yields similar statistical power to the current univariate approach. Our analytical approach is as accurate but ~ 1000 times faster than simulations and we have released the code to facilitate the investigation of other scenarios, may they be outside the field of imaging genetics.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s10519-018-9936-9DOI Listing

Publication Analysis

Top Keywords

statistical power
16
multivariate gwas
12
power multivariate
8
field imaging
8
imaging genetics
8
imaging phenotypes
8
current univariate
8
univariate approach
8
statistical
5
multivariate
5

Similar Publications

Unveiling diabetes onset: Optimized XGBoost with Bayesian optimization for enhanced prediction.

PLoS One

January 2025

Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh, Kingdom of Saudi Arabia.

Diabetes, a chronic condition affecting millions worldwide, necessitates early intervention to prevent severe complications. While accurately predicting diabetes onset or progression remains challenging due to complex and imbalanced datasets, recent advancements in machine learning offer potential solutions. Traditional prediction models, often limited by default parameters, have been superseded by more sophisticated approaches.

View Article and Find Full Text PDF

Distributed lag models (DLMs) estimate the health effects of exposure over multiple time lags prior to the outcome and are widely used in time series studies. Applying DLMs to retrospective cohort studies is challenging due to inconsistent lengths of exposure history across participants, which is common when using electronic health record databases. A standard approach is to define subcohorts of individuals with some minimum exposure history, but this limits power and may amplify selection bias.

View Article and Find Full Text PDF

We study the problem of testing multiple secondary endpoints conditional on a primary endpoint being significant in a two-stage group sequential procedure, focusing on two secondary endpoints. This extends our previous work with one secondary endpoint. The test for the secondary null hypotheses is a closed procedure.

View Article and Find Full Text PDF

Sample Size Adjustment in Sequential Multiple Assignment Randomized Trials.

Stat Med

February 2025

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY.

Clinical trials are often designed based on limited information about effect sizes and precision parameters with risks of underpowered studies. This is more problematic for SMARTs where strategy effects are based on sequences of treatments. Sample size adjustment offers flexibility through re-estimating sample size during the trial to ensure adequate power at the final analysis.

View Article and Find Full Text PDF

Bioequivalence Design With Sampling Distribution Segments.

Stat Med

February 2025

Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada.

In bioequivalence design, power analyses dictate how much data must be collected to detect the absence of clinically important effects. Power is computed as a tail probability in the sampling distribution of the pertinent test statistics. When these test statistics cannot be constructed from pivotal quantities, their sampling distributions are approximated via repetitive, time-intensive computer simulation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!