Motivation: Genome-wide association studies (GWAS) are an integral tool for studying the architecture of complex genotype-phenotype relationships. Linear mixed models (LMMs) are commonly used to detect associations between genetic markers and a trait of interest while accounting for population structure and cryptic relatedness. LMMs assume that the residuals are normally distributed and that the genetic markers are independent and identically distributed; both assumptions are often violated in real data. Permutation-based methods can help to overcome some of these limitations and provide more realistic thresholds for the discovery of true associations. Still, in practice, they are rarely implemented because of their high computational complexity.

Results: We propose permGWAS, an efficient LMM reformulation based on 4D tensors that can provide permutation-based significance thresholds. We show that our method outperforms current state-of-the-art LMMs with respect to runtime and that permutation-based thresholds yield lower false discovery rates for skewed phenotypes than the commonly used Bonferroni threshold. Furthermore, using permGWAS, we re-analyzed more than 500 Arabidopsis thaliana phenotypes with 100 permutations each in less than 8 days on a single GPU. Our re-analyses suggest that applying a permutation-based threshold can improve and refine the interpretation of GWAS results.
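To illustrate the general idea of a permutation-based significance threshold, here is a minimal NumPy sketch. It is not the permGWAS implementation: it assumes a simple per-marker correlation test with a normal approximation in place of an LMM, ignores population structure, and uses simulated data. The function names `marker_pvalues` and `permutation_threshold` are hypothetical. The threshold is taken as the alpha-quantile of the minimum p-value observed across permutations of the phenotype, which is one common way to control the family-wise error rate.

```python
import numpy as np
from math import erfc, sqrt


def marker_pvalues(X, y):
    """Two-sided p-values from per-marker correlation tests.

    Assumption: a plain correlation test with a normal approximation
    stands in for the LMM test statistic used in practice.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    denom = np.where(denom == 0, np.inf, denom)  # constant markers get r = 0
    r = np.clip((Xc.T @ yc) / denom, -0.999999, 0.999999)
    t = r * np.sqrt((n - 2) / (1 - r ** 2))
    # Two-sided tail probability of a standard normal variable.
    return np.array([erfc(abs(v) / sqrt(2)) for v in t])


def permutation_threshold(X, y, n_perm=100, alpha=0.05, seed=0):
    """Alpha-quantile of the minimum p-value over phenotype permutations."""
    rng = np.random.default_rng(seed)
    min_p = np.empty(n_perm)
    for i in range(n_perm):
        # Permuting y breaks any genotype-phenotype association while
        # keeping the phenotype distribution (including skew) intact.
        min_p[i] = marker_pvalues(X, rng.permutation(y)).min()
    return float(np.quantile(min_p, alpha))


# Simulated example: 200 samples, 500 markers, skewed phenotype.
rng = np.random.default_rng(1)
X = rng.binomial(2, 0.3, size=(200, 500)).astype(float)
y = rng.exponential(size=200)

perm_thr = permutation_threshold(X, y, n_perm=100, alpha=0.05)
bonf_thr = 0.05 / X.shape[1]
print(f"permutation threshold: {perm_thr:.2e}, Bonferroni: {bonf_thr:.2e}")
```

With only 100 permutations the estimated quantile is coarse. The sketch is meant to show the mechanics, not to reproduce the reported results.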

Availability and implementation: permGWAS is open source and publicly available on GitHub: https://github.com/grimmlab/permGWAS.

Supplementary Information: Supplementary data are available at Bioinformatics online.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9486594
DOI: http://dx.doi.org/10.1093/bioinformatics/btac455

