Motivation: Genome-wide association studies (GWAS) are an integral tool for studying the architecture of complex genotype and phenotype relationships. Linear mixed models (LMMs) are commonly used to detect associations between genetic markers and a trait of interest, while at the same time allowing to account for population structure and cryptic relatedness. Assumptions of LMMs include a normal distribution of the residuals and that the genetic markers are independent and identically distributed-both assumptions are often violated in real data. Permutation-based methods can help to overcome some of these limitations and provide more realistic thresholds for the discovery of true associations. Still, in practice, they are rarely implemented due to the high computational complexity.
Results: We propose permGWAS, an efficient LMM reformulation based on 4D tensors that can provide permutation-based significance thresholds. We show that our method outperforms current state-of-the-art LMMs with respect to runtime and that permutation-based thresholds have lower false discovery rates for skewed phenotypes compared to the commonly used Bonferroni threshold. Furthermore, using permGWAS we re-analyzed more than 500 Arabidopsis thaliana phenotypes with 100 permutations each in less than 8 days on a single GPU. Our re-analyses suggest that applying a permutation-based threshold can improve and refine the interpretation of GWAS results.
Availability And Implementation: permGWAS is open-source and publicly available on GitHub for download: https://github.com/grimmlab/permGWAS.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9486594 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btac455 | DOI Listing |
Genome
January 2025
Damietta University Faculty of Science, New Damietta, Damietta, Egypt;
Polyamine oxidase (PAOs) are enzymes associated with polyamine catabolism and play important roles in growth and development and stress tolerance of plants. In the present study, genome-wide discovery and analysis of the PAO family in sorghum was done utilizing model PAO of Arabidopsis. Six PAO genes were found using publicly available genomic data.
View Article and Find Full Text PDFBrief Bioinform
November 2024
Department of Neurosurgery, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100730, China.
This study aimed to investigate the genetic association between glioblastoma (GBM) and unsupervised deep learning-derived imaging phenotypes (UDIPs). We employed a combination of genome-wide association study (GWAS) data, single-nucleus RNA sequencing (snRNA-seq), and scPagwas (pathway-based polygenic regression framework) methods to explore the genetic links between UDIPs and GBM. Two-sample Mendelian randomization analyses were conducted to identify causal relationships between UDIPs and GBM.
View Article and Find Full Text PDFPLoS One
January 2025
Department of Psychiatry, University of California San Diego, La Jolla, CA, United States of America.
Background: Bipolar Disorder (BD) is a complex disease. It is heterogeneous, both at the phenotypic and genetic level, although the extent and impact of this heterogeneity is not fully understood. One way to assess this heterogeneity is to look for patterns in the subphenotype data.
View Article and Find Full Text PDFPLoS One
January 2025
Department of Entomology and Acarology, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, São Paulo, Brazil.
Insecticide resistance is a major problem in food production, environmental sustainability, and human health. The cotton bollworm Helicoverpa armigera is a globally distributed crop pest affecting over 300 crop species. H.
View Article and Find Full Text PDFJAMA Netw Open
January 2025
Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, Maryland.
Importance: Sensitivity to environmental stress and adversity may influence lung cancer risk, highlighting a critical link between psychosocial factors and cancer etiology.
Objective: To evaluate whether genetically estimated sensitivity to environmental stress and adversity is associated with lung cancer risk.
Design, Setting, And Participants: Data were obtained from a genome-wide association study identifying 37 independent genetic variants strongly associated with sensitivity to environmental stress and adversity and a cross-ancestry genome-wide meta-analysis from the International Lung Cancer Consortium.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!