Control of false discoveries in grouped hypothesis testing for eQTL data.

Pratyaydipta Rudra Yi-Hui Zhou Andrew Nobel Fred A Wright

BMC Bioinformatics

Bioinformatics Research Center, Departments of Statistics and Biological Sciences, North Carolina State University, Raleigh, NC, USA.

Published: April 2024

Background: Expression quantitative trait locus (eQTL) analysis aims to detect the genetic variants that influence the expression of one or more genes. Gene-level eQTL testing forms a natural grouped-hypothesis testing strategy with clear biological importance. Methods to control family-wise error rate or false discovery rate for group testing have been proposed earlier, but may not be powerful or easily apply to eQTL data, for which certain structured alternatives may be defensible and may enable the researcher to avoid overly conservative approaches.

Results: In an empirical Bayesian setting, we propose a new method to control the false discovery rate (FDR) for grouped hypotheses. Here, each gene forms a group, with SNPs annotated to the gene corresponding to individual hypotheses. The heterogeneity of effect sizes in different groups is considered by the introduction of a random effects component. Our method, entitled Random Effects model and testing procedure for Group-level FDR control (REG-FDR), assumes a model for alternative hypotheses for the eQTL data and controls the FDR by adaptive thresholding. As a convenient alternate approach, we also propose Z-REG-FDR, an approximate version of REG-FDR, that uses only Z-statistics of association between genotype and expression for each gene-SNP pair. The performance of Z-REG-FDR is evaluated using both simulated and real data. Simulations demonstrate that Z-REG-FDR performs similarly to REG-FDR, but with much improved computational speed.

Conclusion: Our results demonstrate that the Z-REG-FDR method performs favorably compared to other methods in terms of statistical power and control of FDR. It can be of great practical use for grouped hypothesis testing for eQTL analysis or similar problems in statistical genomics due to its fast computation and ability to be fit using only summary data.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11007981	PMC
http://dx.doi.org/10.1186/s12859-024-05736-3	DOI Listing

Publication Analysis

Top Keywords

eqtl data

control false

grouped hypothesis

hypothesis testing

testing eqtl

eqtl analysis

false discovery

discovery rate

random effects

demonstrate z-reg-fdr

Similar Publications

A cross-tissue transcriptome-wide association study identifies new key genes in ischemic stroke.

Gene

January 2025

Shengli Clinical Medical College of Fujian Medical University, Fuzhou, Fujian, China; Department of Neurology, Fujian Provincial Hospital, Fuzhou, Fujian, China; Fujian Key Laboratory of Medical Analysis, Fujian Academy of Medical Sciences, Fuzhou, Fujian, China. Electronic address:

Zhiwei Song Yupeng Han Wangyu Li Yiya Xu Yingchao He

Background: Ischemic stroke (IS) is an important disease causing death and disability worldwide, and further investigation of IS-related genes through genome-wide association study (GWAS) data is valuable.

Methods: The study included GWAS data from 62,100 IS patients of European origin and 1,234,808 controls in a cross-tissue transcriptome association study (TWAS). A joint analysis was first performed by the Unified Test for Molecular Markers (UTMOST) and FUSION methods.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Mayo Clinic, Jacksonville, FL, USA.

Joseph S Reddy Laura M Heath Mariet Allen William L Poehlman Yanling Wang

Background: Alzheimer's disease (AD) is a devastating neurodegenerative disorder with few therapies to treat, mitigate or prevent its onset. Understanding of this disease is predominantly based on research in non-Hispanic Whites (NHW) although AD disproportionately affects African Americans (AA) and Latin Americans (LA), underrepresented in AD research. To address this knowledge gap, the Accelerating Medicine Partnership for Alzheimer's Disease (AMP-AD) Diversity Working Group was launched to generate multi-omics data from post-mortem brain tissue from donors of predominantly AA and LA descent.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

University of Pittsburgh School of Public Health, Pittsburgh, PA, USA.

Vibha Acharya Kang-Hsien Fan Beth E Snitz Mary Ganguli Steven T DeKosky

Background: Many complex traits and diseases show sex-specific biases in clinical presentation and prevalence. For instance, two-thirds of AD cases are female. Studies suggest that women might have higher cognitive reserve but steeper cognitive decline in older age.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Chambers-Grundy Center for Transformative Neuroscience, Department of Brain Health, School of Integrated Health Sciences, University of Nevada Las Vegas, Las Vegas, NV, USA.

Feixiong Cheng Yuan Hou Pengyue Zhang Fan Fan Jonathan L Haines

Background: Although high-throughput DNA/RNA sequencing technologies have generated massive genetic and genomic data in human disease, translation of these findings into new patient treatment has not materialized by lack of effective approaches, such as Artificial Intelligence (AL) and Machine Learning (ML) tools.

Method: To address this problem, we have used AI/ML approaches, Mendelian randomization (MR), and large patient's genetic and functional genomic data to evaluate druggable targets using Alzheimer's disease (AD) as a prototypical example. We utilized the genomic instruments from 9 expression quantitative trait loci (eQTL) and 3 protein quantitative trait loci (pQTL) datasets across five human brain regions from three biobanks.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA.

Liyong Wang Wanying Xu Farid Rajabli Katrina Celis Leina Lu

Background: Annotation of target genes of non-coding GWAS loci remains a challenge since 1) regulatory elements identified by GWAS can be metabases away from its actual target, 2) one regulatory element can target multiple genes, and 3) multiple regulatory elements can target one gene. AD GWAS in populations with different ancestries have identified different loci, suggesting ancestry-specific genetic risks. To understand the connection between associated loci (potential regulatory elements) and their target genes, we conducted Hi-C analysis in frontal cortex of African American (AA) and Non-Hispanic Whites (NHW) AD patients to map chromatin loops, which often represent enhancer-promoter (EP) interactions.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!