Genome-wide data with millions of single-nucleotide polymorphisms (SNPs) can be highly correlated due to linkage disequilibrium (LD). The ultrahigh dimensionality of big data brings unprecedented challenges to statistical modeling such as noise accumulation, the curse of dimensionality, computational burden, spurious correlations, and a processing and storing bottleneck. The traditional statistical approaches lose their power due to [Formula: see text] (n is the number of observations and p is the number of SNPs) and the complex correlation structure among SNPs. In this article, we propose an integrated distance correlation ridge regression (DCRR) approach to accommodate the ultrahigh dimensionality, joint polygenic effects of multiple loci, and the complex LD structures. Initially, a distance correlation (DC) screening approach is used to extensively remove noise, after which LD structure is addressed using a ridge penalized multiple logistic regression (LRR) model. The false discovery rate, true positive discovery rate, and computational cost were simultaneously assessed through a large number of simulations. A binary trait of Arabidopsis thaliana, the hypersensitive response to the bacterial elicitor AvrRpm1, was analyzed in 84 inbred lines (28 susceptibilities and 56 resistances) with 216,130 SNPs. Compared to previous SNP discovery methods implemented on the same data set, the DCRR approach successfully detected the causative SNP while dramatically reducing spurious associations and computational time.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4788225PMC
http://dx.doi.org/10.1534/genetics.115.179507DOI Listing

Publication Analysis

Top Keywords

linkage disequilibrium
8
genome-wide data
8
ultrahigh dimensionality
8
distance correlation
8
dcrr approach
8
discovery rate
8
exploiting linkage
4
disequilibrium ultrahigh-dimensional
4
ultrahigh-dimensional genome-wide
4
data
4

Similar Publications

Genomic selection for resistance to one pathogenic strain of in blue mussel .

Front Genet

January 2025

Ifremer, Ressources Biologiques et Environnement (RBE)-ASIM, La Tremblade, France.

Introduction: The blue mussel is one of the major aquaculture species worldwide. In France, this species faces a significant threat from infectious disease outbreaks in both mussel farms and the natural environment over the past decade. Diseases caused by various pathogens, particularly spp.

View Article and Find Full Text PDF

Association of antihypertensive drug target genes with stroke subtypes: A Mendelian randomization study.

J Stroke Cerebrovasc Dis

January 2025

School of Medicine, South China University of Technology, Guangzhou, China; Department of Cardiology, Hypertension Research Laboratory, Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China.

Objective: Epidemiological and genetic studies have elucidated the effect of antihypertensive medication (AHM) on stroke subtypes varying upon drug classes, but which drug target genes, how, and where mediated this association remains unknown. We aimed to investigate the impact of AHM on stroke subtypes.

Methods: Genetic instruments for the expression of AHM target genes were identified with expression quantitative trait loci in blood, which should be associated with systolic blood pressure (SBP) to proxy for the effect of AHM.

View Article and Find Full Text PDF

Cross-trait multivariate GWAS confirms health implications of pubertal timing.

Nat Commun

January 2025

Laboratory of Molecular Translational Medicine, Center for Translational Medicine, West China Second University Hospital, Sichuan University, Chengdu, China.

Pubertal timing is highly variable and is associated with long-term health outcomes. Phenotypes associated with pubertal timing include age at menarche, age at voice break, age at first facial hair and growth spurt, and pubertal timing seems to have a shared genetic architecture between the sexes. However, puberty phenotypes have primarily been assessed separately, failing to account for shared genetics, which limits the reliability of the purported health implications.

View Article and Find Full Text PDF

Objective: The study aimed to investigate the causal relationship between serum 25-hydroxyvitamin D (25(OH)D) levels and epilepsy using Mendelian randomization (MR), thereby addressing confounding and reverse causality issues in observational studies.

Methods: We employed a two-sample bidirectional MR design utilizing summary-level data from the IEU OpenGWAS project. Serum 25(OH)D levels were analyzed using the publicly available dataset ebi-a-GCST90000618, which included 496,946 European samples and 68,960,93 SNPs.

View Article and Find Full Text PDF

Habitat fragmentation increases the risk of local extinction of small reptiles: A case study on Phrynocephalus przewalskii.

Ecotoxicol Environ Saf

January 2025

Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, School of Life Sciences, Lanzhou University, Lanzhou 730000, China. Electronic address:

Habitat fragmentation represents a multifaceted global conservation threat, exerting both direct and indirect effects on individual animals and communities. Reptiles, particularly smaller species with limited migratory abilities, are especially vulnerable to these changes. This study examines how small reptiles adapt their life history strategies in fragmented habitats and determines whether their responses are primarily due to phenotypic plasticity or genetic adaptation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!