Canonical correlation analysis for gene-based pleiotropy discovery.

PLoS Comput Biol

School of Social and Community Medicine, University of Bristol, Bristol, United Kingdom; MRC Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom.

Published: October 2014

AI Article Synopsis

  • Genome-wide association studies have identified numerous genetic variants linked to complex traits and diseases, leading to increased interest in analyzing variants for their relationships with multiple phenotypes (pleiotropy) and vice versa.
  • Canonical Correlation Analysis (CCA) is introduced as a method to explore these relationships by measuring correlations between multiple sets of variables, but requires careful selection of genetic attributes relative to sample size.
  • In a study involving 4286 women from the British Women's Heart and Health Study, the application of CCA revealed improved statistical power for detecting known genetic associations and highlighted new pleiotropic links, such as specific genes related to triglyceride levels and heart-related phenotypes.

Article Abstract

Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy) and for testing multiple variants for association with a single phenotype (gene-based association tests). Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA) measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study), we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1) with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4199483PMC
http://dx.doi.org/10.1371/journal.pcbi.1003876DOI Listing

Publication Analysis

Top Keywords

genetic variants
12
canonical correlation
8
correlation analysis
8
variants association
8
association multiple
8
multiple phenotypes
8
gene-based association
8
statistical power
8
nrg1 left
8
left ventricular
8

Similar Publications

Background: Predicting response to targeted cancer therapies increasingly relies on both simple and complex genetic biomarkers. Comprehensive genomic profiling using high-throughput assays must be evaluated for reproducibility and accuracy compared with existing methods.

Methods: This study is a multicenter evaluation of the Oncomine™ Comprehensive Assay Plus (OCA Plus) Pan-Cancer Research Panel for comprehensive genomic profiling of solid tumors.

View Article and Find Full Text PDF

Background: Glucagon-like peptide-1 receptor agonists (GLP1RAs) are widely used in manageing type 2 diabetes mellitus and weight control. Their potential in treating ageing-related diseases has been gaining attention in recent years. However, the long-term effects of GLP1RAs on these diseases have yet to be fully revealed.

View Article and Find Full Text PDF

GDBr: genomic signature interpretation tool for DNA double-strand break repair mechanisms.

Nucleic Acids Res

January 2025

Department of Convergent Bioscience and Informatics, College of Bioscience and Biotechnology, Chungnam National University, 99, Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea.

Large genetic variants can be generated via homologous recombination (HR), such as polymerase theta-mediated end joining (TMEJ) or single-strand annealing (SSA). Given that these HR-based mechanisms leave specific genomic signatures, we developed GDBr, a genomic signature interpretation tool for DNA double-strand break repair mechanisms using high-quality genome assemblies. We applied GDBr to a draft human pangenome reference.

View Article and Find Full Text PDF

Prader-Willi syndrome (PWS) is a genetic disorder associated with baseline respiratory impairment caused by multiple contributing etiologies. While this may be expected to increase the risk of severe COVID-19 infections in PWS patients, survey studies have suggested paradoxically low disease severity. To better characterize the course of COVID-19 infection in patients with PWS, this study analyses the outcomes of hospitalizations for COVID-19 among patients with and without PWS.

View Article and Find Full Text PDF

SOX9 encodes an SRY-related transcription factor critical for chondrogenesis and sex determination among other processes. Loss-of-function variants cause campomelic dysplasia and Pierre Robin Sequence, while both gain- and loss-of-function variants cause disorders of sex development. SOX9 has also been linked to scoliosis and cancers, but variants are undetermined.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!