A distance-based test of association between paired heterogeneous genomic data.

Bioinformatics

Department of Imaging Sciences, Institute of Clinical Sciences, Hammersmith Campus, Statistics Section, Department of Mathematics, South Kensington Campus and Department of Surgery and Cancer, Ovarian Cancer Action Research Centre, Hammersmith Campus, Imperial College London, London W12 0NN, UK.

Published: October 2013

Motivation: Due to rapid technological advances, a wide range of different measurements can be obtained from a given biological sample including single nucleotide polymorphisms, copy number variation, gene expression levels, DNA methylation and proteomic profiles. Each of these distinct measurements provides the means to characterize a certain aspect of biological diversity, and a fundamental problem of broad interest concerns the discovery of shared patterns of variation across different data types. Such data types are heterogeneous in the sense that they represent measurements taken at different scales or represented by different data structures.

Results: We propose a distance-based statistical test, the generalized RV (GRV) test, to assess whether there is a common and non-random pattern of variability between paired biological measurements obtained from the same random sample. The measurements enter the test through the use of two distance measures, which can be chosen to capture a particular aspect of the data. An approximate null distribution is proposed to compute P-values in closed-form and without the need to perform costly Monte Carlo permutation procedures. Compared with the classical Mantel test for association between distance matrices, the GRV test has been found to be more powerful in a number of simulation settings. We also demonstrate how the GRV test can be used to detect biological pathways in which genetic variability is associated to variation in gene expression levels in an ovarian cancer sample, and present results obtained from two independent cohorts.

Availability: R code to compute the GRV test is freely available from http://www2.imperial.ac.uk/∼gmontana

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btt450DOI Listing

Publication Analysis

Top Keywords

grv test
16
test association
8
variation gene
8
gene expression
8
expression levels
8
data types
8
test
7
data
5
measurements
5
distance-based test
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!