Knowledge-constrained K-medoids Clustering of Regulatory Rare Alleles for Burden Tests.

Evol Comput Mach Learn Data Min Bioinform

Center for Human Genetics Research, Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA.

Published: January 2013

Rarely occurring genetic variants are hypothesized to influence human diseases, but statistically associating these rare variants to disease is challenging due to a lack of statistical power in most feasibly sized datasets. Several statistical tests have been developed to either collapse multiple rare variants from a genomic region into a single variable (presence/absence) or to tally the number of rare alleles within a region, relating the burden of rare alleles to disease risk. Both these approaches, however, rely on user-specification of a genomic region to generate these collapsed or burden variables, usually an entire gene. Recent studies indicate that most risk variants for common diseases are found within regulatory regions, not genes. To capture the effect of rare alleles within non-genic regulatory regions for burden tests, we contrast a simple sliding window approach with a knowledge-guided k-medoids clustering method to group rare variants into statistically powerful, biologically meaningful windows. We apply these methods to detect genomic regions that alter expression of nearby genes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4274942PMC
http://dx.doi.org/10.1007/978-3-642-37189-9_4DOI Listing

Publication Analysis

Top Keywords

rare alleles
16
rare variants
12
k-medoids clustering
8
burden tests
8
genomic region
8
regulatory regions
8
rare
7
variants
5
knowledge-constrained k-medoids
4
clustering regulatory
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!