Prioritizing genes for X-linked diseases using population exome data.

Hum Mol Genet

Division of Medical Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA 94143, USA Institute for Human Genetics, University of California San Francisco, San Francisco, CA 94143, USA

Published: February 2015

Many new disease genes can be identified through high-throughput sequencing. Yet, variant interpretation for the large amounts of genomic data remains a challenge given variation of uncertain significance and genes that lack disease annotation. As clinically significant disease genes may be subject to negative selection, we developed a prediction method that measures paucity of non-synonymous variation in the human population to infer gene-based pathogenicity. Integrating human exome data of over 6000 individuals from the NHLBI Exome Sequencing Project, we tested the utility of the prediction method based on the ratio of non-synonymous to synonymous substitution rates (dN/dS) on X-chromosome genes. A low dN/dS ratio characterized genes associated with childhood disease and outcome. Furthermore, we identify new candidates for diseases with early mortality and demonstrate intragenic localized patterns of variants that suggest pathogenic hotspots. Our results suggest that intrahuman substitution analysis is a valuable tool to help prioritize novel disease genes in sequence interpretation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4291241PMC
http://dx.doi.org/10.1093/hmg/ddu473DOI Listing

Publication Analysis

Top Keywords

disease genes
12
exome data
8
prediction method
8
genes
6
disease
5
prioritizing genes
4
genes x-linked
4
x-linked diseases
4
diseases population
4
population exome
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!