Using multiple interdependency to separate functional from phylogenetic correlations in protein alignments.

Bioinformatics

Ontario Cancer Institute, University Health Network, Suite 703, 620 University Avenue, Toronto, Ontario, Canada M5G 2M9.

Published: April 2003

Motivation: Multiple sequence alignments of homologous proteins are useful for inferring their phylogenetic history and to reveal functionally important regions in the proteins. Functional constraints may lead to co-variation of two or more amino acids in the sequence, such that a substitution at one site is accompanied by compensatory substitutions at another site. It is not sufficient to find the statistical correlations between sites in the alignment because these may be the result of several undetermined causes. In particular, phylogenetic clustering will lead to many strong correlations.

Results: A procedure is developed to detect statistical correlations stemming from functional interaction by removing the strong phylogenetic signal that leads to the correlations of each site with many others in the sequence. Our method relies upon the accuracy of the alignment but it does not require any assumptions about the phylogeny or the substitution process. The effectiveness of the method was verified using computer simulations and then applied to predict functional interactions between amino acids in the Pfam database of alignments.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btg072DOI Listing

Publication Analysis

Top Keywords

amino acids
8
statistical correlations
8
multiple interdependency
4
interdependency separate
4
functional
4
separate functional
4
phylogenetic
4
functional phylogenetic
4
correlations
4
phylogenetic correlations
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!