The three-dimensional structures of homologous proteins are usually conserved during evolution, as are critical residues in a few short sequence motifs that often constitute the active site in enzymes. The precise spatial organization of such sites depends on the lengths and positions of the secondary structural elements connecting the motifs. We show how members of protein superfamilies, such as kinesins, myosins, and G(alpha) subunits of trimeric G proteins, are identified and classed by simply counting the number of amino acid residues between important sequence motifs in their nucleotide triphosphate-hydrolyzing domains. Subfamily-specific landmark patterns (motif to motif scores) are principally due to inserts and gaps in surface loops. Unusual protein sequences and possible sequence prediction errors are detected.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/s0969-2126(02)00854-7 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!