The occurrence of all di- and tripeptide segments of proteins was counted in a large database containing about 119 000 residues. It was found that the abundance of the amino acids does not determine the frequency of the various di- and tripeptide segments. In addition, the frequency of the various tripeptides cannot be predicted from the observed pair-frequency values. The pair-frequency distribution of amino acids is highly asymmetrical; pairs formed from identical residues are generally preferred, and amino acids cannot be clustered on the basis of their first-neighbour preferences. These data indicate the existence of general short-range regularities in the primary structure of proteins. The consequences of these short-range regularities were studied by comparing Chou-Fasman parameters with analogous parameters determined from the results of conformational energy calculations on single amino acids. This comparison shows that Chou-Fasman parameters carry significant information about the environment of each amino acid. The success of the Chou-Fasman prediction and the properties of the pair and triplet distributions of the amino acid residues suggest that every amino acid has a characteristic sequential residue environment in proteins. The observed preferences could be invoked, for example, in protein design or in the study of the evolutionary relationships of proteins.
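
The pair-frequency comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names `pair_ratios` and `asymmetry`, the independence-based expected count, and the toy sequences are assumptions made here for illustration. A ratio near 1 would mean the pair occurs about as often as residue abundance alone predicts, and a nonzero difference between the ratios for XY and YX would indicate an asymmetrical pair distribution.

```python
from collections import Counter


def pair_ratios(sequences):
    """Observed/expected ratio for every ordered residue pair that occurs.

    Expected counts assume neighbouring residues are independent, i.e.
    they depend only on single-residue abundance (an illustrative
    assumption, not the paper's stated normalisation).
    """
    singles = Counter()
    pairs = Counter()
    for seq in sequences:
        singles.update(seq)
        # Overlapping dipeptides: positions (0,1), (1,2), ...
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))

    n_single = sum(singles.values())
    n_pair = sum(pairs.values())

    ratios = {}
    for pair, observed in pairs.items():
        first, second = pair
        expected = (singles[first] / n_single) * (singles[second] / n_single) * n_pair
        ratios[pair] = observed / expected
    return ratios


def asymmetry(ratios, a, b):
    """Difference between the ratios of the ordered pairs a-b and b-a."""
    return ratios.get(a + b, 0.0) - ratios.get(b + a, 0.0)


if __name__ == "__main__":
    # Hypothetical toy sequences in one-letter code, not data from the study.
    toy = ["MKVLAAGLLK", "GAAKLLVAAG"]
    ratios = pair_ratios(toy)
    for pair in sorted(ratios):
        print(pair, round(ratios[pair], 2))
    print("A/K asymmetry:", round(asymmetry(ratios, "A", "K"), 2))
```

The same counting extends to tripeptides by taking windows of length three; comparing those counts with values predicted from the pair frequencies would correspond to the second test mentioned in the abstract.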

Source
http://dx.doi.org/10.1111/j.1399-3011.1986.tb01046.x
