Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

PLoS One

Department of Biological Chemistry, Indian Association for the Cultivation of Science, 2A & 2B Raja S. C. Mullick Road, Kolkata-700032, West Bengal, India.

Published: July 2017

Comparing amino acid sequence similarity is the fundamental concept behind the construction of protein phylogenetic trees. This approach can explain evolutionary relationships, but further explanation is not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize protein sequences on the basis of the chemical nature of their amino acids. We design algorithms for studying the variation of chemical group transitions and the chemical group combinations that occur as patterns in protein sequences. The amino acid sequences of the conventional myosin II head domain from 14 family members are used to illustrate this approach. We find two blocks of maximum length 6 aa, 'FPKATD' and 'Y/FTNEKL', in which no chemical nature is repeated, and one block of maximum length 20 aa in which chemical natures are repeated; all of these are common to the 14 members. We also check for commonality with another motor protein sub-family, kinesin (KIF1A). Our analysis finds a common block of length 8 aa in both myosin II and KIF1A. This motif is located in the neck linker region, which could be responsible for the generation of mechanical force. The method thus enables us to find the unique blocks that remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, myosin light chain kinase (MLCK), Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating conserved amino acid patterns in different proteins.
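The abstract does not spell out the chemical grouping or the algorithms, so the following is only a minimal sketch of the general idea: map each residue to a chemical group, count group-to-group transitions, and find the longest block in which no chemical group repeats. The eight-group table and the function names (to_groups, group_transitions, longest_nonrepeating_block) are illustrative assumptions, not the authors' published classification or code.

```python
from collections import Counter

# Hypothetical 8-group classification of the 20 standard amino acids.
# This table is an assumption for illustration, not taken from the paper.
GROUPS = {
    "acidic": "DE",
    "basic": "KRH",
    "aromatic": "FWY",
    "aliphatic": "AGILV",
    "hydroxyl": "ST",
    "amide": "NQ",
    "sulfur": "CM",
    "cyclic": "P",
}
AA_TO_GROUP = {aa: name for name, members in GROUPS.items() for aa in members}


def to_groups(seq):
    """Translate a one-letter amino acid sequence into chemical group names."""
    # Raises KeyError for letters outside the 20 standard residues.
    return [AA_TO_GROUP[aa] for aa in seq.upper()]


def group_transitions(seq):
    """Count transitions between the chemical groups of adjacent residues."""
    groups = to_groups(seq)
    return Counter(zip(groups, groups[1:]))


def longest_nonrepeating_block(seq):
    """Return the longest contiguous block in which no chemical group repeats.

    Uses a sliding window; on ties, the earliest block is returned.
    """
    groups = to_groups(seq)
    last_seen = {}
    start = best_start = best_len = 0
    for i, grp in enumerate(groups):
        # If this group already occurs inside the window, shrink the window.
        if grp in last_seen and last_seen[grp] >= start:
            start = last_seen[grp] + 1
        last_seen[grp] = i
        if i - start + 1 > best_len:
            best_start, best_len = start, i - start + 1
    return seq[best_start:best_start + best_len]
```

Under this assumed table, both 'FPKATD' and 'YTNEKL' come out as six residues with six distinct chemical groups, and 'YTNEKL' and 'FTNEKL' map to the same group pattern because Y and F are both aromatic, which is one way to read the 'Y/FTNEKL' notation. A different grouping scheme would give different blocks.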


Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145171 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0167651 (PLOS)

