Detection of circular permutations by Protein Language Models.

Comput Struct Biotechnol J

School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, Shandong 250300, China.

Published: December 2024

Protein circular permutations are crucial for understanding protein evolution and functionality. Traditional detection methods face challenges: sequence-based approaches struggle with detecting distant homologs, while structure-based approaches are limited by the need for structure generation and often treat proteins as rigid bodies. Protein Language Model-based alignment tools have shown advantages in utilizing sequence information to overcome the challenges of detecting distant homologs without requiring structural input. However, many current Protein Language Model-based alignment methods, which rely on sequence alignment algorithms like the Smith-Waterman algorithm, face significant difficulties when dealing with circular permutation (CP) due to their dependency on linear sequence order. This sequence order dependency makes them unsuitable for accurately detecting CP. Our approach, named plmCP, combines classical genetic principles with modern alignment techniques leveraging Protein Language Models to address these limitations. By integrating genetic knowledge, the plmCP method avoids the sequence order dependency, allowing for effective detection of circular permutations and contributing significantly to protein research and engineering by embracing structural flexibility.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11757225PMC
http://dx.doi.org/10.1016/j.csbj.2024.12.029DOI Listing

Publication Analysis

Top Keywords

protein language
16
circular permutations
12
sequence order
12
detection circular
8
language models
8
detecting distant
8
distant homologs
8
language model-based
8
model-based alignment
8
order dependency
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!