Profile-based short linear protein motif discovery.

BMC Bioinformatics

Complex and Adaptive Systems Laboratory, University College Dublin, Ireland.

Published: May 2012

AI Article Synopsis

  • Short linear protein motifs, typically 3-10 amino acids long and found in disordered protein regions, are gaining attention for their functional independence.
  • Recent methods have been developed to identify over-represented motifs in proteins, expanding from regular expressions to more advanced profile-based techniques.
  • Although profile methods like MEME initially struggled with disordered protein motifs, incorporating evolutionary weighting improved their performance, suggesting that both methods have unique strengths and can complement each other in motif discovery.

Article Abstract

Background: Short linear protein motifs are attracting increasing attention as functionally independent sites, typically 3-10 amino acids in length that are enriched in disordered regions of proteins. Multiple methods have recently been proposed to discover over-represented motifs within a set of proteins based on simple regular expressions. Here, we extend these approaches to profile-based methods, which provide a richer motif representation.

Results: The profile motif discovery method MEME performed relatively poorly for motifs in disordered regions of proteins. However, when we applied evolutionary weighting to account for redundancy amongst homologous proteins, and masked out poorly conserved regions of disordered proteins, the performance of MEME is equivalent to that of regular expression methods. However, the two approaches returned different subsets within both a benchmark dataset, and a more realistic discovery dataset.

Conclusions: Profile-based motif discovery methods complement regular expression based methods. Whilst profile-based methods are computationally more intensive, they are likely to discover motifs currently overlooked by regular expression methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534220PMC
http://dx.doi.org/10.1186/1471-2105-13-104DOI Listing

Publication Analysis

Top Keywords

motif discovery
12
regular expression
12
short linear
8
linear protein
8
disordered regions
8
regions proteins
8
profile-based methods
8
expression methods
8
methods
7
proteins
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!