A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain is presented. The method applies to comparisons between random, unrelated protein sequences, using standard score matrices and arbitrary gap penalties. It is shown that gapped alignment behavior is essentially governed by a single parameter, alpha, depending on the penalty scheme and sequence composition. This treatment also predicts the position of the transition point between logarithmic and linear behavior. The approximation is tested by simulation and shown to be accurate over a range of commonly used substitution matrices and gap-penalties.

Download full-text PDF

Source
http://dx.doi.org/10.1089/cmb.1999.6.91DOI Listing

Publication Analysis

Top Keywords

gapped alignments
8
approximate statistics
4
statistics gapped
4
alignments heuristic
4
heuristic approximation
4
approximation score
4
score distribution
4
distribution gapped
4
alignments logarithmic
4
logarithmic domain
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!