Multiple sequence alignment using an exhaustive and greedy algorithm.

J Bioinform Comput Biol

Bioinformatics Institute, Singapore 138671, Singapore.

Published: April 2005

We describe an exhaustive and greedy algorithm for improving the accuracy of multiple sequence alignment. A simple progressive alignment approach is employed to provide initial alignments. The initial alignment is then iteratively optimized against an objective function. For any working alignment, the optimization involves three operations: insertions, deletions and shuffles of gaps. The optimization is exhaustive since the algorithm applies the above operations to all eligible positions of an alignment. It is also greedy since only the operation that gives the best improving objective score will be accepted. The algorithms have been implemented in the EGMA (Exhaustive and Greedy Multiple Alignment) package using Java programming language, and have been evaluated using the BAliBASE benchmark alignment database. Although EGMA is not guaranteed to produce globally optimized alignment, the tests indicate that EGMA is able to build alignments with high quality consistently, compared with other commonly used iterative and non-iterative alignment programs. It is also useful for refining multiple alignments obtained by other methods.

Download full-text PDF

Source
http://dx.doi.org/10.1142/s021972000500103xDOI Listing

Publication Analysis

Top Keywords

exhaustive greedy
12
alignment
10
multiple sequence
8
sequence alignment
8
greedy algorithm
8
multiple
4
exhaustive
4
alignment exhaustive
4
greedy
4
algorithm describe
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!