HMMerge: an ensemble method for multiple sequence alignment.

Bioinform Adv

Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA.

Published: April 2023

Motivation: Despite advances in method development for multiple sequence alignment over the last several decades, the alignment of datasets exhibiting substantial sequence length heterogeneity, especially when the input sequences include very short sequences (either as a result of sequencing technologies or of large deletions during evolution) remains an inadequately solved problem.

Results: We present HMMerge, a method to compute an alignment of datasets exhibiting high sequence length heterogeneity, or to add short sequences into a given 'backbone' alignment. HMMerge builds on the technique from its predecessor alignment methods, UPP and WITCH, which build an ensemble of profile HMMs to represent the backbone alignment and add the remaining sequences into the backbone alignment using the ensemble. HMMerge differs from UPP and WITCH by building a new 'merged' HMM from the ensemble, and then using that merged HMM to align the query sequences. We show that HMMerge is competitive with WITCH, with an advantage over WITCH when adding very short sequences into backbone alignments.

Availability And Implementation: HMMerge is freely available at https://github.com/MinhyukPark/HMMerge.

Supplementary Information: Supplementary data are available at online.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10148686	PMC
http://dx.doi.org/10.1093/bioadv/vbad052	DOI Listing

Publication Analysis

Top Keywords

short sequences

multiple sequence

alignment

sequence alignment

alignment datasets

datasets exhibiting

sequence length

length heterogeneity

upp witch

backbone alignment

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!