Post-Alignment Adjustment and Its Automation.

Genes (Basel)

Department of Biology, University of Ottawa, Marie-Curie Private, Ottawa, ON K1N 9A7, Canada.

Published: November 2021

Multiple sequence alignment (MSA) is the basis for almost all sequence comparison and molecular phylogenetic inferences. Large-scale genomic analyses are typically associated with automated progressive MSA without subsequent manual adjustment, which itself is often error-prone because of the lack of a consistent and explicit criterion. Here, I outlined several commonly encountered alignment errors that cannot be avoided by progressive MSA for nucleotide, amino acid, and codon sequences. Methods that could be automated to fix such alignment errors were then presented. I emphasized the utility of the position weight matrix as a new tool for MSA refinement and illustrated its usage by refining the MSA of nucleotide and amino acid sequences. The main advantages of the position weight matrix approach include (1) its use of information from all sequences, in contrast to other commonly used methods based on pairwise alignment scores and inconsistency measures, and (2) its speedy computation, making it suitable for a large number of long viral genomic sequences.
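
To make the position weight matrix idea concrete, the following is a minimal Python sketch, not the author's implementation. It assumes a log-odds matrix built from column counts with a pseudocount of 0.5 and background frequencies pooled over the whole alignment; the function names `position_weight_matrix` and `pwm_score`, the toy sequences, and the pseudocount value are all hypothetical choices made for illustration. The sketch compares two candidate gap placements for one sequence and keeps the placement that scores higher against a matrix built from all the other sequences.

```python
import math
from collections import Counter


def position_weight_matrix(aligned, alphabet="ACGT", pseudocount=0.5):
    """Build a log-odds matrix (one dict per column) from equal-length rows.

    Gap characters are ignored. The pseudocount and the pooled background
    are assumptions of this sketch, not values taken from the paper.
    """
    k = len(alphabet)
    # Background frequencies pooled over every non-gap residue.
    pooled = Counter(ch for row in aligned for ch in row if ch in alphabet)
    total = sum(pooled.values())
    background = {
        a: (pooled[a] + pseudocount) / (total + pseudocount * k)
        for a in alphabet
    }
    pwm = []
    for column in zip(*aligned):
        counts = Counter(ch for ch in column if ch in alphabet)
        n = sum(counts.values())
        pwm.append({
            a: math.log2(
                ((counts[a] + pseudocount) / (n + pseudocount * k))
                / background[a]
            )
            for a in alphabet
        })
    return pwm


def pwm_score(aligned_row, pwm):
    """Sum the column scores of every non-gap residue in an aligned row."""
    return sum(col[ch] for ch, col in zip(aligned_row, pwm) if ch in col)


# Toy data (hypothetical): three aligned sequences and two ways of
# placing the single gap in a fourth, shorter sequence.
others = ["ACGTTA", "ACGTTA", "ACGATA"]
candidates = ["ACG-TA", "AC-GTA"]

pwm = position_weight_matrix(others)
best = max(candidates, key=lambda row: pwm_score(row, pwm))
print(best)
```

Because every column of the matrix is computed from all rows at once, scoring a candidate placement costs one lookup per site, which is the property the abstract contrasts with methods based on pairwise alignment scores.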

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623120
DOI: http://dx.doi.org/10.3390/genes12111809
