Study of LZ-word distribution and its application for sequence comparison.

J Theor Biol

College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China. Electronic address:

Published: November 2013

Lempel-Ziv complexity has been widely used for sequence comparison and achieved promising results, but until now components' distribution in exhaustive history has not been studied. This paper investigated the whole distribution of LZ-words and presented a novel statistical method for sequence comparison. With the components' length in mind, we revised Lempel-Ziv complexity and obtained various sets of LZ-words. Instead of calculating the LZ-words' contents, we defined a series of set operations on LZ-word set to compare biological sequences. In order to assess the effectiveness of the proposed method, we performed two sets of experiments and compared it with alignment-based methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7094135PMC
http://dx.doi.org/10.1016/j.jtbi.2013.07.008DOI Listing

Publication Analysis

Top Keywords

sequence comparison
12
lempel-ziv complexity
8
study lz-word
4
lz-word distribution
4
distribution application
4
application sequence
4
comparison lempel-ziv
4
complexity sequence
4
comparison achieved
4
achieved promising
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!