Repeats identification using improved suffix trees.

Int J Comput Biol Drug Des

School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China.

Published: March 2010

AI Article Synopsis

  • The paper discusses how suffix trees are crucial for efficiently executing querying algorithms, particularly in identifying repeated sequences.
  • It introduces the fast Rep(eats)Seeker algorithm that enhances suffix tree construction by differently numbering leaf and branch nodes and adding extra information to branch nodes.
  • Experimental results demonstrate that these enhancements significantly decrease the running time of the RepSeeker algorithm while maintaining accuracy, aligning with theoretical predictions.

Article Abstract

The suffix tree data structure plays an important role in the efficient implementations of some querying algorithms. This paper presents the fast Rep(eats)Seeker algorithm for repeats identification based on the improvements of suffix tree construction. The leaf nodes and the branch nodes are numbered in different ways during the construction of a suffix tree and extra information is added to the branch nodes. The experimental results show that improvements reduce the running time of the RepSeeker algorithm without losing the accuracy. The experimental results coincide with the theoretical expectations.

Download full-text PDF

Source
http://dx.doi.org/10.1504/IJCBDD.2009.030117DOI Listing

Publication Analysis

Top Keywords

suffix tree
12
repeats identification
8
branch nodes
8
identification improved
4
suffix
4
improved suffix
4
suffix trees
4
trees suffix
4
tree data
4
data structure
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!