Multiseed lossless filtration.

IEEE/ACM Trans Comput Biol Bioinform

INRIA/LORIA, 615, rue du Jardin Botanique, B.P. 101, 54602 Villers-lès-Nancy, France.

Published: November 2006

We study a method of seed-based lossless filtration for approximate string matching and related bioinformatics applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Kärkkäinen. We present algorithms to compute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2005.12DOI Listing

Publication Analysis

Top Keywords

lossless filtration
8
multiseed lossless
4
filtration study
4
study method
4
method seed-based
4
seed-based lossless
4
filtration approximate
4
approximate string
4
string matching
4
matching bioinformatics
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!