Motivation: The Full-text index in Minute space (FM-index) derived from the Burrows-Wheeler transform (BWT) is broadly used for fast string matching in large genomes or a huge set of sequencing reads. Several graphic processing unit (GPU) accelerated aligners based on the FM-index have been proposed recently; however, the construction of the index is still handled by central processing unit (CPU), only parallelized in data level (e.g. by performing blockwise suffix sorting in GPU), or not scalable for large genomes.

Results: To fulfill the need for a more practical, hardware-parallelizable indexing and matching approach, we herein propose sBWT based on a BWT variant (i.e. Schindler transform) that can be built with highly simplified hardware-acceleration-friendly algorithms and still suffices accurate and fast string matching in repetitive references. In our tests, the implementation achieves significant speedups in indexing and searching compared with other BWT-based tools and can be applied to a variety of domains.

Availability And Implementation: sBWT is implemented in C ++ with CPU-only and GPU-accelerated versions. sBWT is open-source software and is available at http://jhhung.github.io/sBWT/Supplementary information: Supplementary data are available at Bioinformatics online.

Contact: chyee@ntu.edu.tw or jhhung@nctu.edu.tw (also juihunghung@gmail.com).

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btw419DOI Listing

Publication Analysis

Top Keywords

schindler transform
8
fast string
8
string matching
8
processing unit
8
sbwt
4
sbwt memory
4
memory efficient
4
efficient implementation
4
implementation hardware-acceleration-friendly
4
hardware-acceleration-friendly schindler
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!