BitMapper: an efficient all-mapper based on bit-vector computing.

BMC Bioinformatics

Department of Computer Science, University of Missouri-Columbia, Columbia MO, 65203, USA.

Published: June 2015

Background: As the next-generation sequencing (NGS) technologies producing hundreds of millions of reads every day, a tremendous computational challenge is to map NGS reads to a given reference genome efficiently. However, existing methods of all-mappers, which aim at finding all mapping locations of each read, are very time consuming. The majority of existing all-mappers consist of 2 main parts, filtration and verification. This work significantly reduces verification time, which is the dominant part of the running time.

Results: An efficient all-mapper, BitMapper, is developed based on a new vectorized bit-vector algorithm, which simultaneously calculates the edit distance of one read to multiple locations in a given reference genome. Experimental results on both simulated and real data sets show that BitMapper is from several times to an order of magnitude faster than the current state-of-the-art all-mappers, while achieving higher sensitivity, i.e., better quality solutions.

Conclusions: We present BitMapper, which is designed to return all mapping locations of raw reads containing indels as well as mismatches. BitMapper is implemented in C under a GPL license. Binaries are freely available at http://home.ustc.edu.cn/%7Echhy.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4462005PMC
http://dx.doi.org/10.1186/s12859-015-0626-9DOI Listing

Publication Analysis

Top Keywords

efficient all-mapper
8
reference genome
8
mapping locations
8
bitmapper
5
bitmapper efficient
4
all-mapper based
4
based bit-vector
4
bit-vector computing
4
computing background
4
background next-generation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!