SEAL: a distributed short read mapping and duplicate removal tool.

Bioinformatics

CRS4, Polaris, Ed. 1, I-09010 Pula, Italy.

Published: August 2011

Summary: SEAL is a scalable tool for short read pair mapping and duplicate removal. It computes mappings that are consistent with those produced by BWA and removes duplicates according to the same criteria employed by Picard MarkDuplicates. On a 16-node Hadoop cluster, it is capable of processing about 13 GB per hour in map+rmdup mode, while reaching a throughput of 19 GB per hour in mapping-only mode.

Availability: SEAL is available online at http://biodoop-seal.sourceforge.net/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3137215PMC
http://dx.doi.org/10.1093/bioinformatics/btr325DOI Listing

Publication Analysis

Top Keywords

short read
8
mapping duplicate
8
duplicate removal
8
seal distributed
4
distributed short
4
read mapping
4
removal tool
4
tool summary
4
summary seal
4
seal scalable
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!