SeqScrub: a web tool for automatic cleaning and annotation of FASTA file headers for bioinformatic applications.

Gabriel Foley Leander Sützl Stephlina A D'Cunha Elizabeth Mj Gillam Mikael Bodén

Biotechniques

School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia.

Published: August 2019

Data consistency is necessary for effective bioinformatic analysis. SeqScrub is a web tool that parses and maintains consistent information about protein and DNA sequences in FASTA file format, checks if records are current, and adds taxonomic information by matching identifiers against entries in authoritative biological sequence databases. SeqScrub provides a powerful, yet simple workflow for managing, enriching and exchanging data, which is crucial to establish a record of provenance for sequences found from broad and varied searches; for example, using BLAST on continually updated genome sequence sets. Headers standardized using SeqScrub can be parsed by a majority of bioinformatic tools, stay uniformly named between collaborators and contain informative labels to aid management of reproducible, scientific data. SeqScrub is available at http://bioinf.scmb.uq.edu.au/seqscrub.

Download full-text PDF	Source
http://dx.doi.org/10.2144/btn-2018-0188	DOI Listing

Publication Analysis

Top Keywords

seqscrub web

web tool

fasta file

seqscrub

tool automatic

automatic cleaning

cleaning annotation

annotation fasta

file headers

headers bioinformatic

Similar Publications

SeqScrub: a web tool for automatic cleaning and annotation of FASTA file headers for bioinformatic applications.

Biotechniques

August 2019

School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia.

Gabriel Foley Leander Sützl Stephlina A D'Cunha Elizabeth Mj Gillam Mikael Bodén

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!