CGtag: complete genomics toolkit and annotation in a cloud-based Galaxy.

Gigascience

Department of Bioinformatics, Erasmus MC, Dr, Molewaterplein 5, 3015 GE Rotterdam, The Netherlands.

Published: January 2014

Background: Complete Genomics provides an open-source suite of command-line tools for the analysis of their CG-formatted mapped sequencing files. Determination of; for example, the functional impact of detected variants, requires annotation with various databases that often require command-line and/or programming experience; thus, limiting their use to the average research scientist. We have therefore implemented this CG toolkit, together with a number of annotation, visualisation and file manipulation tools in Galaxy called CGtag (Complete Genomics Toolkit and Annotation in a Cloud-based Galaxy).

Findings: In order to provide research scientists with web-based, simple and accurate analytical and visualisation applications for the selection of candidate mutations from Complete Genomics data, we have implemented the open-source Complete Genomics tool set, CGATools, in Galaxy. In addition we implemented some of the most popular command-line annotation and visualisation tools to allow research scientists to select candidate pathological mutations (SNV, and indels). Furthermore, we have developed a cloud-based public Galaxy instance to host the CGtag toolkit and other associated modules.

Conclusions: CGtag provides a user-friendly interface to all research scientists wishing to select candidate variants from CG or other next-generation sequencing platforms' data. By using a cloud-based infrastructure, we can also assure sufficient and on-demand computation and storage resources to handle the analysis tasks. The tools are freely available for use from an NBIC/CTMM-TraIT (The Netherlands Bioinformatics Center/Center for Translational Molecular Medicine) cloud-based Galaxy instance, or can be installed to a local (production) Galaxy via the NBIC Galaxy tool shed.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905657PMC
http://dx.doi.org/10.1186/2047-217X-3-1DOI Listing

Publication Analysis

Top Keywords

complete genomics
20
cgtag complete
8
genomics toolkit
8
toolkit annotation
8
annotation cloud-based
8
cloud-based galaxy
8
annotation visualisation
8
select candidate
8
galaxy instance
8
galaxy
7

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!