Sigma: strain-level inference of genomes from metagenomic analysis for biosurveillance.

Bioinformatics

Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA.

Published: January 2015

Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level identification and quantification of pathogens using their reference genomes based on metagenomic analysis.

Results: Sigma provides not only accurate strain-level inferences, but also three unique capabilities: (i) Sigma quantifies the statistical uncertainty of its inferences, which includes hypothesis testing of identified genomes and confidence interval estimation of their relative abundances; (ii) Sigma enables strain variant calling by assigning metagenomic reads to their most likely reference genomes; and (iii) Sigma supports parallel computing for fast analysis of large datasets. The algorithm performance was evaluated using simulated mock communities and fecal samples with spike-in pathogen strains.

Availability And Implementation: Sigma was implemented in C++ with source codes and binaries freely available at http://sigma.omicsbio.org.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287953PMC
http://dx.doi.org/10.1093/bioinformatics/btu641DOI Listing

Publication Analysis

Top Keywords

reference genomes
8
sigma
7
sigma strain-level
4
strain-level inference
4
genomes
4
inference genomes
4
metagenomic
4
genomes metagenomic
4
metagenomic analysis
4
analysis biosurveillance
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!