Biomarker identification from next-generation sequencing data for pathogen bacteria characterization and surveillance.

Biomark Med

Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food & Drug Administration, 3900 NCTR Rd., Jefferson, AR 72079, USA.

Published: August 2016

Aim: To develop an analytical pipeline for the analysis of specific genes and for biomarker discovery from next-generation sequencing (NGS) data.

Materials & Methods: As a test case, the fliC gene reference sequences of 24 Salmonella enterica strains of 13 serotypes and NGS reads of 32 serovar Newport, 48 Montevideo and 115 Enteritidis outbreak isolates were retrieved from the National Center for Biotechnology Information database.

Results: An analytical pipeline was established consisting of four steps: (1) retrieval of reference sequences and determination of a template sequence; (2) retrieval of NGS sequence reads; (3) multiple sequence alignment and phylogenetic analysis; and (4) data mining and biomarker discovery.
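
As an illustration of what the final data mining step might look like, the following Python sketch scans an existing multiple sequence alignment for columns where one group (for example, a serotype) carries a base that is shared by all of its members and absent from every other group. This is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `group_specific_sites`, the input layout, and the toy sequences are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def group_specific_sites(aligned, groups):
    """Find alignment columns that distinguish one group from all others.

    aligned: dict mapping sequence name -> aligned sequence (equal lengths,
             gaps written as '-').
    groups:  dict mapping sequence name -> group label (e.g. serotype).
    Returns a list of (1-based position, group, base) tuples.
    """
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("sequences must be aligned to the same length")
    (length,) = lengths

    markers = []
    for pos in range(length):
        # Collect the set of bases seen in each group at this column.
        bases_by_group = defaultdict(set)
        for name, seq in aligned.items():
            bases_by_group[groups[name]].add(seq[pos].upper())

        for group, bases in bases_by_group.items():
            if len(bases) != 1:
                continue  # group is not uniform at this column
            (base,) = bases
            if base == "-":
                continue  # ignore gap-only columns for this group
            # Keep the site only if no other group carries the same base.
            if all(base not in other
                   for g, other in bases_by_group.items() if g != group):
                markers.append((pos + 1, group, base))
    return markers


# Toy, hand-made alignment (hypothetical; NOT real fliC sequences).
aligned = {
    "newport_1": "ATGGCACAAG",
    "newport_2": "ATGGCACAAG",
    "montevideo_1": "ATGACACAAG",
    "montevideo_2": "ATGACACAAG",
    "enteritidis_1": "ATGGCACTAG",
    "enteritidis_2": "ATGGCACTAG",
}
groups = {name: name.split("_")[0].capitalize() for name in aligned}

markers = group_specific_sites(aligned, groups)
for position, group, base in markers:
    print(f"position {position}: {base} is specific to {group}")
```

In this toy input, the only sites reported are those where a single group differs uniformly from the rest; a real analysis would start from an alignment produced in the third step and would likely need to handle ambiguous bases and sequencing errors.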

Conclusion: The pipeline provides an effective bioinformatics tool for clarifying genetic diversity and discovering marker sequences for pathogen characterization and surveillance.

Source
http://dx.doi.org/10.2217/bmm.15.88
