Sublineage structure analysis of Mycobacterium tuberculosis complex strains using multiple-biomarker tensors.

Cagri Ozcaglar Amina Shabbeer Scott Vandenberg Bülent Yener Kristin P Bennett

BMC Genomics

Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY, USA.

Published: January 2012

Background: Strains of Mycobacterium tuberculosis complex (MTBC) can be classified into major lineages based on their genotype. Further subdivision of major lineages into sublineages requires multiple biomarkers along with methods to combine and analyze multiple sources of information in one unsupervised learning model. Typically, spacer oligonucleotide type (spoligotype) and mycobacterial interspersed repetitive units (MIRU) are used for TB genotyping and surveillance. Here, we examine the sublineage structure of MTBC strains with multiple biomarkers simultaneously, by employing a tensor clustering framework (TCF) on multiple-biomarker tensors.

Results: Simultaneous analysis of the spoligotype and MIRU type of strains using TCF on multiple-biomarker tensors leads to coherent sublineages of major lineages with clear and distinctive spoligotype and MIRU signatures. Comparison of tensor sublineages with SpolDB4 families either supports tensor sublineages, or suggests subdivision or merging of SpolDB4 families. High prediction accuracy of major lineage classification with supervised tensor learning on multiple-biomarker tensors validates our unsupervised analysis of sublineages on multiple-biomarker tensors.

Conclusions: TCF on multiple-biomarker tensors achieves simultaneous analysis of multiple biomarkers and suggest a new putative sublineage structure for each major lineage. Analysis of multiple-biomarker tensors gives insight into the sublineage structure of MTBC at the genomic level.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194230	PMC
http://dx.doi.org/10.1186/1471-2164-12-S2-S1	DOI Listing

Publication Analysis

Top Keywords

multiple-biomarker tensors

sublineage structure

major lineages

multiple biomarkers

tcf multiple-biomarker

mycobacterium tuberculosis

tuberculosis complex

structure mtbc

simultaneous analysis

spoligotype miru

Similar Publications

Sublineage structure analysis of Mycobacterium tuberculosis complex strains using multiple-biomarker tensors.

BMC Genomics

January 2012

Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY, USA.

Cagri Ozcaglar Amina Shabbeer Scott Vandenberg Bülent Yener Kristin P Bennett

View Article and Find Full Text PDF

Similar Publications

Examining the sublineage structure of Mycobacterium tuberculosis complex strains with multiple-biomarker tensors.

Proceedings (IEEE Int Conf Bioinformatics Biomed)

January 2010

Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY USA.

Cagri Ozcaglar Amina Shabbeer Scott Vandenberg Bülent Yener Kristin P Bennett

Strains of the Mycobacterium tuberculosis complex (MTBC) can be classified into coherent lineages of similar traits based on their genotype. We present a tensor clustering framework to group MTBC strains into sublineages of the known major lineages based on two biomarkers: spacer oligonucleotide type (spoligotype) and mycobacterial interspersed repetitive units (MIRU). We represent genotype information of MTBC strains in a high-dimensional array in order to include information about spoligotype, MIRU, and their coexistence using multiple-biomarker tensors.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!