Recent technological and computational advances have made metagenomic assembly a viable approach to achieving high-resolution views of complex microbial communities. In previous benchmarking, short-read (SR) metagenomic assemblers had the highest accuracy, long-read (LR) assemblers generated the most contiguous sequences and hybrid (HY) assemblers balanced length and accuracy. However, no assessments have specifically compared the performance of these assemblers on low-abundance species, which include clinically relevant organisms in the gut.
Recurrent urinary tract infections (rUTIs) are a major health burden worldwide, with history of infection being a significant risk factor. While the gut is a known reservoir for uropathogenic bacteria, the role of the microbiota in rUTI remains unclear. We conducted a year-long study of women with (n = 15) and without (n = 16) history of rUTI, from whom we collected urine, blood and monthly faecal samples for metagenomic and transcriptomic interrogation.
Background: Carbapenem-resistant Enterobacterales (CRE) are an urgent global health threat. Inferring the dynamics of local CRE dissemination is currently limited by our inability to confidently trace the spread of resistance determinants to unrelated bacterial hosts. Whole-genome sequence comparison is useful for identifying CRE clonal transmission and outbreaks, but high-frequency horizontal gene transfer (HGT) of carbapenem resistance genes and subsequent genome rearrangement complicate tracing the local persistence and mobilization of these genes across organisms.
Human-associated microbial communities comprise not only complex mixtures of bacterial species, but also mixtures of conspecific strains, the implications of which are mostly unknown since strain level dynamics are underexplored due to the difficulties of studying them. We introduce the Strain Genome Explorer (StrainGE) toolkit, which deconvolves strain mixtures and characterizes component strains at the nucleotide level from short-read metagenomic sequencing with higher sensitivity and resolution than other tools. StrainGE is able to identify strains at 0.
Background: Urinary tract infections (UTIs) affect 15 million women each year in the United States, with > 20% experiencing frequent recurrent UTIs. A recent placebo-controlled clinical trial found a 39% reduction in UTI symptoms among recurrent UTI sufferers who consumed a daily cranberry beverage for 24 weeks. Using metagenomic sequencing of stool from a subset of these trial participants, we assessed the impact of cranberry consumption on the gut microbiota, a reservoir for UTI-causing pathogens such as Escherichia coli, which causes > 80% of UTIs.
Background: Mixed infections of Mycobacterium tuberculosis and antibiotic heteroresistance continue to complicate tuberculosis (TB) diagnosis and treatment. Detection of mixed infections has been limited to molecular genotyping techniques, which lack the sensitivity and resolution to accurately estimate the multiplicity of TB infections. In contrast, whole genome sequencing offers sensitive views of the genetic differences between strains of M.
A more complete understanding of the genetic basis of drug resistance in Mycobacterium tuberculosis is critical for prompt diagnosis and optimal treatment, particularly for toxic second-line drugs such as D-cycloserine. Here we used the whole-genome sequences from 498 strains of M. tuberculosis to identify new resistance-conferring genotypes.
PLoS Med
September 2015
Advances in modern sequencing technologies allow us to generate sufficient data to analyze hundreds of bacterial genomes from a single machine in a single day. This potential for sequencing massive numbers of genomes calls for fully automated methods to produce high-quality assemblies and variant calls. We introduce Pilon, a fully automated, all-in-one tool for correcting draft assemblies and calling sequence variants of multiple sizes, including very large insertions and deletions.
Background: Immunosuppression is associated with a variety of idiopathic clinical syndromes that may have infectious causes. It has been hypothesized that the cord colitis syndrome, a complication of umbilical-cord hematopoietic stem-cell transplantation, is infectious in origin.
Methods: We performed shotgun DNA sequencing on four archived, paraffin-embedded endoscopic colon-biopsy specimens obtained from two patients with cord colitis.
Exceptionally accurate genome reference sequences have proven to be of great value to microbial researchers. Thus, to date, about 1800 bacterial genome assemblies have been "finished" at great expense with the aid of manual laboratory and computational processes that typically iterate over a period of months or even years. By applying a new laboratory design and new assembly algorithm to 16 samples, we demonstrate that assemblies exceeding finished quality can be obtained from whole-genome shotgun data and automated computation.
Proc Natl Acad Sci U S A
February 2012
The degree to which molecular epidemiology reveals information about the sources and transmission patterns of an outbreak depends on the resolution of the technology used and the samples studied. Isolates of Escherichia coli O104:H4 from the outbreak centered in Germany in May-July 2011, and the much smaller outbreak in southwest France in June 2011, were indistinguishable by standard tests. We report a molecular epidemiological analysis using multiplatform whole-genome sequencing and analysis of multiple isolates from the German and French outbreaks.
Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach.