Systematic processing of ribosomal RNA gene amplicon sequencing data.

Gigascience

Centre INRS-Institut Armand-Frappier, Institut national de la recherche scientifique, 531ad Boul. des Prairies, Laval, QC H7V-1B7, Canada.

Published: December 2019

Background: With the advent of high-throughput sequencing, microbiology is becoming increasingly data-intensive. Because of its low cost, robust databases, and established bioinformatic workflows, sequencing of 16S/18S/ITS ribosomal RNA (rRNA) gene amplicons, which provides a marker of choice for phylogenetic studies, has become ubiquitous. Many established end-to-end bioinformatic pipelines are available to perform short amplicon sequence data analysis. These pipelines suit a general audience, but few options exist for more specialized users who are experienced in code scripting, Linux-based systems, and high-performance computing (HPC) environments. For such an audience, existing pipelines can be limiting to fully leverage modern HPC capabilities and perform tweaking and optimization operations. Moreover, a wealth of stand-alone software packages that perform specific targeted bioinformatic tasks are increasingly accessible, and finding a way to easily integrate these applications in a pipeline is critical to the evolution of bioinformatic methodologies.

Results: Here we describe AmpliconTagger, a short rRNA marker gene amplicon pipeline coded in a Python framework that enables fine tuning and integration of virtually any potential rRNA gene amplicon bioinformatic procedure. It is designed to work within an HPC environment, supporting a complex network of job dependencies with a smart-restart mechanism in case of job failure or parameter modifications. As proof of concept, we present end results obtained with AmpliconTagger using 16S, 18S, ITS rRNA short gene amplicons and Pacific Biosciences long-read amplicon data types as input.

Conclusions: Using a selection of published algorithms for generating operational taxonomic units and amplicon sequence variants and for computing downstream taxonomic summaries and diversity metrics, we demonstrate the performance and versatility of our pipeline for systematic analyses of amplicon sequence data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6901069PMC
http://dx.doi.org/10.1093/gigascience/giz146DOI Listing

Publication Analysis

Top Keywords

gene amplicon
12
amplicon sequence
12
ribosomal rna
8
rrna gene
8
gene amplicons
8
sequence data
8
amplicon
7
gene
5
bioinformatic
5
systematic processing
4

Similar Publications

Unlabelled: Marine protists form complex communities that are shaped by environmental and biological ecosystem properties, as well as ecological interactions between organisms. While all of these factors play a role in shaping protistan communities, the specific ways in which these properties and interactions influence protistan communities remain poorly understood. Fourteen years and 9 months of eukaryotic amplicon (18S-V4 rRNA gene) data collected monthly at the San Pedro Ocean Time-series (SPOT) station were used to evaluate the impacts that environmental and biological factors, and protist-protist interactions had on protistan community composition.

View Article and Find Full Text PDF

Plants are colonized by a vast array of microorganisms that outstrip plant cell densities and genes, thus referred to as plant's second genome or extended genome. The microbial communities exert a significant influence on the vigor, growth, development and productivity of plants by supporting nutrient acquisition, organic matter decomposition and tolerance against biotic and abiotic stresses such as heat, high salt, drought and disease, by regulating plant defense responses. The rhizosphere is a complex micro-ecological zone in the direct vicinity of plant roots and is considered a hotspot of microbial diversity.

View Article and Find Full Text PDF

First report of Anaplasma marginale and Anaplasma ovis in goats in Kelantan, Malaysia.

Trop Biomed

December 2024

Departments of Veterinary Parasitology and Entomology, University of Maiduguri, P.M.B. 1069, Maiduguri 600230, Nigeria.

Anaplasma species are obligate rickettsial intraerythrocytic pathogens that cause an important tick-borne disease of economic importance in livestock production in many countries. Anaplasma species have been detected from farm animals worldwide, there is a paucity of information on Anaplasma infections in goats from Malaysia. Thus, this study aimed to assess the infection rate and identify Anaplasma species and some selected risk factors in goats across selected districts in Kelantan, Malaysia.

View Article and Find Full Text PDF

Background: This cross-sectional study aimed to compare the composition of the submucosal microbiome of peri-implantitis with paired and unpaired healthy implant samples.

Methods: We evaluated submucosal plaque samples obtained in 39 cases, including 13 cases of peri-implantitis, 13 cases involving healthy implants from the same patient (paired samples), and 13 cases involving healthy implants from different individuals (unpaired samples). The patients were evaluated using next-generation genomic sequencing (Illumina) based on 16S rRNA gene amplification.

View Article and Find Full Text PDF

Background: Acquiring representative bacterial 16S rRNA gene community profiles in plant microbiome studies can be challenging due to the excessive co-amplification of host chloroplast and mitochondrial rRNA gene sequences that reduce counts of plant-associated bacterial sequences. Peptide Nucleic Acid (PNA) clamps prevent this by blocking PCR primer binding or binding within the amplified region of non-target DNA to stop the function of DNA polymerase. Here, we applied a universal chloroplast (p)PNA clamp and a newly designed mitochondria (m)PNA clamp to minimise host chloroplast and mitochondria amplification in 16S rRNA gene amplicon profiles of leaf, bark and root tissue of two oak species (Quercus robur and Q.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!