Testing three pipelines for 18S rDNA-based metabarcoding of soil faunal diversity.

Sci China Life Sci

Ecology, Conservation, and Environment Center, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.

Published: January 2013

A number of basic and applied questions in ecology and environmental management require the characterization of soil and leaf litter faunal diversity. Recent advances in high-throughput sequencing of barcode-gene amplicons ('metabarcoding') have made it possible to survey biodiversity in a robust and efficient way. However, one obstacle to the widespread adoption of this technique is the need to choose amongst many candidates for bioinformatic processing of the raw sequencing data. We compare three candidate pipelines for the processing of 18S small subunit rDNA metabarcode data from solid substrates: (i) USEARCH/CROP, (ii) Denoiser/UCLUST, and (iii) OCTUPUS. The three pipelines produced reassuringly similar and highly correlated assessments of community composition that are dominated by taxa known to characterize the sampled environments. However, OCTUPUS appears to inflate phylogenetic diversity, because of higher sequence noise. We therefore recommend either the USEARCH/CROP or Denoiser/UCLUST pipelines, both of which can be run within the QIIME (Quantitative Insights Into Microbial Ecology) environment.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s11427-012-4423-7DOI Listing

Publication Analysis

Top Keywords

three pipelines
8
faunal diversity
8
usearch/crop denoiser/uclust
8
testing three
4
pipelines
4
pipelines 18s
4
18s rdna-based
4
rdna-based metabarcoding
4
metabarcoding soil
4
soil faunal
4

Similar Publications

Insect-specific RNA viruses detection in Field-Caught Aedes aegypti mosquitoes from Argentina using NGS technology.

PLoS Negl Trop Dis

January 2025

Laboratorio de Ingeniería Genética y Biología Celular y Molecular-Área de virus de insectos, Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Quilmes, Buenos Aires, Argentina.

Mosquitoes are the primary vectors of arthropod-borne pathogens. Aedes aegypti is one of the most widespread mosquito species worldwide, responsible for transmitting diseases such as Dengue, Zika, and Chikungunya, among other medically significant viruses. Characterizing the array of viruses circulating in mosquitoes, particularly in Aedes aegypti, is a crucial tool for detecting and developing novel strategies to prevent arbovirus outbreaks.

View Article and Find Full Text PDF

Significance: Optimal meibography utilization and interpretation are hindered due to poor lid presentation, blurry images, or image artifacts and the challenges of applying clinical grading scales. These results, using the largest image dataset analyzed to date, demonstrate development of algorithms that provide standardized, real-time inference that addresses all of these limitations.

Purpose: This study aimed to develop and validate an algorithmic pipeline to automate and standardize meibomian gland absence assessment and interpretation.

View Article and Find Full Text PDF

Arbuscular mycorrhizal fungi (AMF, phylum Glomeromycota) are essential to plant community diversity and ecosystem functioning. However, increasing human land use represents a major threat to native AMF globally. Characterizing the loss of AMF diversity remains challenging because many taxa are undescribed, resulting in poor documentation of their biogeography and family-level disturbance sensitivity.

View Article and Find Full Text PDF

Data-driven pipeline modeling for predicting unknown protein adulteration in dairy products.

Food Chem

December 2024

Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, PR China. Electronic address:

To preemptively predict unknown protein adulterants in food and curb the incidence of food fraud at its origin, data-driven models were developed using three machine learning (ML) algorithms. Among these, the random forest (RF)-based model achieved optimal performance, achieving accuracies of 96.2 %, 95.

View Article and Find Full Text PDF

Understanding the role of transcription and transcription factors (TFs) in cellular identity and disease, such as cancer, is essential. However, comprehensive data resources for cell line-specific TF-to-target gene annotations are currently limited. To address this, we employed a straightforward method to define regulons that capture the cell-specific aspects of TF binding and transcript expression levels.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!