The largest dataset of soil metagenomes has recently been released by the National Ecological Observatory Network (NEON), which performs annual shotgun sequencing of soils at 47 sites across the United States. NEON serves as a valuable educational resource, thanks to its open data and programming tutorials, but there is currently no introductory tutorial for accessing and analyzing the soil shotgun metagenomic dataset. Here, we describe methods for processing raw soil metagenome sequencing reads using a bioinformatics pipeline tailored to the high complexity and diversity of the soil microbiome. We describe the rationale, necessary resources, and implementation of steps such as cleaning raw reads, taxonomic classification, assembly into contigs or genomes, annotation of predicted genes using custom protein databases, and exporting data for downstream analysis. The workflow presented here aims to increase the accessibility of NEON's shotgun metagenome data, which can provide important clues about soil microbial communities and their ecological roles.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178279PMC
http://dx.doi.org/10.12688/f1000research.51494.2DOI Listing

Publication Analysis

Top Keywords

national ecological
8
ecological observatory
8
soil metagenomes
8
soil
6
observatory network's
4
network's soil
4
metagenomes assembly
4
assembly basic
4
basic analysis
4
analysis largest
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!