Motivation: Many methods for microbial protein subcellular localization (SCL) prediction exist; however, none is readily available for analysis of metagenomic sequence data, despite growing interest from researchers studying microbial communities in humans, agri-food relevant organisms and other environments (e.g. for identification of cell-surface biomarkers for rapid protein-based diagnostic tests). We also wished to identify new markers of water quality from freshwater samples collected from pristine versus pollution-impacted watersheds.

Results: We report PSORTm, the first bioinformatics tool designed for prediction of diverse bacterial and archaeal protein SCL from metagenomics data. PSORTm combines components of PSORTb, one of the most precise and widely used protein SCL predictors, with automated classification by cell envelope. A 5-fold cross-validation evaluation using in silico-fragmented sequences of known localization showed that PSORTm maintains PSORTb's high precision, while sensitivity increases proportionately with metagenomic sequence fragment length. PSORTm's read-based analysis gave results similar to PSORTb-based analysis of metagenome-assembled genomes (MAGs); however, the latter requires non-trivial manual classification of each MAG by cell envelope and cannot make use of unassembled sequences. Analysis of the watershed samples revealed the importance of normalization and identified potential biomarkers of water quality. This method should be useful for examining a wide range of microbial communities, including human microbiomes and other microbiomes of medical, environmental or industrial importance.
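
The evaluation described above (fragmenting sequences of known localization, then scoring precision and sensitivity) can be illustrated with a minimal sketch. This is not PSORTm's code: the function names `fragment` and `precision_sensitivity`, the non-overlapping fragmentation scheme, and the convention that `None` means "no localization called" are all assumptions made for illustration. Precision is taken here as correct calls over calls made, and sensitivity as correct calls over all fragments.

```python
from typing import List, Optional, Tuple


def fragment(seq: str, length: int) -> List[str]:
    """Cut a sequence into non-overlapping fragments of `length` residues.

    A trailing piece shorter than `length` is dropped (illustrative choice).
    """
    return [seq[i:i + length] for i in range(0, len(seq) - length + 1, length)]


def precision_sensitivity(
    truth: List[str], predicted: List[Optional[str]]
) -> Tuple[float, float]:
    """Score predictions against known localizations.

    `predicted[i]` is None when the predictor made no call for fragment i
    (a hypothetical convention for this sketch).
    """
    calls = [(t, p) for t, p in zip(truth, predicted) if p is not None]
    correct = sum(1 for t, p in calls if t == p)
    precision = correct / len(calls) if calls else 0.0
    sensitivity = correct / len(truth) if truth else 0.0
    return precision, sensitivity
```

Under these assumptions, a predictor that declines to call short fragments keeps its precision but loses sensitivity, which matches the trade-off reported in the abstract.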

Availability and Implementation: Documentation, source code and Docker containers for running PSORTm locally are available at https://www.psort.org/psortm/ (freely available, open-source software under GNU General Public License Version 3).

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214030
DOI: http://dx.doi.org/10.1093/bioinformatics/btaa136
