Protein Sequence Annotation Tool (PSAT): a centralized web-based meta-server for high-throughput sequence annotations.

BMC Bioinformatics

Computing Applications and Research, Global Security Computing Applications Division, Lawrence Livermore National Security, Livermore, CA, 94550, USA.

Published: January 2016

AI Article Synopsis

  • PSAT is a web-based tool designed to facilitate high-throughput, genome-wide sequence annotations by integrating multiple bioinformatics tools.
  • The tool has been successfully applied to analyze the predicted peptide gene products of Herbaspirillum sp. strain RV1423, enabling the identification of unique metabolic pathways compared to closely related species.
  • PSAT stands out for its ability to perform rapid enzyme predictions and sequence annotations on large protein data sets, making it particularly useful for annotating complex or poorly annotated genomes.

Article Abstract

Background: Here we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools.

Results: In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resulting functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome.

Conclusions: PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequence-based genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4721133PMC
http://dx.doi.org/10.1186/s12859-016-0887-yDOI Listing

Publication Analysis

Top Keywords

protein sequence
12
sequence annotation
12
enzyme predictions
12
psat
9
annotation tool
8
tool psat
8
sequence annotations
8
large input
8
input protein
8
protein fasta
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!