GAPP: A Proteogenomic Software for Genome Annotation and Global Profiling of Post-translational Modifications in Prokaryotes.

Mol Cell Proteomics

From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China;

Published: November 2016

Although the number of sequenced prokaryotic genomes is growing rapidly, experimentally verified annotation of prokaryotic genome remains patchy and challenging. To facilitate genome annotation efforts for prokaryotes, we developed an open source software called GAPP for genome annotation and global profiling of post-translational modifications (PTMs) in prokaryotes. With a single command, it provides a standard workflow to validate and refine predicted genetic models and discover diverse PTM events. We demonstrated the utility of GAPP using proteomic data from Helicobacter pylori, one of the major human pathogens that is responsible for many gastric diseases. Our results confirmed 84.9% of the existing predicted H. pylori proteins, identified 20 novel protein coding genes, and corrected four existing gene models with regard to translation initiation sites. In particular, GAPP revealed a large repertoire of PTMs using the same proteomic data and provided a rich resource that can be used to examine the functions of reversible modifications in this human pathogen. This software is a powerful tool for genome annotation and global discovery of PTMs and is applicable to any sequenced prokaryotic organism; we expect that it will become an integral part of ongoing genome annotation efforts for prokaryotes. GAPP is freely available at https://sourceforge.net/projects/gappproteogenomic/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5098048PMC
http://dx.doi.org/10.1074/mcp.M116.060046DOI Listing

Publication Analysis

Top Keywords

genome annotation
20
annotation global
12
global profiling
8
profiling post-translational
8
post-translational modifications
8
sequenced prokaryotic
8
annotation efforts
8
efforts prokaryotes
8
proteomic data
8
genome
6

Similar Publications

Krait2: a versatile software for microsatellite investigation, visualization and marker development.

BMC Genomics

January 2025

Key Laboratory of Qinghai-Tibetan Plateau Animal Genetic Resource Reservation and Utilization, Sichuan Province and Ministry of Education, Southwest Minzu University, Chengdu, 610225, China.

Background: Microsatellites are highly polymorphic repeat sequences ubiquitously interspersed throughout almost all genomes which are widely used as powerful molecular markers in diverse fields. Microsatellite expansions play pivotal roles in gene expression regulation and are implicated in various neurological diseases and cancers. Although much effort has been devoted to developing efficient tools for microsatellite identification, there is still a lack of a powerful tool for large-scale microsatellite analysis.

View Article and Find Full Text PDF

A cross-tissue transcriptome-wide association study identifies new susceptibility genes for benign prostatic hyperplasia.

Sci Rep

January 2025

Department of Urology, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou, 730030, People's Republic of China.

Benign prostatic hyperplasia (BPH) is a prevalent urinary system disorder. Despite evidence of a significant genetic component from previous studies, the specific pathogenic genes and biological mechanisms are still largely unknown. The study utilized the FinnGen R10 dataset, encompassing 177,901 individuals (36,601 cases and 141,300 controls), and the GTEx v8 EQTLs files to conduct single-tissue and cross-tissue transcriptome-wide association studies (TWAS).

View Article and Find Full Text PDF

More than 50% of families with suspected rare monogenic diseases remain unsolved after whole-genome analysis by short-read sequencing (SRS). Long-read sequencing (LRS) could help bridge this diagnostic gap by capturing variants inaccessible to SRS, facilitating long-range mapping and phasing and providing haplotype-resolved methylation profiling. To evaluate LRS's additional diagnostic yield, we sequenced a rare-disease cohort of 98 samples from 41 families, using nanopore sequencing, achieving per sample ∼36× average coverage and 32-kb read N50 from a single flow cell.

View Article and Find Full Text PDF

Tracing human trait evolution through integrative genomics and temporal annotations.

Cell Genom

January 2025

Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia. Electronic address:

Understanding the evolution of human traits is a fundamental yet challenging question. In a recent Cell Genomics article, Kun et al. integrate large-scale genomic and phenotypic data, including deep-learning-derived imaging phenotypes, with temporal annotations to estimate the timing of evolutionary changes that led to differences in traits between modern humans and primates or hominin ancestors.

View Article and Find Full Text PDF

Assembly and Annotation of the Tetraploid Salsola tragus (Russian thistle) Genome.

Genome Biol Evol

January 2025

Department of Agricultural Biology, 1177 Campus Delivery, Colorado State University, Fort Collins, CO, 80523, USA.

This report presents two phased chromosome-scale genome assemblies of allotetraploid Salsola tragus (2n=4x=36) and fills the current genomics resource gap for this species. Flow cytometry estimated 1C genome size was 1.319 Gbp.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!