In search of genome annotation consistency: solid gene clusters and how to use them.

3 Biotech

Mathematics and Computer Science, Argonne National Laboratory, 9700 S. Cass Ave., Argonne, IL, 60439, USA.

Published: June 2014

Maintaining consistency in genome annotations is important for supporting many computational tasks, particularly metabolic modeling. The SEED project has implemented a process that improves annotation consistencies across microbial genomes for proteins with conserved sequences and genomic context. In this research report, we describe this process and show how this effort has resulted in improvements to microbial genome annotations in the SEED. We also compare SEED annotation consistencies with other commonly used resources such as IMG (the Joint Genome Institute's Integrated Microbial Genomes system), RefSeq (the National Center for Biotechnology Information's Reference Sequence Database), Swiss-Prot (the annotated protein sequence database of the Swiss Institute of Bioinformatics, European Molecular Biology Laboratory and the European Bioinformatics Institute) and TrEMBL (Translated European Molecular Biology Laboratory nucleotide sequence data Library). Our analysis indicates that manual and computational efforts are paying off for the databases where consistency is a major goal.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4026451PMC
http://dx.doi.org/10.1007/s13205-013-0152-2DOI Listing

Publication Analysis

Top Keywords

genome annotations
8
annotation consistencies
8
microbial genomes
8
sequence database
8
european molecular
8
molecular biology
8
biology laboratory
8
search genome
4
genome annotation
4
annotation consistency
4

Similar Publications

MAI-TargetFisher: A proteome-wide drug target prediction method synergetically enhanced by artificial intelligence and physical modeling.

Acta Pharmacol Sin

January 2025

Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China.

Computational target identification plays a pivotal role in the drug development process. With the significant advancements of deep learning methods for protein structure prediction, the structural coverage of human proteome has increased substantially. This progress inspired the development of the first genome-wide small molecule targets scanning method.

View Article and Find Full Text PDF

Alternative splicing impacts most multi-exonic human genes. Inaccuracies during this process may have an important role in ageing and disease. Here, we investigate splicing accuracy using RNA-sequencing data from >14k control samples and 40 human body sites, focusing on split reads partially mapping to known transcripts in annotation.

View Article and Find Full Text PDF

Benzo (a) pyrene produced by food during high-temperature process enters the body through ingestion, which causes food safety issues to the human body. In order to alleviate the harm of foodborne benzo (a) pyrene to human health, a strain that can degrade benzo (a) pyrene was screened from Kefir, a traditional fermented product in Xinjiang. Bacillus cereus M72-4 is a Gram-positive bacteria sourced from Xinjiang traditional fermented product Kefir, under Benzo(a)pyrene stress conditions, there was 69.

View Article and Find Full Text PDF

Background: Paeonia lactiflora Pall., a member of Paeoniaceae family, is a medicinal herb widely used in traditional Chinese medicine. Chloroplasts are multifunctional organelles containing distinct genetic material.

View Article and Find Full Text PDF

In 2020, we isolated a strain, Marseille-Q4381, from healthy skin. We describe herein its genome sequence and annotation characteristics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!