Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants.

BMC Genomics

Information and Communications Technologies, National Research Council Canada, Ottawa, Ontario, K1A 0R6, Canada.

Published: April 2015

Background: While the gargantuan multi-nation effort of sequencing T. aestivum gets close to completion, the annotation process for the vast number of wheat genes and proteins is in its infancy. Previous experimental studies carried out on model plant organisms such as A. thaliana and O. sativa provide a plethora of gene annotations that can be used as potential starting points for wheat gene annotations, proven that solid cross-species gene-to-gene and protein-to-protein correspondences are provided.

Results: DNA and protein sequences and corresponding annotations for T. aestivum and 9 other plant species were collected from Ensembl Plants release 22 and curated. Cliques of predicted 1-to-1 orthologs were identified and an annotation enrichment model was defined based on existing gene-GO term associations and phylogenetic relationships among wheat and 9 other plant species. A total of 13 cliques of size 10 were identified, which represent putative functionally equivalent genes and proteins in the 10 plant species. Eighty-five new and more specific GO terms were associated with wheat genes in the 13 cliques of size 10, which represent a 65% increase compared with the previously 130 known GO terms. Similar expression patterns for 4 genes from Arabidopsis, barley, maize and rice in cliques of size 10 provide experimental evidence to support our model. Overall, based on clique size equal or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation.

Conclusions: Our novel comparative genomics approach enriches existing T. aestivum gene annotations based on cliques of predicted 1-to-1 orthologs, phylogenetic relationships and existing gene ontologies from 9 other plant species.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426649PMC
http://dx.doi.org/10.1186/s12864-015-1496-2DOI Listing

Publication Analysis

Top Keywords

gene annotations
16
plant species
16
wheat genes
12
cliques size
12
aestivum gene
8
gene ontologies
8
genes proteins
8
cliques predicted
8
predicted 1-to-1
8
1-to-1 orthologs
8

Similar Publications

Prioritizing Context-Dependent Cancer Gene Signatures in Networks.

Cancers (Basel)

January 2025

Avantyx Pharmaceuticals, Miami, FL 33136, USA.

There are numerous ways of portraying cancer complexity based on combining multiple types of data. A common approach involves developing signatures from gene expression profiles to highlight a few key reproducible features that provide insight into cancer risk, progression, or recurrence. Normally, a selection of such features is made through relevance or significance, given a reference context.

View Article and Find Full Text PDF

Background: Genetic discontinuity represents abrupt breaks in genomic identity among species. Advances in genome sequencing have enhanced our ability to track and characterize genetic discontinuity in bacterial populations. However, exploring the degree to which bacterial diversity exists as a continuum or sorted into discrete and readily defined species remains a challenge in microbial ecology.

View Article and Find Full Text PDF

Background: Identification of global transcriptional events is crucial for genome annotation, as accurate annotation enhances the efficiency and comparability of genomic information across species. However, the annotation of transcripts in the cucumber genome remains to be improved, and many transcriptional events have not been well studied.

Results: We collected 1,904 high-quality public cucumber transcriptome samples from the National Center for Biotechnology Information (NCBI) to identify and annotate transcript isoforms in the cucumber genome.

View Article and Find Full Text PDF

Background: Phaius Lour. (Collabieae, Orchidaceae) is a small genus consisting of about 45 species, with highly ornamental and medicinal values. However, the phylogenetic relationship of Phaius among Calanthe s.

View Article and Find Full Text PDF

Comprehensive analysis of the multi-rings mitochondrial genome of Populus tomentosa.

BMC Genomics

January 2025

State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China.

Background: Populus tomentosa, known as Chinese white poplar, is indigenous and distributed across large areas of China, where it plays multiple important roles in forestry, agriculture, conservation, and urban horticulture. However, limited accessibility to the mitochondrial (mt) genome of P. tomentosa impedes phylogenetic and population genetic analyses and restricts functional gene research in Salicaceae family.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!