Semantic similarity measurement between gene ontology terms based on exclusively inherited shared information.

Gene

School of Information Science and Technology, Sun Yat-sen University, Guangzhou, PR China.

Published: March 2015

Quantifying the semantic similarity between pairs of terms in the Gene Ontology (GO) structure can help to explore the functional relationships between biological entities. A common approach is to measure the information two terms have in common from the information content of their common ancestors. However, many existing methods are limited in how they measure the information two GO terms share. This study presented a new measurement, exclusively inherited shared information (EISI), that captured the information shared by two terms based on an intuitive observation about the multiple inheritance relationships among the terms in the GO graph. EISI was derived from the information content of the exclusively inherited common ancestors (EICAs), which were screened from the common ancestors according to the attribute of their direct children. The effectiveness of EISI was evaluated against several state-of-the-art measurements on both artificial and real datasets. On the artificial dataset it produced results that agreed more closely with experts' scores, and on the Saccharomyces Genome Database (SGD) it was consistent with prior knowledge of gene function in pathways. The promising features of EISI are the following: (1) it provides a more effective way to characterize the semantic relationship between two GO terms by taking into account multiple related common ancestors, and (2) it can detect all EICAs with a time complexity of O(n), which is much more efficient than other methods based on disjunctive common ancestors. It is a promising alternative to multiple-inheritance-based methods for practical applications on large-scale datasets. The EISI algorithm was implemented in Matlab and is freely available from http://treaton.evai.pl/EISI/.
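The common-ancestor approach described in the abstract can be illustrated with a small sketch. It is not the authors' Matlab implementation, and the abstract does not give the exact EICA definition. The toy graph, the term names, the probabilities, and in particular the screening rule in `screen_by_children` (keep a common ancestor if one of its direct children lies on the ancestor path of exactly one of the two terms) are all assumptions made for illustration only. The baseline score shown is the widely used "most informative common ancestor" measure, not EISI itself.

```python
import math

# Toy DAG, child -> direct parents. Hypothetical term names, not real GO IDs.
PARENTS = {
    "root": set(),
    "a": {"root"},
    "b": {"root"},
    "c": {"a", "b"},   # multiple inheritance
    "d": {"a"},
    "e": {"c", "d"},   # multiple inheritance
    "f": {"c"},
}

# Hypothetical annotation probabilities p(t); ancestors are never rarer
# than their descendants.
PROB = {"root": 1.0, "a": 0.6, "b": 0.5, "c": 0.3, "d": 0.2, "e": 0.05, "f": 0.1}

# Invert the parent map once so direct children can be looked up quickly.
CHILDREN = {term: set() for term in PARENTS}
for child, parents in PARENTS.items():
    for parent in parents:
        CHILDREN[parent].add(child)


def information_content(term):
    """IC(t) = -log p(t): rarer terms carry more information."""
    return -math.log(PROB[term])


def ancestors(term):
    """All ancestors of a term, including the term itself."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def common_ancestors(t1, t2):
    return ancestors(t1) & ancestors(t2)


def most_informative_common_ancestor_score(t1, t2):
    """Baseline: highest IC among all common ancestors."""
    return max(information_content(t) for t in common_ancestors(t1, t2))


def screen_by_children(t1, t2):
    """ASSUMED rule, not the paper's definition: keep a common ancestor
    when some direct child is an ancestor of exactly one of the terms."""
    anc1, anc2 = ancestors(t1), ancestors(t2)
    kept = set()
    for ca in anc1 & anc2:
        if any((child in anc1) != (child in anc2) for child in CHILDREN[ca]):
            kept.add(ca)
    return kept


if __name__ == "__main__":
    print(sorted(common_ancestors("e", "f")))
    print(sorted(screen_by_children("e", "f")))
    print(most_informative_common_ancestor_score("e", "f"))
```

Under the assumed rule, the screening step visits each common ancestor once and inspects only its direct children, which is the kind of single pass that a linear-time screen would require. How the retained ancestors' information content is combined into a final score is not stated in the abstract and is left out here.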


Source
http://dx.doi.org/10.1016/j.gene.2014.12.062

Publication Analysis

Top Keywords

common ancestors: 20
exclusively inherited: 12
gene ontology: 8
terms based: 8
inherited shared: 8
multiple inheritance: 8
common: 7
terms: 6
based: 5
ancestors: 5

Similar Publications

Chromosome-level genome assembly and annotation of largemouth bronze gudgeon (Coreius guichenoti).

Sci Data

January 2025

Key Laboratory of Freshwater Biodiversity Conservation, Ministry of Agriculture and Rural Affairs, Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China.

Coreius guichenoti, mainly distributed in the upstream regions of the Yangtze River, China, is currently on the brink of extinction and is listed as a national secondary protected animal. In this study, we aimed to obtain the chromosome-level genome of C. guichenoti using PacBio and Hi-C techniques.


It is under debate whether intersubjectivity, the capacity to experience a sense of togetherness around an action, is unique to humans. In humans, heavy tickling, a repeated body-probing play that causes an automatic response including uncontrollable laughter (gargalesis), has been linked to the emergence of intersubjectivity, as it is aimed at making others laugh (self-generated responses are inhibited), it is often asymmetrical (older to younger subjects), and it elicits agent-dependent responses (pleasant/unpleasant depending on social bond). Intraspecific tickling and the related gargalesis response have been reported in humans, chimpanzees, and anecdotally in other great apes, potentially setting the line between hominids and other anthropoids.


Background: Zinc finger homeodomain (ZF-HD) belongs to the plant-specific transcription factor (TF) family and is widely involved in plant growth, development and stress responses. Despite their importance, a comprehensive identification and analysis of ZF-HD genes in the soybean (Glycine max) genome and their possible roles under abiotic stress remain unexplored.

Results: In this study, 51 ZF-HD genes were identified in the soybean genome that were unevenly distributed on 17 chromosomes.


The synaptonemal complex (SC) is a protein-rich structure essential for meiotic recombination and faithful chromosome segregation. Acting like a zipper between paired homologous chromosomes during early prophase I, the complex is a symmetrical structure in which central elements are connected on two sides by the transverse filaments to the chromatin-anchoring lateral elements. Despite being found in most major eukaryotic taxa, implying a deeply conserved evolutionary origin, several components of the complex exhibit unusually high rates of sequence turnover.


The subfamily Mileewinae in China comprises one tribe (Mileewini), four genera (, , , ), and 71 species, yet only 11 mitochondrial genomes have been published. This study aimed to elucidate ambiguous diagnostic traits in traditional taxonomy and examined phylogenetic relationships among genera by sequencing mitochondrial genomes from 16 species. The lengths of the mitochondrial genomes ranged from 14,532 to 15,280 bp, exhibiting an AT content of 77.

