Discovering implicit entity relation with the gene-citation-gene network.

PLoS One

Department of Information and Library Science, Indiana University, Bloomington, Indiana, United States of America.

Published: September 2014

AI Article Synopsis

  • The paper explores a Gene-Citation-Gene (GCG) network which identifies relationships between gene pairs based on citation connections in academic articles.
  • Using a dataset of 331,411 MEDLINE abstracts, the researchers found that while the GCG network has more gene pairs, it has a lower matching rate of known gene interactions compared to the traditional gene-gene (GG) network.
  • Despite this lower match rate, analyzing top-ranked genes from both networks shows that 35.53% match, and the study highlights cancer as a common disease related to the genes in both networks.

Article Abstract

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866152PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0084639PLOS

Publication Analysis

Top Keywords

gcg network
24
gene pairs
20
network
10
network gene
8
matching rate
8
genes networks
8
gcg
7
gene
6
pairs
6
discovering implicit
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!