AI Article Synopsis

  • Scientists are using new computer methods called text-mining to analyze lots of scientific articles quickly, which helps them build networks of information that are too complex to understand by just reading.
  • They focused on a type of bacteria called PCC 6803, which hasn't been studied as much, to show how this technique can help find connections between genes that weren't known before.
  • By combining their findings with previous research and using special rules to search for new gene connections, they created a helpful tool that anyone can access to learn more about gene interactions.

Article Abstract

The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from 'reading the literature'. The utility of text-mining for well-studied species is obvious though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already 'known', and (2) demanding multiple independent evidences for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to () discover novel candidate associations between different genes or proteins in the network, and () rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5970561PMC
http://dx.doi.org/10.7717/peerj.4806DOI Listing

Publication Analysis

Top Keywords

gene-gene association
8
association network
8
pcc 6803
8
advanced text-mining
8
information-rich networks
8
studied species
8
network resource
8
novel candidate
8
network
7
text-mining
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!