Background: Biomedical knowledge graphs have become important tools to computationally analyse the comprehensive body of biomedical knowledge. They represent knowledge as subject-predicate-object triples, in which the predicate indicates the relationship between subject and object. A triple can also contain provenance information, which consists of references to the sources of the triple (e.g. scientific publications or database entries). Knowledge graphs have been used to classify drug-disease pairs for drug efficacy screening, but existing computational methods have often ignored predicate and provenance information. Using this information, we aimed to develop a supervised machine learning classifier and determine the added value of predicate and provenance information for drug efficacy screening. To ensure the biological plausibility of our method we performed our research on the protein level, where drugs are represented by their drug target proteins, and diseases by their disease proteins.

Results: Using random forests with repeated 10-fold cross-validation, our method achieved an area under the ROC curve (AUC) of 78.1% and 74.3% for two reference sets. We benchmarked against a state-of-the-art knowledge-graph technique that does not use predicate and provenance information, obtaining AUCs of 65.6% and 64.6%, respectively. Classifiers that only used predicate information performed superior to classifiers that only used provenance information, but using both performed best.

Conclusion: We conclude that both predicate and provenance information provide added value for drug efficacy screening.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6127943PMC
http://dx.doi.org/10.1186/s13326-018-0189-6DOI Listing

Publication Analysis

Top Keywords

predicate provenance
20
drug efficacy
16
efficacy screening
16
biomedical knowledge
8
knowledge graphs
8
predicate
7
provenance
6
knowledge
5
drug
5
provenance knowledge
4

Similar Publications

Introduction: Computable biomedical knowledge artifacts (CBKs) are digital objects conveying biomedical knowledge in machine-interpretable structures. As more CBKs are produced and their complexity increases, the value obtained from sharing CBKs grows. Mobilizing CBKs and sharing them widely can only be achieved if the CBKs are findable, accessible, interoperable, reusable, and trustable (FAIR+T).

View Article and Find Full Text PDF

Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed.

View Article and Find Full Text PDF

A Critical Appraisal of Global Testing Protocols for Zoonotic Parasites in Imported Seafood Applied to Seafood Safety in Australia.

Foods

April 2020

School of Animal and Veterinary Sciences & Graham Centre for Agricultural Innovation, Charles Sturt University, Wagga Wagga, NSW 2650, Australia.

It is not suggested that any country is intentionally exporting seafood which does not comply with Codex seafood-safety guidelines/codes/standards. However, with an open access resource such as fisheries, there is vast potential for errors to occur along convoluted supply chains, spanning multiple countries, which may negatively impact the safety of edible seafood products imported into Australia. Australian importation policy and inspection procedures are founded upon a bedrock of trust in the integrity, reliability and safety of the global seafood supply chain.

View Article and Find Full Text PDF

Neuronal diversity in the somatosensory system: bridging the gap between cell type and function.

Curr Opin Neurobiol

June 2019

Molecular Neurobiology Laboratory, The Salk Institute for Biological Studies, 10010 North Torrey Pines Rd, La Jolla, CA 92037, USA. Electronic address:

A recent flurry of genetic studies in mice have provided key insights into how the somatosensory system is organized at a cellular level to encode itch, pain, temperature, and touch. These studies are largely predicated on the idea that functional cell types can be identified by their unique developmental provenance and gene expression profile. However, the extent to which gene expression profiles can be correlated with functional cell types and circuit organization remains an open question.

View Article and Find Full Text PDF

ProvCaRe: Characterizing scientific reproducibility of biomedical research studies using semantic provenance metadata.

Int J Med Inform

January 2019

Department of Medicine, Brigham and Women's Hospital and Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.

Objective: Reproducibility of research studies is key to advancing biomedical science by building on sound results and reducing inconsistencies between published results and study data. We propose that the available data from research studies combined with provenance metadata provide a framework for evaluating scientific reproducibility. We developed the ProvCaRe platform to model, extract, and query semantic provenance information from 435, 248 published articles.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!