Connecting the dots between PubMed abstracts.

PLoS One

Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America.

Published: May 2012

Background: There are now a multitude of articles published in a diversity of journals providing information about genes, proteins, pathways, and diseases. Each article investigates subsets of a biological process, but to gain insight into the functioning of a system as a whole, we must integrate information from multiple publications. Particularly, unraveling relationships between extra-cellular inputs and downstream molecular response mechanisms requires integrating conclusions from diverse publications.

Methodology: We present an automated approach to biological knowledge discovery from PubMed abstracts, suitable for "connecting the dots" across the literature. We describe a storytelling algorithm that, given a start and end publication, typically with little or no overlap in content, identifies a chain of intermediate publications from one to the other, such that neighboring publications have significant content similarity. The quality of discovered stories is measured using local criteria such as the size of supporting neighborhoods for each link and the strength of individual links connecting publications, as well as global metrics of dispersion. To ensure that the story stays coherent as it meanders from one publication to another, we demonstrate the design of novel coherence and overlap filters for use as post-processing steps.

Conclusions: WE DEMONSTRATE THE APPLICATION OF OUR STORYTELLING ALGORITHM TO THREE CASE STUDIES: i) a many-one study exploring relationships between multiple cellular inputs and a molecule responsible for cell-fate decisions, ii) a many-many study exploring the relationships between multiple cytokines and multiple downstream transcription factors, and iii) a one-to-one study to showcase the ability to recover a cancer related association, viz. the Warburg effect, from past literature. The storytelling pipeline helps narrow down a scientist's focus from several hundreds of thousands of relevant documents to only around a hundred stories. We argue that our approach can serve as a valuable discovery aid for hypothesis generation and connection exploration in large unstructured biological knowledge bases.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3250456PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0029509PLOS

Publication Analysis

Top Keywords

pubmed abstracts
8
biological knowledge
8
storytelling algorithm
8
study exploring
8
exploring relationships
8
relationships multiple
8
connecting dots
4
dots pubmed
4
abstracts background
4
background multitude
4

Similar Publications

Recovery support services as part of the continuum of care for alcohol or drug use disorders.

Addiction

January 2025

Harvard Medical School and Center for Addiction Medicine, Recovery Research Institute, at Massachusetts General Hospital, Boston, MA, USA.

Background: The definition of 'recovery' has evolved beyond merely control of problem substance use to include other aspects of health and wellbeing (known as 'recovery capital') which are important to prevent relapse to problematic alcohol or other drug (AOD) use. Developing a Recovery Oriented System of Care (ROSC) requires consideration of interventions or services (Recovery Support Services, RSS) designed to build recovery capital which are often delivered alongside established treatment structures. Lived experience and its application to the process of engaging people, changing behaviour and relapse prevention is an essential part of these services.

View Article and Find Full Text PDF

Introduction/purpose: Teleultrasound connects expert point-of-care ultrasound (POCUS) users with remote community and rural sites. Evolving technologies including handheld devices, upgraded image quality, and the ability to transmit over low bandwidth connections increase POCUS education, accessibility, and clinical integration. Potential teleultrasound venues include low-resource settings, prehospital care, and austere environments (high altitudes, microgravity, conflict zones, etc.

View Article and Find Full Text PDF

Amenorrhea is a common symptom of a whole range of nosologies among women of reproductive age, which can accompany any endocrinopathy in the stage of decompensation. In all the diversity of various links in the pathogenesis of reproductive disorders, the problem of immunopathology remains a little aside, however, the significance of these disorders is underestimated. This publication provides an overview of immune system abnormalities in a women with amenorrhea.

View Article and Find Full Text PDF

Background & Objective: Currently, there are many implants in clinical use, making it hard to choose the right one for the patient. The success rate of an implant depends on its diameter, length, and direction of insertion in bone. In implant dentistry, Finite Element Analysis (FEA) simulates intraoral conditions in vitro and analyzes the effects of implant material, diameter, size, and other components related to oral structure on the implant and peri-implant tissues.

View Article and Find Full Text PDF

Introduction: Guillain-Barré syndrome (GBS) is an inflammatory disorder of the peripheral nervous system, causing acute flaccid paralysis. There have been occasional reports linking Hepatitis A virus (HAV) to GBS. Here we aimed to evaluate the current literature on the association between GBS and HAV, exploring potential mechanisms and clinical implications.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!