Resource description framework (RDF) and Property Graph databases are emerging technologies that are used for storing graph-structured data. We compare these technologies through a molecular biology use case: glycan substructure search. Glycans are branched tree-like molecules composed of building blocks linked together by chemical bonds. The molecular structure of a glycan can be encoded into a direct acyclic graph where each node represents a building block and each edge serves as a chemical linkage between two building blocks. In this context, Graph databases are possible software solutions for storing glycan structures and Graph query languages, such as SPARQL and Cypher, can be used to perform a substructure search. Glycan substructure searching is an important feature for querying structure and experimental glycan databases and retrieving biologically meaningful data. This applies for example to identifying a region of the glycan recognised by a glycan binding protein (GBP). In this study, 19,404 glycan structures were selected from GlycomeDB (www.glycome-db.org) and modelled for being stored into a RDF triple store and a Property Graph. We then performed two different sets of searches and compared the query response times and the results from both technologies to assess performance and accuracy. The two implementations produced the same results, but interestingly we noted a difference in the query response times. Qualitative measures such as portability were also used to define further criteria for choosing the technology adapted to solving glycan substructure search and other comparable issues.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684231PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0144578PLOS

Publication Analysis

Top Keywords

glycan substructure
16
substructure search
16
property graph
12
glycan
10
rdf triple
8
triple store
8
graph databases
8
building blocks
8
glycan structures
8
query response
8

Similar Publications

Polymers are widely produced and contribute significantly to environmental pollution due to their low recycling rates and persistence in natural environments. Biodegradable polymers, while promising for reducing environmental impact, account for less than 2% of total polymer production. To expand the availability of biodegradable polymers, research has explored structure-biodegradability relationships, yet most studies focus on specific polymers, necessitating further exploration across diverse polymers.

View Article and Find Full Text PDF

Klebsiella pneumoniae (KP) is a common opportunistic pathogen that emerged as a new critical threat to human health, due to its hypervirulence and widespread resistance against many antibiotics, including carbapenems. Alternative intervention strategies such as vaccines are not available. Cell-surface lipopolysaccharides (LPS) and capsular polysaccharides (CPS) are attractive targets for vaccine development.

View Article and Find Full Text PDF

Structure-activity relationship of synthesized glucans from Ganoderma lucidum with in vitro hypoglycemic activity.

Int J Biol Macromol

December 2024

State Key Laboratory of Phytochemistry and Plant Resources in West China, Yunnan Key Laboratory of Natural Medicinal Chemistry, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China. Electronic address:

Article Synopsis
  • Synthetic polysaccharides can be utilized to design new drugs by examining their structure-activity relationships (SAR), particularly in terms of protein stability.
  • The study focuses on the glucan GLSWA-1 and its substructures, revealing that compound 2 enhances insulin secretion in a dose-dependent manner while maintaining insulin's thermal stability.
  • Molecular dynamics simulations indicate that compound 2 forms a "groove-binding model" with insulin, suggesting its potential as a hypoglycemic agent due to its favorable structural characteristics.
View Article and Find Full Text PDF

N-glycosylation is one of the most common protein modifications in eukaryotes, with immense importance at the molecular, cellular, and organismal level. Accurate and reliable N-glycan analysis is essential to obtain a systems-wide understanding of fundamental biological processes. Due to the structural complexity of glycans, their analysis is still highly challenging.

View Article and Find Full Text PDF

Integration of MALDI glycotyping and NMR analysis to uncover an O-antigen substructure from pathogenic Escherichia coli O111.

Int J Biol Macromol

December 2024

Graduate School of Life Science and Faculty of Advanced Life Science, Frontier Research Center for Advanced Material and Life Science, Hokkaido University, N21, W11, Sapporo 001-0021, Japan. Electronic address:

Escherichia coli O111 is a critical pathogenic E. coli serotype that causes severe, potentially fatal complications. Despite its reported variation, only one structure of the O-antigen polysaccharide from E.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!