Resource description framework (RDF) and Property Graph databases are emerging technologies that are used for storing graph-structured data. We compare these technologies through a molecular biology use case: glycan substructure search. Glycans are branched tree-like molecules composed of building blocks linked together by chemical bonds. The molecular structure of a glycan can be encoded into a direct acyclic graph where each node represents a building block and each edge serves as a chemical linkage between two building blocks. In this context, Graph databases are possible software solutions for storing glycan structures and Graph query languages, such as SPARQL and Cypher, can be used to perform a substructure search. Glycan substructure searching is an important feature for querying structure and experimental glycan databases and retrieving biologically meaningful data. This applies for example to identifying a region of the glycan recognised by a glycan binding protein (GBP). In this study, 19,404 glycan structures were selected from GlycomeDB (www.glycome-db.org) and modelled for being stored into a RDF triple store and a Property Graph. We then performed two different sets of searches and compared the query response times and the results from both technologies to assess performance and accuracy. The two implementations produced the same results, but interestingly we noted a difference in the query response times. Qualitative measures such as portability were also used to define further criteria for choosing the technology adapted to solving glycan substructure search and other comparable issues.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684231 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0144578 | PLOS |
Environ Sci Technol
January 2025
Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States.
Polymers are widely produced and contribute significantly to environmental pollution due to their low recycling rates and persistence in natural environments. Biodegradable polymers, while promising for reducing environmental impact, account for less than 2% of total polymer production. To expand the availability of biodegradable polymers, research has explored structure-biodegradability relationships, yet most studies focus on specific polymers, necessitating further exploration across diverse polymers.
View Article and Find Full Text PDFAngew Chem Int Ed Engl
December 2024
Department of Biomolecular Systems, Max Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.
Klebsiella pneumoniae (KP) is a common opportunistic pathogen that emerged as a new critical threat to human health, due to its hypervirulence and widespread resistance against many antibiotics, including carbapenems. Alternative intervention strategies such as vaccines are not available. Cell-surface lipopolysaccharides (LPS) and capsular polysaccharides (CPS) are attractive targets for vaccine development.
View Article and Find Full Text PDFInt J Biol Macromol
December 2024
State Key Laboratory of Phytochemistry and Plant Resources in West China, Yunnan Key Laboratory of Natural Medicinal Chemistry, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China. Electronic address:
Nat Commun
November 2024
Institute of Biochemistry, Department of Chemistry, University of Natural Resources and Life Sciences (BOKU), Muthgasse 18, Vienna, Austria.
N-glycosylation is one of the most common protein modifications in eukaryotes, with immense importance at the molecular, cellular, and organismal level. Accurate and reliable N-glycan analysis is essential to obtain a systems-wide understanding of fundamental biological processes. Due to the structural complexity of glycans, their analysis is still highly challenging.
View Article and Find Full Text PDFInt J Biol Macromol
December 2024
Graduate School of Life Science and Faculty of Advanced Life Science, Frontier Research Center for Advanced Material and Life Science, Hokkaido University, N21, W11, Sapporo 001-0021, Japan. Electronic address:
Escherichia coli O111 is a critical pathogenic E. coli serotype that causes severe, potentially fatal complications. Despite its reported variation, only one structure of the O-antigen polysaccharide from E.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!