SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases.

BMC Bioinformatics

National Institute for Basic Biology, National Institutes of Natural Sciences, Nishigonaka 38, Myodaiji, Okazaki, Aichi, 444-8585, Japan.

Published: February 2017

Background: Toward improved interoperability of distributed biological databases, an increasing number of datasets have been published in the standardized Resource Description Framework (RDF). Although the powerful SPARQL Protocol and RDF Query Language (SPARQL) provides a basis for exploiting RDF databases, writing SPARQL code is burdensome for users including bioinformaticians. Thus, an easy-to-use interface is necessary.

Results: We developed SPANG, a SPARQL client that has unique features for querying RDF datasets. SPANG dynamically generates typical SPARQL queries according to specified arguments. It can also call SPARQL template libraries constructed in a local system or published on the Web. Further, it enables combinatorial execution of multiple queries, each with a distinct target database. These features facilitate easy and effective access to RDF datasets and integrative analysis of distributed data.

Conclusions: SPANG helps users to exploit RDF datasets by generation and reuse of SPARQL queries through a simple interface. This client will enhance integrative exploitation of biological RDF datasets distributed across the Web. This software package is freely available at http://purl.org/net/spang .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5299738PMC
http://dx.doi.org/10.1186/s12859-017-1531-1DOI Listing

Publication Analysis

Top Keywords

rdf datasets
16
spang sparql
8
sparql client
8
generation reuse
8
rdf
8
rdf databases
8
sparql queries
8
sparql
7
datasets
5
spang
4

Similar Publications

Faced with heterogeneity of healthcare data, we propose a novel approach for harmonizing data elements (i.e., attributes) across health data standards.

View Article and Find Full Text PDF

The expansion of multi-omics datasets raises significant challenges for data integration and querying. To overcome these challenges, we developed a generic RDF-based integration schema that connects various types of differential -omics data, epigenomics, and regulatory information. This schema employs the FALDO ontology to enable querying based on genomic locations.

View Article and Find Full Text PDF
Article Synopsis
  • Machine learning techniques, specifically principal component analysis (PCA) and diffusion maps (DM), are used to analyze complex datasets from molecular dynamics simulations of polylactide (PLA) and poly(3-hydroxybutyrate) (PHB) to evaluate glass transition temperature (Tg).
  • Four molecular descriptors—radial distribution functions (RDFs), mean square displacements (MSDs), relative square displacements (RSDs), and dihedral angles (DAs)—are employed to facilitate this evaluation.
  • The use of Gaussian Mixture Models (GMMs) to analyze the data reveals a clear distinction between melt and glass states and shows that Tg values derived from DM and certain descriptors align well with
View Article and Find Full Text PDF

RDF graph pair profile dataset for the data linking community.

Data Brief

December 2024

MISTEA, University of Montpellier, INRAE & Institut Agro, France, 2, place Pierre Viala, 34060 Montpellier Cedex 2, France.

As the number of RDF datasets published on the Web grows, it becomes increasingly important to link similar entities across these datasets. We present the "RDF graph pair profiles dataset", designed to help the data linking community develop tools and carry out evaluation work. This dataset includes profiles of 30 RDF graph pairs, classified according to ontology matching (OM), instance matching (IM) or both (OM + IM).

View Article and Find Full Text PDF
Article Synopsis
  • A highly valuable medicinal plant in China is suffering from replant disease, which is caused by factors like allelochemicals and microbes that create complex stress conditions.
  • This study established different treatments to investigate the plant's response to replant disease using sRNA-seq technology, leading to the identification of 201 miRNAs, with notable differences in expression profiles between continuously planted and normally planted samples.
  • The research found that certain miRNAs are downregulated in replant disease, hindering plant growth, while others that are upregulated can negatively affect root formation and nutrient metabolism, contributing to oxidative stress in the plant.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!