ChemProps: A RESTful API enabled database for composite polymer name standardization.

J Cheminform

Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC, 27708, USA.

Published: March 2021

AI Article Synopsis

Article Abstract

The inconsistency of polymer indexing caused by the lack of uniformity in expression of polymer names is a major challenge for widespread use of polymer related data resources and limits broad application of materials informatics for innovation in broad classes of polymer science and polymeric based materials. The current solution of using a variety of different chemical identifiers has proven insufficient to address the challenge and is not intuitive for researchers. This work proposes a multi-algorithm-based mapping methodology entitled ChemProps that is optimized to solve the polymer indexing issue with easy-to-update design both in depth and in width. RESTful API is enabled for lightweight data exchange and easy integration across data systems. A weight factor is assigned to each algorithm to generate scores for candidate chemical names and optimized to maximize the minimum value of the score difference between the ground truth chemical name and the other candidate chemical names. Ten-fold validation is utilized on the 160 training data points to prevent overfitting issues. The obtained set of weight factors achieves a 100% test accuracy on the 54 test data points. The weight factors will evolve as ChemProps grows. With ChemProps, other polymer databases can remove duplicate entries and enable a more accurate "search by SMILES" function by using ChemProps as a common name-to-SMILES translator through API calls. ChemProps is also an excellent tool for auto-populating polymer properties thanks to its easy-to-update design.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7955638PMC
http://dx.doi.org/10.1186/s13321-021-00502-6DOI Listing

Publication Analysis

Top Keywords

restful api
8
api enabled
8
polymer
8
polymer indexing
8
easy-to-update design
8
candidate chemical
8
chemical names
8
data points
8
weight factors
8
chemprops
6

Similar Publications

Background: Stability during early postnatal life in preterm infants is related to better outcomes. Although vital signs are monitored continuously in Neonatal Intensive Care Unites, this monitoring does not include all physiological parameters nor data such as movement patterns. Although there are scattered sources of data, there is no centralized data hub for neonates information.

View Article and Find Full Text PDF

EnteroBase in 2025: exploring the genomic epidemiology of bacterial pathogens.

Nucleic Acids Res

January 2025

Leibniz Institute DSMZ, Germany-German Collection of Microorganisms and Cell Cultures, Inhoffenstr. 7B, 38124 Braunschweig, Germany.

Article Synopsis
  • EnteroBase is a web-based platform that offers curated databases of genome sequences from over 1.1 million bacterial isolates, including notable pathogens like Streptococcus and Mycobacterium tuberculosis.
  • The platform now features tools for detecting antimicrobial resistance and a new bubble plot tool for visualizing bacterial genomic structures.
  • Enhanced access and functionalities are provided through a user-friendly interface and a RESTful API, with ongoing development by an international consortium to improve and maintain the system.
View Article and Find Full Text PDF

The advent of artificial intelligence has positively transformed many areas of our lives, including the medical field. In this article, we propose the development of a medical diagnosis chatbot based on patients' symptoms, using artificial intelligence as an innovative solution. The aim of this tool is to provide doctors with a preliminary diagnosis based on the symptoms presented by patients.

View Article and Find Full Text PDF

The interoperability of healthcare data across various systems remains a big challenge, largely attributable to the disparate data schemas and APIs in use. This study showcases the integration of a FHIR layer into GameBus, a gamified health platform, aiming to enhance its interoperability. Traditionally, GameBus has relied on proprietary data schemas and REST APIs, which restricted data exchange with other platforms.

View Article and Find Full Text PDF

The European Bioinformatics Institute (EMBL-EBI)'s Job Dispatcher framework provides access to a wide range of core databases and analysis tools that are of key importance in bioinformatics. As well as providing web interfaces to these resources, web services are available using REST and SOAP protocols that enable programmatic access and allow their integration into other applications and analytical workflows and pipelines. This article describes the various options available to researchers and bioinformaticians who would like to use our resources via the web interface employing RESTful web services clients provided in Perl, Python, and Java or who would like to use Docker containers to integrate the resources into analysis pipelines and workflows.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!