CGRdb2.0: A Python Database Management System for Molecules, Reactions, and Chemical Data.

J Chem Inf Model

Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, 001-0021 Sapporo, Japan.

Published: May 2022

This work introduces CGRdb2.0─an open-source database management system for molecules, reactions, and chemical data. CGRdb2.0 is a Python package connecting to a PostgreSQL database that enables native searches for molecules and reactions without complicated SQL syntax. The library provides out-of-the-box implementations for similarity and substructure searches for molecules, as well as similarity and substructure searches for reactions in two ways─based on reaction components and based on the Condensed Graph of Reaction approach, the latter significantly accelerating the performance. In benchmarking studies with the RDKit database cartridge, we demonstrate that CGRdb2.0 performs searches faster for smaller data sets, while allowing for interactive access to the retrieved data.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.1c01105DOI Listing

Publication Analysis

Top Keywords

molecules reactions
12
cgrdb20 python
8
database management
8
management system
8
system molecules
8
reactions chemical
8
chemical data
8
searches molecules
8
similarity substructure
8
substructure searches
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!