Visualizing chemical space networks with RDKit and NetworkX.

J Cheminform

University Libraries, Rodgers Library for Science and Engineering, The University of Alabama, Tuscaloosa, AL, 35487, USA.

Published: December 2022

This article demonstrates how to create Chemical Space Networks (CSNs) using a Python RDKit and NetworkX workflow. CSNs are a type of network visualization that depict compounds as nodes connected by edges, defined as a pairwise relationship such as a 2D fingerprint similarity value. A step by step approach is presented for creating two different CSNs in this manuscript, one based on RDKit 2D fingerprint Tanimoto similarity values, and another based on maximum common substructure similarity values. Several different CSN visualization features are included in the tutorial including methods to represent nodes with color based on bioactivity attribute value, edges with different line styles based on similarity value, as well as replacing the circle nodes with 2D structure depictions. Finally, some common network property and analysis calculations are presented including the clustering coefficient, degree assortativity, and modularity. All code is provided in the form of Jupyter Notebooks and is available on GitHub with a permissive BSD-3 open-source license: https://github.com/vfscalfani/CSN_tutorial.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798653PMC
http://dx.doi.org/10.1186/s13321-022-00664-xDOI Listing

Publication Analysis

Top Keywords

chemical space
8
space networks
8
rdkit networkx
8
similarity values
8
visualizing chemical
4
networks rdkit
4
networkx article
4
article demonstrates
4
demonstrates create
4
create chemical
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!