Representing clusters using a maximum common edge substructure algorithm applied to reduced graphs and molecular graphs.

J Chem Inf Model

Department of Information Studies, University of Sheffield, 211 Portobello Street, Regent Court, Sheffield, United Kingdom.

Published: May 2007

AI Article Synopsis

  • Chemical databases group similar molecules to help medicinal chemists easily explore shared activities, but using fingerprints can obscure structural similarities.
  • The authors propose using maximum common substructures (MCES) with reduced graphs—where nodes reflect functional groups—to better represent clusters of molecules.
  • This method allows real-time MCES calculations, resulting in easily interpretable cluster representatives that link back to the original molecules, aiding chemists in assessing potential activities and conducting detailed R-group analyses.

Article Abstract

Chemical databases are routinely clustered, with the aim of grouping molecules which share similar structural features. Ideally, medicinal chemists are then able to browse a few representatives of the cluster in order to interpret the shared activity of the cluster members. However, when molecules are clustered using fingerprints, it may be difficult to decipher the structural commonalities which are present. Here, we seek to represent a cluster by means of a maximum common substructure based on the shared functionality of the cluster members. Previously, we have used reduced graphs, where each node corresponds to a generalized functional group, as topological molecular descriptors for virtual screening. In this work, we precluster a database using any clustering method. We then represent the molecules in a cluster as reduced graphs. By repeated application of a maximum common edge substructure (MCES) algorithm, we obtain one or more reduced graph cluster representatives. The sparsity of the reduced graphs means that the MCES calculations can be performed in real time. The reduced graph cluster representatives are readily interpretable in terms of functional activity and can be mapped directly back to the molecules to which they correspond, giving the chemist a rapid means of assessing potential activities contained within the cluster. Clusters of interest are then subject to a detailed R-group analysis using the same iterated MCES algorithm applied to the molecular graphs.

Download full-text PDF

Source
http://dx.doi.org/10.1021/ci600444gDOI Listing

Publication Analysis

Top Keywords

reduced graphs
16
maximum common
12
common edge
8
edge substructure
8
algorithm applied
8
molecular graphs
8
cluster
8
cluster members
8
mces algorithm
8
reduced graph
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!