Flexible Analog Search with Kernel PCA Embedded Molecule Vectors.

Comput Struct Biotechnol J

Department of Bioengineering, Stanford University, Shriram Center Room 213, 443 Via Ortega MC 4245, Stanford, CA 94305, United States.

Published: March 2017

Studying analog series to find structural transformations that enhance the activity and ADME properties of lead compounds is an important part of drug development. Matched molecular pair (MMP) search is a powerful tool for analog analysis that imitates researchers' ability to select pairs of compounds that differ only by small well-defined transformations. Abstraction is a challenge for existing MMP search algorithms, which can result in the omission of relevant, inexact MMPs, and inclusion of irrelevant, contextually dissimilar MMPs. In this work, we present a new method for MMP search that returns approximate results and enables flexible control over abstraction of contextual information. We illustrate the concepts and mechanics of our method with a series of exemplar MMP queries, and then benchmark search accuracy using MMPs found by fragment indexing. We show that we can search for MMPs in a context dependent manner, and accurately approximate context independent fragment index based MMP search over a range of fingerprint and dataset conditions. Our method can be used to search for pairwise correspondences among analog sets and bolster MMP datasets where data is missing or incomplete.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5396859PMC
http://dx.doi.org/10.1016/j.csbj.2017.03.003DOI Listing

Publication Analysis

Top Keywords

mmp search
16
search
8
mmp
6
flexible analog
4
analog search
4
search kernel
4
kernel pca
4
pca embedded
4
embedded molecule
4
molecule vectors
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!