Mapping algorithms for molecular similarity analysis and ligand-based virtual screening: design of DynaMAD and comparison with MAD and DMC.

J Chem Inf Model

Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany.

Published: September 2006

The DynaMAD algorithm maps database compounds to specific ranges of descriptor values based on activity classes to help identify new active molecules, enhancing previous methods MAD and DMC.
In comparisons of virtual screening across 24 compound activity classes and about 2 million compounds, DynaMAD outperformed MAD and DMC, achieving average hit rates of 55% and recovery rates of 33%.
The study highlights that dynamic mapping methods not only enriched the selection of active molecules in small sample sizes but also effectively identified remote molecular similarities, showcasing their value in molecular analysis and virtual screening.

Here, we introduce the DynaMAD algorithm that is designed to map database compounds to combinations of activity-class-dependent descriptor value ranges in order to identify novel active molecules. The method combines and extends key features of two previously developed algorithms, MAD and DMC. These methods were first described as compound-mapping algorithms for large-scale virtual screening applications. DynaMAD and DMC operate in chemical spaces of stepwise increasing dimensionality. However, in contrast to DMC, which utilizes binary transformed descriptors, DynaMAD uses unmodified descriptor value distributions. The performance of these mapping methods was compared in detail in virtual screening trials on 24 different compound activity classes against a background of about 2 million database compounds. In these calculations, all three approaches produced results of considerable predictive value, and the enrichment of active molecules in small selection sets consisting of only about 20 or fewer database compounds emerged as a common feature. Furthermore, mapping methods were capable of recognizing remote molecular similarity relationships. Overall, DynaMAD performed better than MAD and DMC, producing average hit and recovery rates of 55% and 33%, respectively, over all 24 classes. Taken together, our findings suggest that dynamic compound mapping to combinations of activity-class-selective descriptor settings has significant potential for molecular similarity analysis and ligand-based virtual screening.

Download full-text PDF	Source
http://dx.doi.org/10.1021/ci060083o	DOI Listing

Publication Analysis

Top Keywords

virtual screening

molecular similarity

mad dmc

database compounds

similarity analysis

analysis ligand-based

ligand-based virtual

active molecules

mapping methods

dynamad

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!