Probabilistic Modeling of Conformational Space for 3D Machine Learning Approaches.

Mol Inform

Center for Bioinformatics, University of Tübingen, Sand 1, 72076 Tübingen, Germany phone/fax:+49 7071 29 77175/+49 7071 29 5091.

Published: May 2010

We present a new probabilistic encoding of the conformational space of a molecule that allows for the integration into common similarity calculations. The method uses distance profiles of flexible atom-pairs and computes generative models that describe the distance distribution in the conformational space. The generative models permit the use of probabilistic kernel functions and, therefore, our approach can be used to extend existing 3D molecular kernel functions, as applied in support vector machines, to build QSAR models. The resulting kernels are valid 4D kernel functions and reduce the dependency of the model quality on suitable conformations of the molecules. We showed in several experiments the robust performance of the 4D kernel function, which was extended by our approach, in comparison to the original 3D-based kernel function. The new method compares the conformational space of two molecules within one kernel evaluation. Hence, the number of kernel evaluations is significantly reduced in comparison to common kernel-based conformational space averaging techniques. Additionally, the performance gain of the extended model correlates with the flexibility of the data set and enables an a priori estimation of the model improvement.

Download full-text PDF

Source
http://dx.doi.org/10.1002/minf.201000036DOI Listing

Publication Analysis

Top Keywords

conformational space
20
kernel functions
12
generative models
8
kernel function
8
kernel
7
conformational
5
space
5
probabilistic modeling
4
modeling conformational
4
space machine
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!