Predicting the behaviour of proteins in hydrophobic interaction chromatography. 2. Using a statistical description of their surface amino acid distribution.

J Chromatogr A

Centre for Biochemical Engineering and Biotechnology, Department of Chemical and Biotechnology Engineering, University of Chile, Beauchef 861, Santiago, Chile.

Published: February 2006

This paper focuses on the prediction of the dimensionless retention time (DRT) of proteins in hydrophobic interaction chromatography (HIC) by means of mathematical models based on the statistical description of the amino acid surface distribution. Previous models characterises the protein surface as a whole. However, most of the time it is not the whole protein but some of its specific regions that interact with the environment. It seems much more natural to use local measurements of the characteristics of the surface. Therefore, the statistical characterisation of the distribution of an amino acid property on the protein surface was carried out from the systematic calculation of the local average of this property in a neighbourhood placed sequentially on each of the amino acids on the protein surface. This process allowed us to characterise the distribution of this property quantitatively using three main statistics: average, standard deviation and maximum. In particular, if the property considered is a hydrophobicity scale, these statistics allowed us to characterise the average hydrophobicity and the hydrophobic content of the most hydrophobic cluster or hotspot, as well as the heterogeneity of the hydrophobicity distribution on the protein surface. We tested the performance of the DRT predictive models based on these statistics on a set of 15 proteins. We obtained better predictive results with respect to the models previously reported. The best predictive model was a linear model based on the maximum. This statistic was calculated using an index of the mobilities of amino acids in chromatography. The predictive performance of this model (measured as the Jack Knife MSE) was 26.9% better than those obtained by the best model which does not consider the amino acid distribution and 19.5% better than the model based on the hydrophobic imbalance (HI). In addition, the best performance was obtained by a linear multivariable model based on the HI and the maximum. The difference between the experimental data and the prediction carried out by this model was smaller than those observed previously. In fact, this model obtained better predictive capacities than a previous linear multivariable model decreasing the Jack Knife MSE in 8.7%. In addition, this model allowed us to diminish the number of variables required, increasing, in this way, the degrees of freedom of the model.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.chroma.2005.12.033DOI Listing

Publication Analysis

Top Keywords

amino acid
16
protein surface
16
model based
12
model
11
proteins hydrophobic
8
hydrophobic interaction
8
interaction chromatography
8
statistical description
8
acid distribution
8
models based
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!