Sequence-based prediction of protein crystallization, purification and production propensity.

Bioinformatics

Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada.

Published: July 2011

Motivation: X-ray crystallography-based protein structure determination, which accounts for the majority of solved structures, is characterized by relatively low success rates. One solution is to build tools that support the selection of targets that are more likely to crystallize. Several in silico methods that predict the propensity for diffraction-quality crystallization from protein chains have been developed. We show that the quality of their predictions drops when they are applied to more recent crystallization trials, which calls for new solutions. We propose a novel approach that alleviates the drawbacks of the existing methods by using a recent dataset and an improved protocol to annotate progress along the crystallization process, by predicting both the success of the entire process and the steps at which attempts fail, and by utilizing a compact and comprehensive set of sequence-derived inputs to generate accurate predictions.

Results: The proposed PPCpred (predictor of protein Production, Purification and Crystallization) predicts the propensity for production of diffraction-quality crystals, production of crystals, purification and production of the protein material. PPCpred utilizes a comprehensive set of inputs based on energy and hydrophobicity indices, composition of certain amino acid types, predicted disorder, secondary structure and solvent accessibility, and content of certain buried and exposed residues. Our method significantly outperforms alignment-based predictions and several modern crystallization propensity predictors. Receiver operating characteristic (ROC) curves show that PPCpred is particularly useful for users who desire high true positive (TP) rates, i.e. a low rate of mispredictions for solvable chains. Our model reveals several intuitive factors that influence the success of individual steps and the entire crystallization process, including the content of Cys, buried His and Ser, hydrophobic/hydrophilic segments and the number of predicted disordered segments.
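
As a rough illustration of what sequence-derived inputs of this kind can look like, the sketch below computes two simple descriptors from a protein chain: the fraction of selected residue types (for example Cys) and the mean hydropathy. It is not the PPCpred feature set. The Kyte-Doolittle scale is an assumed stand-in, since the abstract does not name the hydrophobicity index used, and the function names `residue_fraction`, `mean_hydropathy` and `simple_features` are hypothetical.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale. ASSUMPTION: chosen for illustration only;
# the abstract does not say which hydrophobicity index PPCpred uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def residue_fraction(seq, residues):
    """Return the fraction of the chain made up of each requested residue type."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    counts = Counter(seq)
    return {r: counts[r] / len(seq) for r in residues}


def mean_hydropathy(seq):
    """Return the average hydropathy over the 20 standard residues in the chain."""
    values = [KYTE_DOOLITTLE[a] for a in seq.upper() if a in KYTE_DOOLITTLE]
    if not values:
        raise ValueError("no standard residues in sequence")
    return sum(values) / len(values)


def simple_features(seq):
    """Collect a few illustrative sequence-derived descriptors into one dict."""
    features = {f"frac_{r}": v for r, v in residue_fraction(seq, "CHS").items()}
    features["mean_hydropathy"] = mean_hydropathy(seq)
    return features
```

Calling `simple_features` on a chain returns a small dictionary of numbers that could be passed to any standard classifier. Descriptors that depend on predicted disorder, secondary structure or solvent accessibility would additionally require the output of dedicated predictors, which this sketch does not cover.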

Availability: http://biomine.ece.ualberta.ca/PPCpred/.

Contact: lkurgan@ece.ualberta.ca.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117383
DOI: http://dx.doi.org/10.1093/bioinformatics/btr229
