Prediction of protein allergenicity using local description of amino acid sequence.

Front Biosci

Data Mining Department, Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613.

Published: May 2008

The constant increase in atopic allergy and other hypersensitivity reactions has intensified the need for successful therapeutic approaches. Existing bioinformatic tools for predicting allergenic potential are primarily based on sequence similarity searches along the entire protein sequence and do not address the dual issues of conformational and overlapping B-cell epitope recognition sites. In this study, we report AllerPred, a computational system that is capable of capturing multiple overlapping continuous and discontinuous B-cell epitope binding patterns in allergenic proteins using SVM as its prediction engine. A novel representation of local protein sequence descriptors enables the system to model multiple overlapping continuous and discontinuous B-cell epitope binding patterns within a protein sequence. The model was rigorously trained and tested using 669 IUIS allergens and 1237 non-allergens. Testing results showed that the area under the receiver operating curve (AROC) of SVM models is 0.81 with 76 percent sensitivity at specificity of 76 percent . This approach consistently outperforms existing allergenicity prediction systems using a standardized testing dataset of experimentally validated allergens and non-allergen sequences.

Download full-text PDF

Source
http://dx.doi.org/10.2741/3138DOI Listing

Publication Analysis

Top Keywords

protein sequence
12
b-cell epitope
12
multiple overlapping
8
overlapping continuous
8
continuous discontinuous
8
discontinuous b-cell
8
epitope binding
8
binding patterns
8
sequence
5
prediction protein
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!