Peptide-Major Histocompatibility Complex Class I Binding Prediction Based on Deep Learning With Novel Feature.

Front Genet

Department of Computer Science and Technology, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China.

Published: November 2019

AI Article Synopsis

  • Peptide-based vaccine development relies on accurately predicting how well peptide ligands bind to MHC I proteins, and current machine learning tools mostly use shallow neural networks for this task.
  • Recent research suggests that deep neural networks, specifically convolutional neural networks (CNNs), are more effective in learning from small datasets, which is crucial when only limited peptide data is available for some alleles.
  • By incorporating detailed features like sequence order, hydropathy index, and peptide length into a characteristic matrix for input, the proposed CNN-based approach outperforms traditional methods in predicting peptide-MHC binding affinity.

Article Abstract

Peptide-based vaccine development needs accurate prediction of the binding affinity between major histocompatibility complex I (MHC I) proteins and their peptide ligands. Nowadays more and more machine learning methods have been developed to predict binding affinity and some of them have become the popular tools. However most of them are designed by the shallow neural networks. Bengio said that deep neural networks can learn better fits with less data than shallow neural networks. In our case, some of the alleles only have dozens of peptide data. In addition, we transform each peptide into a characteristic matrix and input it into the model. As we know when dealing with the problem that the input is a matrix, convolutional neural network (CNN) can find the most critical features by itself. Obviously, compared with the traditional neural network model, CNN is more suitable for predicting binding affinity. Different from the previous studies which are based on blocks substitution matrix (BLOSUM), we used novel feature to do the prediction. Since we consider that the order of the sequence, hydropathy index, polarity and the length of the peptide could affect the binding affinity and the properties of these amino acids are key factors for their binding to MHC, we extracted these information from each peptide. In order to make full use of the data we have obtained, we have integrated different lengths of peptides into 15mer based on the binding mode of peptide to MHC I. In order to demonstrate that our method is reliable to predict peptide-MHC binding, we compared our method with several popular methods. The experiments show the superiority of our method.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892951PMC
http://dx.doi.org/10.3389/fgene.2019.01191DOI Listing

Publication Analysis

Top Keywords

binding affinity
16
neural networks
12
histocompatibility complex
8
binding
8
novel feature
8
shallow neural
8
neural network
8
peptide
6
neural
5
peptide-major histocompatibility
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!