Background: To further our understanding of immunopeptidomics, improved tools are needed to identify peptides presented by major histocompatibility complex class I (MHC-I). Many existing tools are limited by their reliance upon chemical affinity data, which is less biologically relevant than sampling by mass spectrometry, and other tools are limited by incomplete exploration of machine learning approaches. Herein, we assemble publicly available data describing human peptides discovered by sampling the MHC-I immunopeptidome with mass spectrometry and use this database to train random forest classifiers (ForestMHC) to predict presentation by MHC-I.

Results: As measured by precision in the top 1% of predictions, our method outperforms NetMHC and NetMHCpan on test sets, and it outperforms both these methods and MixMHCpred on new data from an ovarian carcinoma cell line. We also find that random forest scores correlate monotonically, but not linearly, with known chemical binding affinities, and an information-based analysis of classifier features shows the importance of anchor positions for our classification. The random-forest approach also outperforms a deep neural network and a convolutional neural network trained on identical data. Finally, we use our large database to confirm that gene expression partially determines peptide presentation.

Conclusions: ForestMHC is a promising method to identify peptides bound by MHC-I. We have demonstrated the utility of random forest-based approaches in predicting peptide presentation by MHC-I, assembled the largest known database of MS binding data, and mined this database to show the effect of gene expression on peptide presentation. ForestMHC has potential applicability to basic immunology, rational vaccine design, and neoantigen binding prediction for cancer immunotherapy. This method is publicly available for applications and further validation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6321722PMC
http://dx.doi.org/10.1186/s12859-018-2561-zDOI Listing

Publication Analysis

Top Keywords

peptide presentation
12
predicting peptide
8
major histocompatibility
8
histocompatibility complex
8
complex class
8
machine learning
8
identify peptides
8
tools limited
8
mass spectrometry
8
random forest
8

Similar Publications

A label-free, flexible, and disposable aptasensor was designed for the rapid on-site detection of vancomycin (VAN) levels. The electrochemical sensor was based on lab-printed carbon electrodes (C-PE) enriched with cauliflower-shaped gold nanostructures (AuNSs), on which VAN-specific aptamers were immobilized as biorecognition elements and short-chain thiols as blocking agents. The AuNSs, characterized by scanning electron microscopy (SEM) and atomic force microscopy (AFM), enhanced the electrochemical properties of the platform and the aptamer immobilization active sites.

View Article and Find Full Text PDF

[Solid, endometrial-like and transitional growth patterns of ovarian high-grade serous carcinoma: a clinicopathological analysis of 25 cases].

Zhonghua Bing Li Xue Za Zhi

February 2025

Department of Pathology, the Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu School, Nanjing Medical University, Suzhou 215002, China.

To investigate the clinicopathological characteristics of solid, endometrial-like and transitional (SET) cell growth subtype in high-grade serous ovarian carcinoma (HGSC). Clinical data of 25 cases of HGSC-SET were collected from January 2020 to March 2024 at the Affiliated Suzhou Hospital of Nanjing Medical University, and their histological features were analyzed. Immunohistochemical stains were used to analyze the expression of ER, PR, PAX8, WT-1, p16, p53 and Ki-67.

View Article and Find Full Text PDF

Background: α-Synuclein (α-Syn) pathology is present in 30-50 % of Alzheimer's disease (AD) patients, and its interactions with tau proteins may further exacerbate pathological changes in AD. However, the specific role of different aggregation forms of α-Syn in the progression of AD remains unclear.

Objectives: To explore the relationship between various aggregation types of CSF α-Syn and Alzheimer's disease progression.

View Article and Find Full Text PDF

Enzymatic hydrolysis approach is commonly employed for preparation of active peptides, while the limited purity and yield of produced peptides hinder further development of action mechanisms. This study presents the biotechnological approach for the efficient production of recombinant angiotensin converting enzyme (ACE) inhibitory peptide LYPVK and investigates its potential antihypertensive action mechanism. DNA encoding sequence of recombinant peptide was designed to form in tandem, which was expressed in Escherichia coli BL21 (DE3).

View Article and Find Full Text PDF

A Cell-penetrating bispecific antibody suppresses hepatitis B virus replication and secretion.

Virus Res

January 2025

Medical Research Center, Yuebei People's Hospital, Shantou University Medical College, 512025, Shaoguan, China; Shenzhen Immuthy Biotech Co., Ltd, 518107, Shenzhen, Guangdong, China. Electronic address:

Hepatitis B virus (HBV) represents one of the major pathogenic factor that leads to chronic liver diseases and the development of hepatocellular carcinoma (HCC). The currently approved anti-HBV drugs cannot eradicate the virus or block the development of HCC. HBV nucleocapsid consists of the hepatitis B core antigen (HBcAg) and the HBV relaxed-circular partially double-stranded DNA (rcDNA), indispensable in virus replication.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!