On the viability of unsupervised T-cell receptor sequence clustering for epitope preference.

Bioinformatics

Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS).

Published: May 2019

Motivation: The T-cell receptor (TCR) is responsible for recognizing epitopes presented on cell surfaces. Linking TCR sequences to their ability to target specific epitopes is currently an unsolved problem, yet one of great interest. Indeed, it is currently unknown how dissimilar TCR sequences can be before they no longer bind the same epitope. This question is confounded by the fact that there are many ways to define the similarity between two TCR sequences. Here we investigate both issues in the context of TCR sequence unsupervised clustering.

Results: We provide an overview of the performance of various distance metrics on two large independent datasets with 412 and 2835 TCR sequences respectively. Our results confirm the presence of structural distinct TCR groups that target identical epitopes. In addition, we put forward several recommendations to perform unsupervised T-cell receptor sequence clustering.

Availability And Implementation: Source code implemented in Python 3 available at https://github.com/pmeysman/TCRclusteringPaper.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty821DOI Listing

Publication Analysis

Top Keywords

tcr sequences
16
t-cell receptor
12
unsupervised t-cell
8
receptor sequence
8
tcr
7
viability unsupervised
4
sequence clustering
4
clustering epitope
4
epitope preference
4
preference motivation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!