Structure-free antibody paratope similarity prediction for epitope binning via protein language models.

iScience

Biotechnology Discovery Research, Lilly Biotechnology Center, 10300 Campus Point Drive, San Diego, CA 92121, USA.

Published: February 2023

Antibodies are an important group of biological molecules that are used as therapeutics and diagnostic tools. Although millions of antibody sequences are available, identifying their structural and functional similarity and their antigen binding sites remains a challenge at large scale. Here, we present a fast, sequence-based computational method for antibody paratope prediction based on protein language models. The paratope information is then used to measure similarity among antibodies via protein language models. Our computational method enables binning of antibody discovery hits into groups as the function of epitope engagement. We further demonstrate the utility of the method by identifying antibodies targeting highly similar epitopes of the same antigens from a large pool of antibody sequences, using two case studies: SARS CoV2 Receptor Binding Domain (RBD) and Epidermal Growth Factor Receptor (EGFR). Our approach highlights the potential in accelerating antibody discovery by enhancing hit prioritization and diversity selection.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9941125PMC
http://dx.doi.org/10.1016/j.isci.2023.106036DOI Listing

Publication Analysis

Top Keywords

protein language
12
language models
12
antibody paratope
8
antibody sequences
8
computational method
8
antibody discovery
8
antibody
5
structure-free antibody
4
paratope similarity
4
similarity prediction
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!