Learned features of antibody-antigen binding affinity.

Front Mol Biosci

Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, United States.

Published: February 2023

AI Article Synopsis

  • Identifying predictors of antibody binding affinity is crucial for developing effective therapeutic antibodies, but the diversity in antibody structures makes this difficult.
  • Researchers utilized the structural antibody database (SAbDab) to explore features that differentiate high- and low-binding affinities over a wide range.
  • The study found that simple features based on contact counts can perform just as well as complex features, and combining all feature sets yields the best results, but classification performance plateaus, indicating a need for more structural data to improve predictions.

Article Abstract

Defining predictors of antigen-binding affinity of antibodies is valuable for engineering therapeutic antibodies with high binding affinity to their targets. However, this task is challenging owing to the huge diversity in the conformations of the complementarity determining regions of antibodies and the mode of engagement between antibody and antigen. In this study, we used the structural antibody database (SAbDab) to identify features that can discriminate high- and low-binding affinity across a 5-log scale. First, we abstracted features based on previously learned representations of protein-protein interactions to derive 'complex' feature sets, which include energetic, statistical, network-based, and machine-learned features. Second, we contrasted these complex feature sets with additional 'simple' feature sets based on counts of contacts between antibody and antigen. By investigating the predictive potential of 700 features contained in the eight complex and simple feature sets, we observed that simple feature sets perform comparably to complex feature sets in classification of binding affinity. Moreover, combining features from all eight feature-sets provided the best classification performance (median cross-validation AUROC and F1-score of 0.72). Of note, classification performance is substantially improved when several sources of data leakage (e.g., homologous antibodies) are not removed from the dataset, emphasizing a potential pitfall in this task. We additionally observe a classification performance plateau across diverse featurization approaches, highlighting the need for additional affinity-labeled antibody-antigen structural data. The findings from our present study set the stage for future studies aimed at multiple-log enhancement of antibody affinity through feature-guided engineering.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9989197PMC
http://dx.doi.org/10.3389/fmolb.2023.1112738DOI Listing

Publication Analysis

Top Keywords

feature sets
24
binding affinity
12
classification performance
12
antibody antigen
8
complex feature
8
simple feature
8
affinity
6
feature
6
sets
6
features
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!