Focused learning by antibody language models using preferential masking of non-templated regions.

bioRxiv

Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037 USA.

Published: October 2024

Existing antibody language models () are pre-trained using a masked language modeling () objective with uniform masking probabilities. While these models excel at predicting germline residues, they often struggle with mutated and non-templated residues, which are crucial for antigen-binding specificity and concentrate in the complementarity-determining regions (). Here, we demonstrate that preferential masking of the non-templated CDR3 is a compute-efficient strategy to enhance model performance. We pre-trained two antibody LMs () using either uniform or preferential masking and observed that the latter improves residue prediction accuracy in the highly variable CDR3. Preferential masking also improves antibody classification by native chain pairing and binding specificity, suggesting improved CDR3 understanding and indicating that non-random, learnable patterns help govern antibody chain pairing. We further show that specificity classification is largely informed by residues in the CDRs, demonstrating that AbLMs learn meaningful patterns that align with immunological understanding.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11565838PMC
http://dx.doi.org/10.1101/2024.10.23.619908DOI Listing

Publication Analysis

Top Keywords

preferential masking
16
antibody language
8
language models
8
masking non-templated
8
chain pairing
8
antibody
5
masking
5
focused learning
4
learning antibody
4
preferential
4

Similar Publications

Chemotherapies remain standard therapy for cancers but have limited efficacy and cause significant side effects, highlighting the need for targeted approaches. In the progression of cancer, tumors increase matrix metalloproteinase (MMP) activity. Leveraging and therapeutically redirecting tumor MMPs through activatable cell-penetrating peptide (ACPP) technology offers new approaches for tumor-selective drug delivery and for studying how drug payloads engage the tumor immune microenvironment.

View Article and Find Full Text PDF

Identification of plasma proteins binding oxidized phospholipids using pull-down proteomics and OxLDL masking assay.

J Lipid Res

November 2024

Department of Pharmaceutical Chemistry, Institute of Pharmaceutical Sciences, University of Graz, Graz, Austria. Electronic address:

Article Synopsis
  • Oxidized phospholipids (OxPLs) are recognized as harmful substances that promote inflammation, highlighting the need to understand how they can be detoxified by various plasma proteins.
  • Researchers conducted pull-down-proteomic analysis to identify around 150 non-immunoglobulin proteins that specifically bind to oxidized phospholipids, particularly OxPAPC.
  • The study confirmed that these proteins, alongside known oxidized phospholipid-binding proteins, can effectively mask OxPLs, potentially influencing their recognition by the immune system.
View Article and Find Full Text PDF

Existing antibody language models () are pre-trained using a masked language modeling () objective with uniform masking probabilities. While these models excel at predicting germline residues, they often struggle with mutated and non-templated residues, which are crucial for antigen-binding specificity and concentrate in the complementarity-determining regions (). Here, we demonstrate that preferential masking of the non-templated CDR3 is a compute-efficient strategy to enhance model performance.

View Article and Find Full Text PDF

DTI Analysis of the Peritumoral Zone of Diffuse Low-grade Gliomas in Progressing Patients.

World Neurosurg

November 2024

Université de Lorraine, Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Automatique de Nancy (CRAN), Nancy, France; Université de Lorraine, CHRU-Nancy, Service de Neurochirurgie, Nancy, France.

Article Synopsis
  • DLGGs are rare brain tumors that can progress even after treatment, complicating effective planning for surgery and other therapies due to their infiltration into white matter tracts.
  • The study involved five patients and analyzed DTI signals alongside traditional imaging methods to identify changes associated with tumor progression over a year.
  • Results indicated that prior to visible changes on standard imaging, abnormal DTI signals in the tumor's periphery could indicate early tumor infiltration, suggesting a potential tool for predicting tumor behavior and adjusting treatment plans.
View Article and Find Full Text PDF

CD47 is a cell surface glycoprotein that is expressed on normal human tissues and has a key role as a marker of self. Tumor cells have coopted CD47 overexpression to evade immune surveillance and thus blockade of CD47 is a highly active area of clinical exploration in oncology. However, clinical development of CD47-targeted agents has been complicated by its robust expression in normal tissues and the toxicities that arise from blocking this inhibitory signal.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!