Motivation: Pre-trained protein language and/or structural models are often fine-tuned on drug development properties (i.e., developability properties) to accelerate drug discovery initiatives. However, these models generally rely on a single structural conformation and/or a single sequence as a molecular representation. We present a physics-based model in which 3D conformational ensemble representations are fused by a transformer-based architecture and concatenated with a language representation to predict antibody properties. Antibody language ensemble fusion (AbLEF) injects thermodynamic information directly into the latent space, which enhances property prediction by explicitly incorporating the dynamic molecular behavior that occurs during experimental measurement.
Results: We showcase the antibody language ensemble fusion model on two developability properties: hydrophobic interaction chromatography retention time and temperature of aggregation (Tagg). We find that (i) 3D conformational ensembles that are generated from molecular simulation can further improve antibody property prediction for small datasets, (ii) the performance benefit from 3D conformational ensembles matches shallow machine learning methods in the small data regime, and (iii) fine-tuned large protein language models can match smaller antibody-specific language models at predicting antibody properties.
Availability and Implementation: AbLEF codebase is available at https://github.com/merck/AbLEF.
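The fusion step described in the Motivation can be illustrated with a minimal sketch. This is not the AbLEF implementation; it only assumes that each conformer in the ensemble has already been embedded as a fixed-length vector and that a sequence-level language embedding is available. All names (`fuse_ensemble`, `predict_property`, the weight matrices, and the dimensions) are hypothetical, and the weights are random rather than trained.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def fuse_ensemble(conformers, w_q, w_k, w_v):
    """Fuse an ensemble with one self-attention layer, then mean-pool.

    conformers: array of shape (n_conformers, d_struct), one row per
    conformation (hypothetical structural embeddings).
    Returns a single vector of shape (d_attn,).
    """
    q = conformers @ w_q
    k = conformers @ w_k
    v = conformers @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])   # (n, n) pairwise scores
    attn = softmax(scores, axis=-1)           # each row sums to 1
    mixed = attn @ v                          # (n, d_attn)
    return mixed.mean(axis=0)                 # pool over conformers

def predict_property(conformers, lang_embedding, params):
    # Concatenate the fused ensemble vector with the language embedding,
    # then apply a linear regression head (assumed, for illustration).
    fused = fuse_ensemble(conformers, params["w_q"], params["w_k"], params["w_v"])
    joint = np.concatenate([fused, lang_embedding])
    return float(joint @ params["w_out"] + params["b_out"])

# Toy dimensions and random inputs; none of these values come from the paper.
rng = np.random.default_rng(0)
n_conf, d_struct, d_attn, d_lang = 8, 16, 12, 32
params = {
    "w_q": rng.normal(size=(d_struct, d_attn)),
    "w_k": rng.normal(size=(d_struct, d_attn)),
    "w_v": rng.normal(size=(d_struct, d_attn)),
    "w_out": rng.normal(size=(d_attn + d_lang,)),
    "b_out": 0.0,
}
conformers = rng.normal(size=(n_conf, d_struct))
lang_embedding = rng.normal(size=(d_lang,))

fused = fuse_ensemble(conformers, params["w_q"], params["w_k"], params["w_v"])
prediction = predict_property(conformers, lang_embedding, params)
```

Because the sketch uses no positional encoding and pools with a mean, the fused vector does not depend on the order in which conformers are listed, which seems a reasonable property for an unordered ensemble; whether the published model makes the same choice is not stated in the abstract.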
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11256947 | PMC |
| http://dx.doi.org/10.1093/bioinformatics/btae268 | DOI Listing |
Proc Natl Acad Sci U S A
January 2025
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139.
Protein language models (PLMs) have demonstrated impressive success in modeling proteins. However, general-purpose "foundational" PLMs have limited performance in modeling antibodies due to the latter's hypervariable regions, which do not conform to the evolutionary conservation principles that such models rely on. In this study, we propose a transfer learning framework called Antibody Mutagenesis-Augmented Processing (AbMAP), which fine-tunes foundational models for antibody-sequence inputs by supervising on antibody structure and binding specificity examples.
MAbs
December 2025
Department of Purification, Microbiology and Virology, Genentech Inc, South San Francisco, CA, USA.
In early-stage development of therapeutic monoclonal antibodies, assessment of the viability and ease of their purification typically requires extensive experimentation. However, the work required for upstream protein expression and downstream purification development often conflicts with timeline pressures and material constraints, limiting the number of molecules and process conditions that can reasonably be assessed. Recently, high-throughput batch-binding screen data along with improved molecular descriptors have enabled development of robust quantitative structure-property relationship (QSPR) models that predict monoclonal antibody chromatographic binding behavior from the amino acid sequence.
Objectives: The current gold standard for immunofluorescent (IF) visualization of neuromuscular junctions (NMJs) in muscle utilizes frozen tissue sections with fluorescent conjugated antibodies to demarcate neurons and IF alpha-bungarotoxin (α-BTX) to demarcate motor endplates. Frozen tissue sectioning comes with inherent inescapable limitations, including cryosectioning artifact and limited sample shelf-life. However, a parallel approach to identify NMJs in paraffin-embedded tissue sections has not been previously described.
Adv Sci (Weinh)
January 2025
Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, Guangdong, 510120, China.
Adeno-associated virus (AAV) vectors are a leading platform for gene therapy. Recently, AAV-mediated gene therapy in the inner ear has progressed from laboratory use to clinical trials, but the lower transduction rates in outer hair cells (OHCs) in the organ of Corti and in vestibular hair cells in adult mice still pose a challenge. OHCs are particularly vulnerable to inner ear insults.
J Vis Exp
December 2024
Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center; Department of Gynecologic Oncology and Reproductive Medicine, University of Texas MD Anderson Cancer Center.
The CUT&RUN technique facilitates detection of protein-DNA interactions across the genome. Typical applications of CUT&RUN include profiling changes in histone tail modifications or mapping transcription factor chromatin occupancy. Widespread adoption of CUT&RUN is driven, in part, by technical advantages over conventional ChIP-seq that include lower cell input requirements, lower sequencing depth requirements, and increased sensitivity with reduced background signal due to a lack of cross-linking agents that otherwise mask antibody epitopes.