Non-H3 CDR template selection in antibody modeling through machine learning.

PeerJ

Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America.

Published: January 2019

Antibodies are proteins generated by the adaptive immune system to recognize and counteract a plethora of pathogens through specific binding. This adaptive binding is mediated by structural diversity in the six complementary determining region (CDR) loops (H1, H2, H3, L1, L2 and L3), which also makes accurate structural modeling of CDRs challenging. Both homology and modeling approaches have been used; to date, the former has achieved greater accuracy for the non-H3 loops. The homology modeling of non-H3 CDRs is more accurate because non-H3 CDR loops of the same length and type can be grouped into a few structural clusters. Most antibody-modeling suites utilize homology modeling for the non-H3 CDRs, differing only in the alignment algorithm and how/if they utilize structural clusters. While RosettaAntibody and SAbPred do not explicitly assign query CDR sequences to clusters, two other approaches, PIGS and Kotai Antibody Builder, utilize sequence-based rules to assign CDR sequences to clusters. While the manually curated sequence rules can identify better structural templates, because their curation requires extensive literature search and human effort, they lag behind the deposition of new antibody structures and are infrequently updated. In this study, we propose a machine learning approach (Gradient Boosting Machine [GBM]) to learn the structural clusters of non-H3 CDRs from sequence alone. The GBM method simplifies feature selection and can easily integrate new data, compared to manual sequence rule curation. We compare the classification results using the GBM method to that of RosettaAntibody in a 3-repeat 10-fold cross-validation (CV) scheme on the cluster-annotated antibody database PyIgClassify and we observe an improvement in the classification accuracy of the concerned loops from 84.5% ± 0.24% to 88.16% ± 0.056%. The GBM models reduce the errors in specific cluster membership misclassifications when the involved clusters have relatively abundant data. Based on the factors identified, we suggest methods that can enrich structural classes with sparse data to further improve prediction accuracy in future studies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6330961PMC
http://dx.doi.org/10.7717/peerj.6179DOI Listing

Publication Analysis

Top Keywords

homology modeling
12
non-h3 cdrs
12
structural clusters
12
non-h3 cdr
8
machine learning
8
cdr loops
8
modeling non-h3
8
cdr sequences
8
sequences clusters
8
gbm method
8

Similar Publications

Background: Christianson syndrome (CS) is an x-linked recessive neurodevelopmental and neurodegenerative condition characterized by severe intellectual disability, cerebellar degeneration, ataxia, and epilepsy. Mutations to the gene encoding NHE6 are responsible for CS, and we recently demonstrated that a mutation to the rat gene causes a similar phenotype in the spontaneous rat model, which exhibits cerebellar degeneration with motor dysfunction. In previous work, we used the PhP.

View Article and Find Full Text PDF

Computational identification of novel natural inhibitors against triple mutant DNA gyrase A in fluoroquinolone-resistant Typhimurium.

Biochem Biophys Rep

March 2025

Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology (VIT), Vellore, 632014, Tamil Nadu, India.

The rising resistance to fluoroquinolones in Typhimurium poses a significant global health challenge. This computational research addresses the pressing need for new therapeutic drugs by utilizing various computational tools to identify potential natural compounds that can inhibit the triple mutant DNA gyrase subunit A enzyme, which is crucial in fluoroquinolone resistance. Initially, the three-dimensional structure of the wild-type DNA gyrase A protein was modeled using homology modeling, and followed by mutagenesis to create the clinically relevant triple mutant (SER83PHE, ASP87GLY, ALA119SER) DNA gyrase A protein structure.

View Article and Find Full Text PDF

Introduction: The study of attention has been pivotal in advancing our comprehension of cognition. The goal of this study is to investigate which EEG data representations or features are most closely linked to attention, and to what extent they can handle the cross-subject variability.

Methods: We explore the features obtained from the univariate time series from a single EEG channel, such as time domain features and recurrence plots, as well as representations obtained directly from the multivariate time series, such as global field power or functional brain networks.

View Article and Find Full Text PDF

Metacaspases-Like Proteases of Trichomonas vaginalis: In Silico Identification and Characterization.

J Basic Microbiol

January 2025

Laboratorio de Bioquímica y Genética Molecular, Facultad de Química, Universidad Autónoma de Yucatán, Mérida, Yucatán, México.

Metacaspases (MCA), are cysteine-dependent proteases closely related to caspases. In protozoa, MCA plays an important role in programmed cell death (PCD). In Trichomonas vaginalis, a kind of PCD that resembles apoptosis has been described, but the activators of this mechanism have not been demonstrated.

View Article and Find Full Text PDF

The increasing challenge of marine biofouling, mainly due to barnacle settlement, necessitates the development of effective antifoulants with minimal environmental toxicity. In this study, fifteen derivatives of brusatol were synthesized and characterized using C-NMR, H-NMR, and mass spectrometry. All the semi-synthesized compounds obtained using the Multi-Target-Directed Ligand (MTDL) strategy, when evaluated as anti-settlement agents against barnacles, showed promising activity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!