Heavy chain sequence-based classifier for the specificity of human antibodies.

Yaqi Wang Guoqin Mai Min Zou Haoyu Long Yao-Qing Chen Litao Sun Dechao Tian Yang Zhao Guozhi Jiang Zicheng Cao Xiangjun Du

Brief Bioinform

School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.

Published: January 2022

Antibodies specifically bind to antigens and are an essential part of the immune system. Hence, antibodies are powerful tools in research and diagnostics. High-throughput sequencing technologies have promoted comprehensive profiling of the immune repertoire, which has resulted in large amounts of antibody sequences that remain to be further analyzed. In this study, antibodies were downloaded from IMGT/LIGM-DB and Sequence Read Archive databases. Contributing features from antibody heavy chains were formulated as numerical inputs and fed into an ensemble machine learning classifier to classify the antigen specificity of six classes of antibodies, namely anti-HIV-1, anti-influenza virus, anti-pneumococcal polysaccharide, anti-citrullinated protein, anti-tetanus toxoid and anti-hepatitis B virus. The classifier was validated using cross-validation and a testing dataset. The ensemble classifier achieved a macro-average area under the receiver operating characteristic curve (AUC) of 0.9246 from the 10-fold cross-validation, and 0.9264 for the testing dataset. Among the contributing features, the contribution of the complementarity-determining regions was 53.1% and that of framework regions was 46.9%, and the amino acid mutation rates occupied the first and second ranks among the top five contributing features. The classifier and insights provided in this study could promote the mechanistic study, isolation and utilization of potential therapeutic antibodies.

Download full-text PDF	Source
http://dx.doi.org/10.1093/bib/bbab516	DOI Listing

Publication Analysis

Top Keywords

contributing features

testing dataset

antibodies

classifier

heavy chain

chain sequence-based

sequence-based classifier

classifier specificity

specificity human

human antibodies

Similar Publications

Generative Adversarial Networks for Neuroimage Translation.

J Comput Biol

December 2024

Electrical, Computer and Biomedical Engineering, Toronto Metropolitan University, Toronto, Canada.

Cassandra Czobit Reza Samavi

Image-to-image translation has gained popularity in the medical field to transform images from one domain to another. Medical image synthesis via domain transformation is advantageous in its ability to augment an image dataset where images for a given class are limited. From the learning perspective, this process contributes to the data-oriented robustness of the model by inherently broadening the model's exposure to more diverse visual data and enabling it to learn more generalized features.

View Article and Find Full Text PDF

Similar Publications

GeniePool 2.0: advancing variant analysis through CHM13-T2T, AlphaMissense, gnomAD V4 integration, and variant co-occurrence queries.

Database (Oxford)

December 2024

The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel.

Grisha Weintraub Noam Hadar Ehud Gudes Shlomi Dolev Ohad S Birk

Originally developed to meet the challenges of genomic data deluge, GeniePool emerged as a pioneering platform, enabling efficient storage, accessibility, and analysis of vast genomic datasets, enabled due to its data lake architecture. Building on this foundation, GeniePool 2.0 advances genomic analysis through the integration of cutting-edge variant databases, such as CHM13-T2T, AlphaMissense, and gnomAD V4, coupled with the capability for variant co-occurrence queries.

View Article and Find Full Text PDF

Similar Publications

The role of KRT18 in lung adenocarcinoma development: integrative bioinformatics and experimental validation.

Discov Oncol

December 2024

School of Pharmacy, Shaoyang University, Shaoyang, 422000, Hunan, China.

Yongjie Li Min Zeng Yinan Qin Fen Feng Hailiang Wei

Lung adenocarcinoma (LUAD) represents one of the most common subtypes of lung cancer with high rates of incidence and mortality, which contributes to substantial health and economic demand across the globe. Treatment today mainly consists of surgery, radiotherapy, and chemotherapy, but their efficacy in advanced stages is often suboptimal and emphasizes the clear need for new biomarkers and therapeutic targets. Using comprehensive bioinformatics analyses consisting of the Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO), Human Protein Atlas (HPA) and Clinical Proteomic Tumor Analysis Consortium (CPTAC), immune infiltration analysis and functional enrichment analysis, and single-cell analysis, we examined the potential of keratin 18 (KRT18) as a candidate biomarker in advanced LUAD.

View Article and Find Full Text PDF

Similar Publications

Exploring Gene Expression and Alternative Splicing in Duck Embryonic Myoblasts via Full-Length Transcriptome Sequencing.

Vet Sci

November 2024

College of Animal Science and Technology, Jiangxi Agricultural University, Nanchang 330045, China.

Jintao Wu Shuibing Liu Dongcheng Jiang Ya'nan Zhou Hongxia Jiang

The duck industry is vital for supplying high-quality protein, making research into the development of duck skeletal muscle critical for improving meat and egg production. In this study, we leveraged Oxford Nanopore Technologies (ONT) sequencing to perform full-length transcriptome sequencing of myoblasts harvested from the leg muscles of duck embryos at embryonic day 13 (E13), specifically examining both the proliferative (GM) and differentiation (DM) phases. Our analysis identified a total of 5797 novel transcripts along with 2332 long non-coding RNAs (lncRNAs), revealing substantial changes in gene expression linked to muscle development.

View Article and Find Full Text PDF

Similar Publications

TIPE drives a cancer stem-like phenotype by promoting glycolysis via PKM2/HIF-1α axis in melanoma.

Elife

December 2024

Center of Translational Medicine, Zibo Central Hospital Affiliated to Binzhou Medical University, Zibo, China.

Maojin Tian Le Yang Ziqian Zhao Jigang Li Lianqing Wang

TIPE () has been identified as an oncogene and participates in tumor biology. However, how its role in the metabolism of tumor cells during melanoma development remains unclear. Here, we demonstrated that TIPE promoted glycolysis by interacting with pyruvate kinase M2 (PKM2) in melanoma.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!