ESMSec: Prediction of Secreted Proteins in Human Body Fluids Using Protein Language Models and Attention.

Int J Mol Sci

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China.

Published: June 2024

Secreted proteins in human body fluids are potential disease biomarkers that can support early diagnosis and risk prediction, so studying them has considerable practical value. In recent years, deep-learning-based transformer language models have been carried over from natural language processing (NLP) to proteomics, leading to the development of protein language models (PLMs) for protein sequence representation. Here, we propose a deep learning framework called ESM Predict Secreted Proteins (ESMSec) to predict proteins secreted into three types of human body fluid. ESMSec is based on the ESM2 model and an attention architecture. First, protein sequences are fed into the ESM2 model to extract features from its last hidden layer, and every input protein is encoded as a fixed 1000 × 480 matrix. Second, multi-head attention followed by a fully connected neural network serves as the classifier, performing binary classification of whether each protein is secreted into a given body fluid. Our experiments used three important and ubiquitous human body fluids. Experimental results show that ESMSec achieved average accuracies of 0.8486, 0.8358, and 0.8325 on the testing datasets for plasma, cerebrospinal fluid (CSF), and seminal fluid, respectively, which on average outperform state-of-the-art (SOTA) methods. These results indicate that ESM can improve the prediction performance of the model and has great potential for screening the secretion information of human body fluid proteins.
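The two-stage design described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the per-residue ESM2 embeddings are assumed to be already computed, and the zero-padding/truncation rule, the head count, the mean pooling, the hidden width, and all function and parameter names (`pad_or_truncate`, `predict_secreted`, `init_params`) are assumptions made for illustration. Only the 1000 × 480 shape, the use of multi-head attention, and the fully connected binary classifier come from the abstract.

```python
import numpy as np

MAX_LEN = 1000   # fixed sequence length stated in the abstract
EMBED_DIM = 480  # per-residue feature width stated in the abstract
NUM_HEADS = 8    # assumption: the head count is not given in the abstract


def pad_or_truncate(embedding, max_len=MAX_LEN):
    """Assumed rule: cut long proteins at max_len, zero-pad short ones."""
    length, dim = embedding.shape
    fixed = np.zeros((max_len, dim), dtype=embedding.dtype)
    keep = min(length, max_len)
    fixed[:keep] = embedding[:keep]
    return fixed


def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)


def multi_head_self_attention(x, p, num_heads=NUM_HEADS):
    """Scaled dot-product self-attention over residues (padding not masked)."""
    seq_len, dim = x.shape
    head_dim = dim // num_heads

    def split(m):  # (seq, dim) -> (heads, seq, head_dim)
        return m.reshape(seq_len, num_heads, head_dim).transpose(1, 0, 2)

    q, k, v = split(x @ p["w_q"]), split(x @ p["w_k"]), split(x @ p["w_v"])
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)
    context = softmax(scores) @ v
    merged = context.transpose(1, 0, 2).reshape(seq_len, dim)
    return merged @ p["w_o"]


def predict_secreted(x, p, num_heads=NUM_HEADS):
    """Return a probability that the protein is secreted into one body fluid."""
    attended = multi_head_self_attention(x, p, num_heads)
    pooled = attended.mean(axis=0)                      # assumed pooling step
    hidden = np.maximum(0.0, pooled @ p["w1"] + p["b1"])  # ReLU layer
    logit = hidden @ p["w2"] + p["b2"]
    return float(1.0 / (1.0 + np.exp(-logit)))


def init_params(rng, dim=EMBED_DIM, hidden=64):
    """Random, untrained weights; a real model would learn these."""
    scale = 1.0 / np.sqrt(dim)
    p = {name: rng.normal(scale=scale, size=(dim, dim))
         for name in ("w_q", "w_k", "w_v", "w_o")}
    p["w1"] = rng.normal(scale=scale, size=(dim, hidden))
    p["b1"] = np.zeros(hidden)
    p["w2"] = rng.normal(scale=1.0 / np.sqrt(hidden), size=hidden)
    p["b2"] = 0.0
    return p
```

Under these assumptions, one such classifier would be trained per body fluid (plasma, CSF, seminal fluid), and calling `predict_secreted(pad_or_truncate(embedding), params)` on a `(length, 480)` embedding yields a value between 0 and 1. With the untrained weights from `init_params` the output carries no biological meaning; the sketch only shows how the shapes flow through the pipeline.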

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11204320
http://dx.doi.org/10.3390/ijms25126371
