Protein-protein interaction prediction based on ordinal regression and recurrent convolutional neural networks.

BMC Bioinformatics

Department of Computer Science and Technology, Tongji University, No. 4800 Caoan Road, Shanghai, 201804, China.

Published: October 2021

Background: Protein protein interactions (PPIs) are essential to most of the biological processes. The prediction of PPIs is beneficial to the understanding of protein functions and thus is helpful to pathological analysis, disease diagnosis and drug design etc. As the amount of protein data is growing fast in the post genomic era, high-throughput experimental methods are expensive and time-consuming for the prediction of PPIs. Thus, computational methods have attracted researcher's attention in recent years. A large number of computational methods have been proposed based on different protein sequence encoders.

Results: Notably, the confidence score of a protein sequence pair could be regarded as a kind of measurement to PPIs. The higher the confidence score for one protein pair is, the more likely the protein pair interacts. Thus in this paper, a deep learning framework, called ordinal regression and recurrent convolutional neural network (OR-RCNN) method, is introduced to predict PPIs from the perspective of confidence score. It mainly contains two parts: the encoder part of protein sequence pair and the prediction part of PPIs by confidence score. In the first part, two recurrent convolutional neural networks (RCNNs) with shared parameters are applied to construct two protein sequence embedding vectors, which can automatically extract robust local features and sequential information from the protein pairs. Based on it, the two embedding vectors are encoded into one novel embedding vector by element-wise multiplication. By taking the ordinal information behind confidence score into consideration, ordinal regression is used to construct multiple sub-classifiers in the second part. The results of multiple sub-classifiers are aggregated to obtain the final confidence score. Following that, the existence of PPIs is determined by the confidence score. We set a threshold [Formula: see text], and say the interaction exists between the protein pair if its confidence score is bigger than [Formula: see text].

Conclusions: We applied our method to predict PPIs on data sets S. cerevisiae and Homo sapiens. Through experimental verification, our method outperforms state-of-the-art PPI prediction models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501564PMC
http://dx.doi.org/10.1186/s12859-021-04369-0DOI Listing

Publication Analysis

Top Keywords

confidence score
32
protein sequence
16
ordinal regression
12
recurrent convolutional
12
convolutional neural
12
protein
12
prediction ppis
12
protein pair
12
regression recurrent
8
neural networks
8

Similar Publications

Background: Recently, there has been a surge in virtual reality (VR)-based training for upper limb (UL) rehabilitation, which has yielded mixed results. Therefore, we aimed to explore the effects of conventional therapy combined with VR-based training on UL dysfunction during post-stroke rehabilitation.

Methods: Studies published in English before May 2023 were retrieved from PubMed, Embase, and the Cochrane Library.

View Article and Find Full Text PDF

Impact of Panvascular Disease on Exercise Capacity and Clinical Outcomes in Patients with Heart Failure with Reduced Ejection Fraction.

CJC Open

December 2024

Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, China.

Background: The aim of this study was to assess the impact of panvascular disease (PVD) on quality of life (QOL), exercise capacity, and clinical outcomes, in patients with heart failure (HF) with reduced ejection fraction (HFrEF).

Methods: We performed a post hoc analysis of the Heart Failure: A Controlled Trial Investigating Outcomes of Exercise Training (HF-ACTION; NCT00047437). Patients with PVD were defined as those having coronary heart disease, stroke, or peripheral vascular disease at baseline.

View Article and Find Full Text PDF

Background: This study investigated the association between coronary artery calcification (CAC) and triglyceride glucose-body mass index (TyG-BMI) in patients receiving maintenance hemodialysis (MHD).

Methods: We used computed tomography (CT) to assess coronary artery calcification score (CACS) using the Agatston method. The TyG index was multiplied by BMI to derive the TyG-BMI index.

View Article and Find Full Text PDF

Background: Immune checkpoint inhibitors (ICIs) are effective in a subset of patients with metastatic solid tumors. However, the patients who would benefit most from ICIs in biliary tract cancer (BTC) are still controversial.

Materials And Methods: We molecularly characterized tissues and blood from 32 patients with metastatic BTC treated with the ICI pembrolizumab as second-line therapy.

View Article and Find Full Text PDF

CCfrag: scanning folding potential of coiled-coil fragments with AlphaFold.

Bioinform Adv

December 2024

Department of Protein Evolution, Max Planck Institute for Biology, Tübingen 72076, Germany.

Motivation: Coiled coils are a widespread structural motif consisting of multiple α-helices that wind around a central axis to bury their hydrophobic core. While AlphaFold has emerged as an effective coiled-coil modeling tool, capable of accurately predicting changes in periodicity and core geometry along coiled-coil stalks, it is not without limitations, such as the generation of spuriously bent models and the inability to effectively model globally non-canonical-coiled coils. To overcome these limitations, we investigated whether dividing full-length sequences into fragments would result in better models.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!