Background: Recognizing protein interaction sites is important for understanding many biological processes and signaling pathways, and for drug design. However, most sites on protein sequences cannot be labeled as interface or non-interface sites, because only a small fraction of protein interactions has been identified. This limits the prediction accuracy and generalization ability of predictors of protein interaction sites. It is therefore necessary to improve prediction performance by using large amounts of unlabeled data together with small amounts of labeled data and background knowledge.

Results: In this work, three methods based on semi-supervised support vector machines are proposed to improve the prediction of protein interaction sites by incorporating information from unlabeled protein sites. Five features related to the evolutionary conservation of amino acids are extracted from the HSSP database and the ConSurf Server to represent the residues within a protein sequence: residue spatial sequence spectrum, residue sequence information entropy, residue sequence relative entropy, residue sequence conserved weight, and residue evolution rate. Three predictors are then built to identify interface residues on the protein surface, one for each of three semi-supervised support vector machine algorithms.
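Two of the conservation features named above, information entropy and relative entropy, can be illustrated on a single column of a multiple sequence alignment. The abstract does not give the exact formulas used, so the following is only a minimal sketch under stated assumptions: the standard Shannon entropy and Kullback-Leibler divergence in bits, a gap-free column, and a caller-supplied background distribution. The function names `column_entropy` and `column_relative_entropy` are hypothetical.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residue frequencies in one alignment column.

    Assumption: `column` is a gap-free string of one-letter amino acid codes.
    Low entropy means the position is highly conserved.
    """
    counts = Counter(column)
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def column_relative_entropy(column, background):
    """Kullback-Leibler divergence (in bits) of the column from a background.

    Assumption: `background` maps every residue seen in `column` to a
    strictly positive background frequency.
    """
    counts = Counter(column)
    total = sum(counts.values())
    return sum(
        (n / total) * math.log2((n / total) / background[aa])
        for aa, n in counts.items()
    )


# Illustrative uniform background over the 20 standard amino acids.
UNIFORM_BACKGROUND = {aa: 1 / 20 for aa in "ACDEFGHIKLMNPQRSTVWY"}
```

A fully conserved column such as `"AAAA"` has zero entropy, while its relative entropy against the uniform background is at its largest, which is why the two quantities carry complementary conservation signals.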

Conclusion: The experimental results demonstrate that the semi-supervised approaches can effectively improve the prediction of protein interaction sites when unlabeled information is incorporated into the predictors. The best of the three predictors achieves an accuracy of 70.7%, a sensitivity of 62.67% and a specificity of 78.72%. Compared with existing studies, the semi-supervised models show improved prediction performance.
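For reference, the three reported measures follow from the confusion matrix of a binary interface/non-interface classifier. The sketch below uses the standard definitions and assumes labels are encoded as 1 (interface) and 0 (non-interface) and that both classes occur in the true labels. The function name `binary_metrics` is hypothetical, and the code does not reproduce the paper's evaluation pipeline.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for 0/1 labels.

    Assumption: 1 marks an interface residue, 0 a non-interface residue,
    and y_true contains at least one example of each class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),  # fraction of interface residues recovered
        "specificity": tn / (tn + fp),  # fraction of non-interface residues recovered
    }
```

Reporting sensitivity and specificity alongside accuracy matters here because interface residues are typically the minority class, so accuracy alone can look acceptable even when few interface residues are found.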

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929468
DOI: http://dx.doi.org/10.1186/s12859-019-3274-7
