Motivation: Protein function prediction, based on the patterns of connection in a protein-protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-based low-dimensional embeddings that tend to place functionally similar proteins into coherent spatial regions. However, these approaches lose valuable local graph structure from the network when considering only the embedding. We introduce GLIDER, a method that replaces a protein-protein interaction or association network with a new graph-based similarity network. GLIDER is based on a variant of our previous GLIDE method, which was designed to predict missing links in protein-protein association networks, capturing implicit local and global (i.e. embedding-based) graph properties.

Results: GLIDER outperforms competing methods on the task of predicting GO functional labels in cross-validation on a heterogeneous collection of four human protein-protein association networks derived from the 2016 DREAM Disease Module Identification Challenge, and also on three different protein-protein association networks built from the STRING database. We show that this is due to the strong functional enrichment that is present in the local GLIDER neighborhood in multiple different types of protein-protein association networks. Furthermore, we introduce the GLIDER graph neighborhood as a way for biologists to visualize the local neighborhood of a disease gene. As an application, we look at the local GLIDER neighborhoods of a set of known Parkinson's Disease GWAS genes, rediscover many genes which have known involvement in Parkinson's disease pathways, plus suggest some new genes to study.

Availability And Implementation: All code is publicly available and can be accessed here: https://github.com/kap-devkota/GLIDER.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237677PMC
http://dx.doi.org/10.1093/bioinformatics/btac322DOI Listing

Publication Analysis

Top Keywords

protein-protein association
16
association networks
16
function prediction
8
protein-protein interaction
8
interaction association
8
association network
8
introduce glider
8
local glider
8
parkinson's disease
8
glider
7

Similar Publications

PEDV is a highly contagious enteric pathogen that can cause severe diarrhea and death in neonatal pigs. Despite extensive research, the molecular mechanisms of host's response to PEDV infection remain unclear. In this study, differentially expressed genes (DEGs), time-specific coexpression modules, and key regulatory genes associated with PEDV infection were identified.

View Article and Find Full Text PDF

There is growing evidence that programmed cell death plays a significant role in the pathogenesis of chronic thromboembolic pulmonary hypertension (CTEPH). Anoikis is a newly discovered type of programmed death and has garnered great attention. However, the precise involvement of Anoikis in the progression of CTEPH remains poorly understood.

View Article and Find Full Text PDF

Bulk and Single-Cell Transcriptome Analyses Unravel Gene Signatures of Mitochondria-Associated Programmed Cell Death in Diabetic Foot Ulcer.

J Cell Mol Med

December 2024

Department of Orthopedics, Shenshan Medical Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Shanwei, Guangdong, P. R. China.

Mitochondrial programmed cell death (PCD) plays a critical role in the pathogenesis of diabetic foot ulcers (DFU). In this study, we performed a comprehensive transcriptome analysis to identify potential hub genes and key cell types associated with PCD and mitochondria in DFU. Using intersection analysis of PCD- and mitochondria-related genes, we identified candidate hub genes through protein-protein interaction and random forest analysis.

View Article and Find Full Text PDF

The duck industry is vital for supplying high-quality protein, making research into the development of duck skeletal muscle critical for improving meat and egg production. In this study, we leveraged Oxford Nanopore Technologies (ONT) sequencing to perform full-length transcriptome sequencing of myoblasts harvested from the leg muscles of duck embryos at embryonic day 13 (E13), specifically examining both the proliferative (GM) and differentiation (DM) phases. Our analysis identified a total of 5797 novel transcripts along with 2332 long non-coding RNAs (lncRNAs), revealing substantial changes in gene expression linked to muscle development.

View Article and Find Full Text PDF

Background: Extracellular vesicles are easily accessible in various biofluids and allow the assessment of disease-related changes in the proteome. This has made them a promising target for biomarker studies, especially in the field of neurodegeneration where access to diseased tissue is very limited. Genetic variants in the LRRK2 gene have been linked to both familial and sporadic forms of Parkinson's disease.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!