GRL-PUL: predicting microbe-drug association based on graph representation learning and positive unlabeled learning.

Mol Omics

School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China.

Published: November 2024

Extensive research has confirmed the widespread presence of microorganisms in the human body and their crucial impact on human health, with drugs being an effective method of regulation. Hence it is essential to identify potential microbe-drug associations (MDAs). Owing to the limitations of wet experiments, such as high costs and long durations, computational methods for binary classification tasks have become valuable alternatives for traditional experimental approaches. Since validated negative MDAs are absent in existing datasets, most methods randomly sample negatives from unlabeled data, which evidently leads to false negative issues. In this manuscript, we propose a novel model based on graph representation learning and positive-unlabeled learning (GRL-PUL), to infer potential MDAs. Firstly, we screen reliable negative samples by applying weighted matrix factorization and the PU-bagging strategy on the known microbe-drug bipartite network. Then, we combine muti-model attributes and constructed a microbe-drug heterogeneous network. After that, graph attention auto-encoder module, an encoder combining graph convolutional networks and graph attention networks, is introduced to extract informative embeddings based on the microbe-drug heterogeneous network. Lastly, we adopt a modified random forest as the final classifier. Comparison experiments with five baseline models on three benchmark datasets show that our model surpasses other methods in terms of the AUC, AUPR, ACC, F1-score and MCC. Moreover, several case studies show that GRL-PUL could capably predict latent MDAs. Notably, we further verify the effectiveness of a reliable negative sample selection module by migrating it to other state-of-the-art models, and the experimental results demonstrate its ability to substantially improve their prediction performance.

Download full-text PDF	Source
http://dx.doi.org/10.1039/d4mo00117f	DOI Listing

Publication Analysis

Top Keywords

based graph

graph representation

representation learning

reliable negative

microbe-drug heterogeneous

heterogeneous network

graph attention

microbe-drug

graph

grl-pul predicting

Similar Publications

GraphkmerDTA: integrating local sequence patterns and topological information for drug-target binding affinity prediction and applications in multi-target anti-Alzheimer's drug discovery.

Mol Divers

January 2025

Key Laboratory of Prevention and Treatment of Cardiovascular and Cerebrovascular Diseases Ministry of Education, Jiangxi Province Key Laboratory of Biomaterials and Biofabrication for Tissue Engineering, Gannan Medical University, Ganzhou, 341000, Jiangxi, China.

Zuolong Zhang Gang Luo Yixuan Ma Zhaoqi Wu Shuo Peng

Identifying drug-target binding affinity (DTA) plays a critical role in early-stage drug discovery. Despite the availability of various existing methods, there are still two limitations. Firstly, sequence-based methods often extract features from fixed length protein sequences, requiring truncation or padding, which can result in information loss or the introduction of unwanted noise.

View Article and Find Full Text PDF

Similar Publications

ModeHunter: A Package for the Reductionist Analysis, Animation, and Application of Elastic Biomolecular Motion.

J Phys Chem B

January 2025

Department of Physiology and Biophysics, Weill Cornell Medical College, New York, New York 10065, United States.

Willy Wriggers Maytha Alshammari Joseph N Stember Sebastian Stolzenberg Essam Metwally

ModeHunter is a modular Python software package for the simulation of 3D biophysical motion across spatial resolution scales using modal analysis of elastic networks. It has been curated from our in-house Python scripts over the last 15 years, with a focus on detecting similarities of elastic motion between atomic structures, coarse-grained graphs, and volumetric data obtained from biophysical or biomedical imaging origins, such as electron microscopy or tomography. With ModeHunter, normal modes of biophysical motion can be analyzed with various static visualization techniques or brought to life by dynamics animation in terms of single or multimode trajectories or decoy ensembles.

View Article and Find Full Text PDF

Similar Publications

Altered Connectome Topology in Newborns at Risk for Cognitive Developmental Delay: A Cross-Etiologic Study.

Hum Brain Mapp

January 2025

Center for MR Research, University Children's Hospital Zurich, Zurich, Switzerland.

Anna Speckert Kelly Payette Walter Knirsch Michael von Rhein Patrice Grehten

The human brain connectome is characterized by the duality of highly modular structure and efficient integration, supporting information processing. Newborns with congenital heart disease (CHD), prematurity, or spina bifida aperta (SBA) constitute a population at risk for altered brain development and developmental delay (DD). We hypothesize that, independent of etiology, alterations of connectomic organization reflect neural circuitry impairments in cognitive DD.

View Article and Find Full Text PDF

Similar Publications

High-recombining genomic regions affect demography inference based on ancestral recombination graphs.

Genetics

January 2025

Max Planck Research Group Behavioural Genomics, Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, 24306 Plön, Germany.

Jun Ishigohoka Miriam Liedvogel

Multiple methods of demography inference are based on the ancestral recombination graph. This powerful approach uses observed mutations to model local genealogies changing along chromosomes by historical recombination events. However, inference of underlying genealogies is difficult in regions with high recombination rate relative to mutation rate due to the lack of mutations representing genealogies.

View Article and Find Full Text PDF

Similar Publications

Unraveling the Differential Efficiency of Dorsal and Ventral Pathways in Visual Semantic Decoding.

Int J Neural Syst

January 2025

Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, P. R. China.

Wei Huang Ying Tang Sizhuo Wang Jingpeng Li Kaiwen Cheng

Visual semantic decoding aims to extract perceived semantic information from the visual responses of the human brain and convert it into interpretable semantic labels. Although significant progress has been made in semantic decoding across individual visual cortices, studies on the semantic decoding of the ventral and dorsal cortical visual pathways remain limited. This study proposed a graph neural network (GNN)-based semantic decoding model on a natural scene dataset (NSD) to investigate the decoding differences between the dorsal and ventral pathways in process various parts of speech, including verbs, nouns, and adjectives.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!