BioNet: a large-scale and heterogeneous biological network model for interaction prediction with graph convolution.

Xi Yang, Wei Wang, Jing-Lun Ma, Yan-Long Qiu, Kai Lu, Dong-Sheng Cao, Cheng-Kun Wu

Brief Bioinform

Institute for Quantum Information & State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China.

Published: January 2022

Understanding chemical-gene interactions (CGIs) is essential for drug screening, and while wet lab experiments are tedious and costly, computational methods offer a more efficient approach for large-scale analysis.
The study introduces BioNet, a deep biological network model that uses a graph encoder-decoder architecture to predict interactions between chemicals and genes, leveraging a large dataset that includes over 79,000 entities and more than 34 million relations.
BioNet achieves a best area under the ROC curve of 0.952, and its top predicted CGIs for cancer and COVID-19 were verified against external curated data and published literature.

Motivation: Understanding chemical-gene interactions (CGIs) is crucial for screening drugs. Wet-lab experiments are usually costly and laborious, which limits relevant studies to a small scale. In contrast, computational studies enable efficient in-silico exploration. For the CGI prediction problem, a common method is to perform systematic analyses on a heterogeneous network involving various biomedical entities. Recently, graph neural networks have become popular in the field of relation prediction. However, the inherent heterogeneity and complexity of biological interaction networks and the massive amount of data pose enormous challenges. This paper aims to develop a data-driven model that is capable of learning latent information from the interaction network and making correct predictions.

Results: We developed BioNet, a deep biological network model with a graph encoder-decoder architecture. The graph encoder utilizes graph convolution to learn latent information embedded in complex interactions among chemicals, genes, diseases and biological pathways. The learning process consists of two consecutive steps. The embedded information learnt by the encoder is then employed to make multi-type interaction predictions between chemicals and genes with a tensor decomposition decoder based on the RESCAL algorithm. BioNet includes 79 325 entities as nodes and 34 005 501 relations as edges. To train such a massive deep graph model, BioNet introduces a parallel training algorithm utilizing multiple Graphics Processing Units (GPUs). The evaluation experiments indicated that BioNet exhibits outstanding prediction performance with a best area under the Receiver Operating Characteristic (ROC) curve of 0.952, which significantly surpasses state-of-the-art methods. For further validation, top predicted CGIs of cancer and COVID-19 by BioNet were verified by external curated data and published literature.
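The encoder-decoder idea described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes a single homogeneous adjacency matrix, a standard symmetrically normalized graph-convolution layer stacked twice, random untrained weights, and a toy four-node graph. All function names, dimensions, and data here are hypothetical. The decoder follows the general RESCAL form, scoring a (chemical, interaction type, gene) triple as a bilinear product of the two node embeddings with one matrix per interaction type.

```python
import numpy as np

def normalize_adjacency(adj):
    """Symmetrically normalize A + I (a common GCN convention, assumed here)."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(a_norm, h, w):
    """One graph-convolution step: ReLU(A_norm @ H @ W)."""
    return np.maximum(a_norm @ h @ w, 0.0)

def rescal_score(e_chem, r_rel, e_gene):
    """RESCAL-style bilinear score squashed to (0, 1): sigmoid(e_c^T R e_g)."""
    return 1.0 / (1.0 + np.exp(-(e_chem @ r_rel @ e_gene)))

rng = np.random.default_rng(0)

# Toy graph (hypothetical): nodes 0-1 are chemicals, nodes 2-3 are genes.
adj = np.array([[0, 0, 1, 0],
                [0, 0, 1, 1],
                [1, 1, 0, 0],
                [0, 1, 0, 0]], dtype=float)
features = np.eye(4)                 # one-hot input features
w1 = rng.normal(size=(4, 8))         # untrained weights, for illustration only
w2 = rng.normal(size=(8, 3))

# Encoder: two stacked graph-convolution layers produce node embeddings.
a_norm = normalize_adjacency(adj)
embeddings = gcn_layer(a_norm, gcn_layer(a_norm, features, w1), w2)

# Decoder: one 3x3 relation matrix per interaction type (two types assumed).
relations = rng.normal(size=(2, 3, 3))
scores = np.array([[[rescal_score(embeddings[c], relations[r], embeddings[g])
                     for g in (2, 3)]
                    for c in (0, 1)]
                   for r in range(2)])
print(scores.shape)  # one score per (interaction type, chemical, gene) triple
```

In a trained model the weights and relation matrices would be learned from known interactions, and a heterogeneous network would typically need type-aware handling of edges, which this sketch omits.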

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8690188
DOI: http://dx.doi.org/10.1093/bib/bbab491
