Learning global dependencies and multi-semantics within heterogeneous graph for predicting disease-related lncRNAs.

Brief Bioinform

School of Information Science and Engineering (School of Software), Yanshan University, Qinhuangdao 066004, China.

Published: September 2022

Motivation: Long noncoding RNAs (lncRNAs) play an important role in the occurrence and development of diseases. Predicting disease-related lncRNAs can help to understand the pathogenesis of diseases deeply. The existing methods mainly rely on multi-source data related to lncRNAs and diseases when predicting the associations between lncRNAs and diseases. There are interdependencies among node attributes in a heterogeneous graph composed of all lncRNAs, diseases and micro RNAs. The meta-paths composed of various connections between them also contain rich semantic information. However, the existing methods neglect to integrate attribute information of intermediate nodes in meta-paths.

Results: We propose a novel association prediction model, GSMV, to learn and deeply integrate the global dependencies, semantic information of meta-paths and node-pair multi-view features related to lncRNAs and diseases. We firstly formulate the global representations of the lncRNA and disease nodes by establishing a self-attention mechanism to capture and learn the global dependencies among node attributes. Second, starting from the lncRNA and disease nodes, respectively, multiple meta-pathways are established to reveal different semantic information. Considering that each meta-path contains specific semantics and has multiple meta-path instances which have different contributions to revealing meta-path semantics, we design a graph neural network based module which consists of a meta-path instance encoding strategy and two novel attention mechanisms. The proposed meta-path instance encoding strategy is used to learn the contextual connections between nodes within a meta-path instance. One of the two new attention mechanisms is at the meta-path instance level, which learns rich and informative meta-path instances. The other attention mechanism integrates various semantic information from multiple meta-paths to learn the semantic representation of lncRNA and disease nodes. Finally, a dilated convolution-based learning module with adjustable receptive fields is proposed to learn multi-view features of lncRNA-disease node pairs. The experimental results prove that our method outperforms seven state-of-the-art comparing methods for lncRNA-disease association prediction. Ablation experiments demonstrate the contributions of the proposed global representation learning, semantic information learning, pairwise multi-view feature learning and the meta-path instance encoding strategy. Case studies on three cancers further demonstrate our method's ability to discover potential disease-related lncRNA candidates.

Contact: zhang@hlju.edu.cn or peiliangwu@ysu.edu.cn.

Supplementary Information: Supplementary data are available at Briefings in Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbac361DOI Listing

Publication Analysis

Top Keywords

meta-path instance
20
lncrnas diseases
16
global dependencies
12
lncrna disease
12
disease nodes
12
instance encoding
12
encoding strategy
12
meta-path
9
heterogeneous graph
8
predicting disease-related
8

Similar Publications

Article Synopsis
  • Drug repositioning aims to speed up drug discovery by exploring existing drugs for new disease applications through drug-disease associations (DDAs).
  • Traditional prediction methods struggle with developing end-to-end frameworks for learning detailed representations of these associations and often miss out on useful data from paths in heterogeneous networks.
  • The proposed method, MilGNet, utilizes a novel deep learning approach that employs a heterogeneous graph neural network to enhance prediction accuracy and interpretability, showing superior performance in experiments and case studies on specific drugs and diseases.
View Article and Find Full Text PDF

Genetically and semantically aware homogeneous network for prediction and scoring of comorbidities.

Comput Biol Med

December 2024

Department of Computer Science and Engineering, CEG Campus, Anna University, Chennai, Tamil Nadu, India. Electronic address:

Objective: Patients with comorbidities are highly prone to mortality risk than those suffering from a single disease. Therefore, quantification and prediction of disease comorbidities is necessary to stratify the mortality risk of the patients, predict the probability of their occurrence, design treatment strategies, and to prevent the progression of diseases. Enriching comorbidity disease relationships with rich semantics established by genetic components play a vital role in effectively quantifying and predicting comorbidities.

View Article and Find Full Text PDF

Heterogeneous Information Networks (HINs) are information networks with multiple types of nodes and edges. The concept of meta-path, i.e.

View Article and Find Full Text PDF

There is a growing body of evidence suggesting that microRNAs (miRNAs), small biological molecules, play a crucial role in the diagnosis, treatment, and prognostic assessment of diseases. However, it is often inefficient to verify the association between miRNAs and diseases (MDA) through traditional experimental methods. Based on this situation, researchers have proposed various computational-based methods, but the existing methods often have many drawbacks in terms of predictive effectiveness and accuracy.

View Article and Find Full Text PDF

Learning global dependencies and multi-semantics within heterogeneous graph for predicting disease-related lncRNAs.

Brief Bioinform

September 2022

School of Information Science and Engineering (School of Software), Yanshan University, Qinhuangdao 066004, China.

Motivation: Long noncoding RNAs (lncRNAs) play an important role in the occurrence and development of diseases. Predicting disease-related lncRNAs can help to understand the pathogenesis of diseases deeply. The existing methods mainly rely on multi-source data related to lncRNAs and diseases when predicting the associations between lncRNAs and diseases.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!