Biolinguistic graph fusion model for circRNA-miRNA association prediction.

Brief Bioinform

School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China.

Published: January 2024

AI Article Synopsis

  • Emerging evidence shows that circular RNAs (circRNAs) and microRNAs (miRNAs) are important in regulating various diseases, but current experimental methods to study their associations are flawed and tedious.
  • The proposed BGF-CMAP model combines advanced techniques like gradient boosting, natural language processing, and graph embedding to better predict connections between circRNAs and miRNAs.
  • BGF-CMAP demonstrated high accuracy (82.90%) in predicting these associations, outperforming previous models and confirming many of its predictions with existing experimental data.

Article Abstract

Emerging clinical evidence suggests that sophisticated associations with circular ribonucleic acids (RNAs) (circRNAs) and microRNAs (miRNAs) are a critical regulatory factor of various pathological processes and play a critical role in most intricate human diseases. Nonetheless, the above correlations via wet experiments are error-prone and labor-intensive, and the underlying novel circRNA-miRNA association (CMA) has been validated by numerous existing computational methods that rely only on single correlation data. Considering the inadequacy of existing machine learning models, we propose a new model named BGF-CMAP, which combines the gradient boosting decision tree with natural language processing and graph embedding methods to infer associations between circRNAs and miRNAs. Specifically, BGF-CMAP extracts sequence attribute features and interaction behavior features by Word2vec and two homogeneous graph embedding algorithms, large-scale information network embedding and graph factorization, respectively. Multitudinous comprehensive experimental analysis revealed that BGF-CMAP successfully predicted the complex relationship between circRNAs and miRNAs with an accuracy of 82.90% and an area under receiver operating characteristic of 0.9075. Furthermore, 23 of the top 30 miRNA-associated circRNAs of the studies on data were confirmed in relevant experiences, showing that the BGF-CMAP model is superior to others. BGF-CMAP can serve as a helpful model to provide a scientific theoretical basis for the study of CMA prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10939421PMC
http://dx.doi.org/10.1093/bib/bbae058DOI Listing

Publication Analysis

Top Keywords

circrna-mirna association
8
graph embedding
8
circrnas mirnas
8
bgf-cmap
5
biolinguistic graph
4
graph fusion
4
model
4
fusion model
4
model circrna-mirna
4
association prediction
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!