MvMRL: a multi-view molecular representation learning method for molecular property prediction.

Brief Bioinform

Guangxi Key Lab of Human-Machine Interaction and Intelligent Decision, Nanning Normal University, No. 175, Mingxiu East Road, Xixiang Tang District, Nanning 530001, China.

Published: May 2024

Effective molecular representation learning is very important for Artificial Intelligence-driven Drug Design because it affects the accuracy and efficiency of molecular property prediction and other molecular modeling relevant tasks. However, previous molecular representation learning studies often suffer from limitations, such as over-reliance on a single molecular representation, failure to fully capture both local and global information in molecular structure, and ineffective integration of multiscale features from different molecular representations. These limitations restrict the complete and accurate representation of molecular structure and properties, ultimately impacting the accuracy of predicting molecular properties. To this end, we propose a novel multi-view molecular representation learning method called MvMRL, which can incorporate feature information from multiple molecular representations and capture both local and global information from different views well, thus improving molecular property prediction. Specifically, MvMRL consists of four parts: a multiscale CNN-SE Simplified Molecular Input Line Entry System (SMILES) learning component and a multiscale Graph Neural Network encoder to extract local feature information and global feature information from the SMILES view and the molecular graph view, respectively; a Multi-Layer Perceptron network to capture complex non-linear relationship features from the molecular fingerprint view; and a dual cross-attention component to fuse feature information on the multi-views deeply for predicting molecular properties. We evaluate the performance of MvMRL on 11 benchmark datasets, and experimental results show that MvMRL outperforms state-of-the-art methods, indicating its rationality and effectiveness in molecular property prediction. The source code of MvMRL was released in https://github.com/jedison-github/MvMRL.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11200189PMC
http://dx.doi.org/10.1093/bib/bbae298DOI Listing

Publication Analysis

Top Keywords

molecular representation
20
molecular
19
representation learning
16
molecular property
16
property prediction
16
multi-view molecular
8
learning method
8
capture local
8
local global
8
molecular structure
8

Similar Publications

Unified Knowledge-Guided Molecular Graph Encoder with multimodal fusion and multi-task learning.

Neural Netw

December 2024

School of Computer Science, Wuhan University, Luojiashan Road, Wuchang District., Wuhan, 430072, Hubei Province, China; Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, No. 8, Yangqiaohu Avenue, Zanglong Island Development Zone, Jiangxia District, Wuhan, 2007, Hubei Province, China. Electronic address:

The remarkable success of Graph Neural Networks underscores their formidable capacity to assimilate multimodal inputs, markedly enhancing performance across a broad spectrum of domains. In the context of molecular modeling, considerable efforts have been made to enrich molecular representations by integrating data from diverse aspects. Nevertheless, current methodologies frequently compartmentalize geometric and semantic components, resulting in a fragmented approach that impairs the holistic integration of molecular attributes.

View Article and Find Full Text PDF

Understanding the function of proteins is of great significance for revealing disease pathogenesis and discovering new targets. Benefiting from the explosive growth of the protein universal, deep learning has been applied to accelerate the protein annotation cycle from different biological modalities. However, most existing deep learning-based methods not only fail to effectively fuse different biological modalities, resulting in low-quality protein representations, but also suffer from the convergence of suboptimal solution caused by sparse label representations.

View Article and Find Full Text PDF

As the primary innate immune cells of the brain, microglia play a key role in various homeostatic and disease-related processes. To carry out their numerous functions, microglia adopt a wide range of phenotypic states. The proteomic landscape represents a more accurate molecular representation of these phenotypes; however, microglia present unique challenges for proteomic analysis.

View Article and Find Full Text PDF

Early weaning management followed by energy supplementation can lead to metabolic alterations in the calf that exert long-term effects on the animal's health and performance. It is believed that the main molecular basis underlying these metabolic adaptations are epigenetic mechanisms that regulate, activate, or silence genes at different stages of development and/or in response to different environmental stimuli. However, little is known about postnatal metabolic programming in .

View Article and Find Full Text PDF

Disordered single-stranded RNA (ssRNA) molecules, like their well-folded counterparts, have crucial functions that depend on their structures. However, since native ssRNAs constitute a highly heterogeneous conformer population, their structural characterization poses challenges. One important question regards the role of sequence in influencing ssRNA structure.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!