MolRoPE-BERT: An enhanced molecular representation with Rotary Position Embedding for molecular property prediction.

J Mol Graph Model

School of Information Science and Engineering, Lanzhou University, TianshuiRoad, Lanzhou city, 730000, Lanzhou, China. Electronic address:

Published: January 2023

Molecular property prediction is a significant task in drug discovery. Most deep learning-based computational methods either develop unique chemical representation or combine complex model. However, researchers are less concerned with the possible advantages of enormous quantities of unlabeled molecular data. Since the obvious limited amount of labeled data available, this task becomes more difficult. In some senses, SMILES of the drug molecule may be regarded of as a language for chemistry, taking inspiration from natural language processing research and current advances in pretrained models. In this paper, we incorporated Rotary Position Embedding(RoPE) efficiently encode the position information of SMILES sequences, ultimately enhancing the capability of the BERT pretrained model to extract potential molecular substructure information for molecular property prediction. We proposed the MolRoPE-BERT framework, an new end-to-end deep learning framework that integrates an efficient position coding approach for capturing sequence position information with a pretrained BERT model for molecular property prediction. To generate useful molecular substructure embeddings, we first exclusively train the MolRoPE-BERT on four million unlabeled drug SMILES(i.e., ZINC 15 and ChEMBL 27). Then, we conduct a series of experiments to evaluate the performance of our proposed MolRoPE-BERT on four well-studied datasets. Compared with conventional and state-of-the-art baselines, our experiment demonstrated comparable or superior performance.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jmgm.2022.108344DOI Listing

Publication Analysis

Top Keywords

molecular property
16
property prediction
16
molecular
8
rotary position
8
molecular substructure
8
proposed molrope-bert
8
position
5
molrope-bert
4
molrope-bert enhanced
4
enhanced molecular
4

Similar Publications

Menthol is a naturally occurring cyclic terpene alcohol and is the major component of peppermint and corn mint essential oils extracted from Mentha piperita L. and Mentha arvensis L..

View Article and Find Full Text PDF

Hierarchical Porous Aggregate-Enabled Chromatography-Inspired Single-Sensor E-Nose for Volatile Monitoring.

ACS Sens

January 2025

School of Chemistry and Molecular Engineering, In Situ Devices Research Center, Shanghai Key Laboratory for Urban Ecological Processes and Eco-Restoration, East China Normal University, Shanghai 200241, China.

Monitoring volatile organic compounds (VOCs) is crucial for ensuring safety and health. In this study, we introduce a strategy to engineer a chromatography-inspired single-sensor (CISS) e-nose tailored for VOC monitoring. This approach overcomes the limitations of traditional methodologies and conventional e-noses.

View Article and Find Full Text PDF

This study presents T-1-NBAB, a new compound derived from the natural xanthine alkaloid theobromine, aimed at inhibiting VEGFR-2, a crucial protein in angiogenesis. T-1-NBAB's potential to interacts with and inhibit the VEGFR-2 was indicated using in silico techniques like molecular docking, MD simulations, MM-GBSA, PLIP, essential dynamics, and bi-dimensional projection experiments. DFT experiments was utilized also to study the structural and electrostatic properties of T-1-NBAB.

View Article and Find Full Text PDF

Pancreatic Ductal Adenocarcinoma (PDAC) is a devastating disease with poor clinical outcomes, which is mainly because of delayed disease detection, resistance to chemotherapy, and lack of specific targeted therapies. The disease's development involves complex interactions among immunological, genetic, and environmental factors, yet its molecular mechanism remains elusive. A major challenge in understanding PDAC etiology lies in unraveling the genetic profiling that governs the PDAC network.

View Article and Find Full Text PDF

The aging population necessitates a critical need for medical devices, where polymers-based surface lubrication coating is essential for optimal functionality. In fact, lubrication and mechanical requirements vary depending on the service environment of different medical devices. Until now, key mean is still blank for general preparation of hydrophilic polymers-based lubrication coatings with on-demand mechanics and lubricity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!