Reliable molecular property prediction is essential for various scientific endeavors and industrial applications, such as drug discovery. However, the data scarcity, combined with the highly non-linear causal relationships between physicochemical and biological properties and conventional molecular featurization schemes, complicates the development of robust molecular machine learning models. Self-supervised learning (SSL) has emerged as a popular solution, utilizing large-scale, unannotated molecular data to learn a foundational representation of chemical space that might be advantageous for downstream tasks. Yet, existing molecular SSL methods largely overlook chemical knowledge, including molecular structure similarity, scaffold composition, and the context-dependent aspects of molecular properties when operating over the chemical space. They also struggle to learn the subtle variations in structure-activity relationship. This paper introduces a multi-channel pre-training framework that learns robust and generalizable chemical knowledge. It leverages the structural hierarchy within the molecule, embeds them through distinct pre-training tasks across channels, and aggregates channel information in a task-specific manner during fine-tuning. Our approach demonstrates competitive performance across various molecular property benchmarks and offers strong advantages in particularly challenging yet ubiquitous scenarios like activity cliffs.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41467-024-55082-4DOI Listing

Publication Analysis

Top Keywords

molecular
9
molecular property
8
chemical space
8
chemical knowledge
8
multi-channel learning
4
learning integrating
4
integrating structural
4
structural hierarchies
4
hierarchies context-dependent
4
context-dependent molecular
4

Similar Publications

Background/aim: Despite the donor-exchange program implementation for highly sensitized (HS) patients, no improvement in waiting list in those HS patients with 100% calculated panel reactive of antibodies (cPRA) is observed. Recently, it has been published the treatment with imlifidase in desensitization algorithm. However, there are low-risk strategies to reduce cPRA.

View Article and Find Full Text PDF

Sequence analysis of the 5' region of the chymotrypsin C (CTRC) gene in chronic pancreatitis.

Pancreatology

January 2025

Center for Gastroenterology, Department of Medicine, Albert Szent-Györgyi Medical School, University of Szeged, Szeged, Hungary; Hungarian Centre of Excellence for Molecular Medicine - University of Szeged, Translational Pancreatology Research Group, Szeged, Hungary. Electronic address:

Background/objectives: Loss-of-function chymotrypsin C (CTRC) variants increase the risk for chronic pancreatitis (CP) by reducing protective pancreatic CTRC activity. Variants in the 5' upstream region that includes the promoter might affect CTRC expression but have not been investigated to date. The aim of the present study was to address this knowledge gap.

View Article and Find Full Text PDF

Personalized treatment approaches in hepatocellular carcinoma.

Arab J Gastroenterol

January 2025

Endemic Medicine Department, Faculty of Medicine, Helwan University, Cairo, Egypt; Liver Disease Research Center, College of Medicine, King Saud University, Riyadh 11411, Saudi Arabia. Electronic address:

Personalized medicine is an emerging field that provides novel approaches to disease's early diagnosis, prevention, treatment, and prognosis based on the patient's criteria in gene expression, environmental factors, lifestyle, and diet. To date, hepatocellular carcinoma (HCC) is a significant global health burden, with an increasing incidence and significant death rates, despite advancements in surveillance, diagnosis, and therapeutic approaches. The majority of HCC lesions develop in patients with liver cirrhosis, carrying the risks of mortality associated with both the tumor burden and the cirrhosis.

View Article and Find Full Text PDF

[Mitoxantrone hydrochloride liposome combined with cytarabine for treating pediatric acute myeloid leukemia with RUNX1∷MTG16 fusion gene: a case report and literature review].

Zhonghua Xue Ye Xue Za Zhi

December 2024

Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences, State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Tianjin 300020, China Tianjin Institutes of Health Science, Tianjin 301600, China.

This case report presents a patient with pediatric acute myeloid leukemia (AML) with RUNX1∷MTG16, admitted to the Blood Disease Hospital of the Chinese Academy of Medical Sciences in October 2023. He was 13 years old, with a chief complaint of fatigue for 20 days. Bone marrow smear revealed 17.

View Article and Find Full Text PDF

[Clinical characteristics and treatment efficacy of newly diagnosed acute leukemia in the plateau].

Zhonghua Xue Ye Xue Za Zhi

December 2024

State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China.

This study aimed to retrospectively analyze the clinical characteristics and prognosis of patients with acute leukemia in the plateau. The clinical information of patients diagnosed with acute leukemia from February 2010 to April 2023 at the People's Hospital of Tibet Autonomous Region was reviewed and collected, including blood cell count, morphology, immunophenotype, cytogenetics, and molecular data. Survival analysis was conducted to analyze the outcome of patients with acute leukemia.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!