Transformers for Molecular Property Prediction: Lessons Learned from the Past Five Years.

J Chem Inf Model

Data Driven Drug Design, Center for Bioinformatics, Saarland University, Saarbrücken 66123, Germany.

Published: August 2024

Molecular Property Prediction (MPP) is vital for drug discovery, crop protection, and environmental science. Over the last decades, diverse computational techniques have been developed, from using simple physical and chemical properties and molecular fingerprints in statistical models and classical machine learning to advanced deep learning approaches. In this review, we aim to distill insights from current research on employing transformer models for MPP. We analyze the currently available models and explore key questions that arise when training and fine-tuning a transformer model for MPP. These questions encompass the choice and scale of the pretraining data, optimal architecture selections, and promising pretraining objectives. Our analysis highlights areas not yet covered in current research, inviting further exploration to enhance the field's understanding. Additionally, we address the challenges in comparing different models, emphasizing the need for standardized data splitting and robust statistical analysis.

Source
http://dx.doi.org/10.1021/acs.jcim.4c00747
