Neural processes (NPs) are models for meta-learning which output uncertainty estimates. So far, most studies of NPs have focused on low-dimensional datasets of highly-correlated tasks. While these homogeneous datasets are useful for benchmarking, they may not be representative of realistic transfer learning. In particular, applications in scientific research may prove especially challenging due to the potential novelty of meta-testing tasks. Molecular property prediction is one such research area that is characterized by sparse datasets of many functions on a shared molecular space. In this paper, we study the application of graph NPs to molecular property prediction with DOCKSTRING, a diverse dataset of docking scores. Graph NPs show competitive performance in few-shot learning tasks relative to supervised learning baselines common in chemoinformatics, as well as alternative techniques for transfer learning and meta-learning. In order to increase meta-generalization to divergent test functions, we propose fine-tuning strategies that adapt the parameters of NPs. We find that adaptation can substantially increase NPs' regression performance while maintaining good calibration of uncertainty estimates. Finally, we present a Bayesian optimization experiment which showcases the potential advantages of NPs over Gaussian processes in iterative screening. Overall, our results suggest that NPs on molecular graphs hold great potential for molecular property prediction in the low-data setting. SCIENTIFIC CONTRIBUTION: Neural processes are a family of meta-learning algorithms which deal with data scarcity by transferring information across tasks and making probabilistic predictions. We evaluate their performance on regression and optimization molecular tasks using docking scores, finding them to outperform classical single-task and transfer-learning models. We examine the issue of generalization to divergent test tasks, which is a general concern of meta-learning algorithms in science, and propose strategies to alleviate it.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11515514PMC
http://dx.doi.org/10.1186/s13321-024-00904-2DOI Listing

Publication Analysis

Top Keywords

neural processes
12
docking scores
12
molecular property
12
property prediction
12
uncertainty estimates
8
graph nps
8
nps molecular
8
divergent test
8
meta-learning algorithms
8
nps
7

Similar Publications

Pain is a dynamic and nonlinear experience shaped by injury and contextual factors, including expectations of future pain or relief . While µ opioid receptors are central to the analgesic effects of opioid drugs, the endogenous opioid neurocircuitry underlying pain and placebo analgesia remains poorly understood. The ventrolateral column of the posterior periaqueductal gray is a critical hub for nociception and endogenous analgesia mediated by opioid signaling .

View Article and Find Full Text PDF

Due to their self-renewal and differentiation capabilities, pluripotent stem cells hold immense potential for advancing our understanding of human disease and developing cell-based or pharmacological interventions. Realizing this potential, however, requires a thorough understanding of the basal cellular mechanisms which occur during differentiation. Lipids are critical molecules that define the morphological, biochemical, and functional role of cells.

View Article and Find Full Text PDF

Unlabelled: The use of microcomputed tomography (Micro-CT) for imaging biological samples has burgeoned in the past decade, due to increased access to scanning platforms, ease of operation, isotropic three-dimensional image information, and the ability to derive accurate quantitative data. However, manual data analysis of Micro-CT images can be laborious and time intensive. Deep learning offers the ability to streamline this process, but historically has included caveats-namely, the need for a large amount of training data, which is often limited in many Micro-CT studies.

View Article and Find Full Text PDF

Unlabelled: Neurophysiology studies propose that predictive coding is implemented via alpha/beta (8-30 Hz) rhythms that prepare specific pathways to process predicted inputs. This leads to a state of relative inhibition, reducing feedforward gamma (40-90 Hz) rhythms and spiking to predictable inputs. We refer to this model as predictive routing.

View Article and Find Full Text PDF

Binocular vision requires that the brain integrate information coming from each eye. These images are combined (fused) to generate a meaningful composite image. Differences between images, within a range, provide useful information about depth (stereopsis).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!