In-context operator learning with data prompts for differential equation problems.

Proc Natl Acad Sci U S A

Department of Mathematics, University of California, Los Angeles, CA 90095.

Published: September 2023

This paper introduces the paradigm of "in-context operator learning" and the corresponding model "In-Context Operator Networks" to simultaneously learn operators from the prompted data and apply it to new questions during the inference stage, without any weight update. Existing methods are limited to using a neural network to approximate a specific equation solution or a specific operator, requiring retraining when switching to a new problem with different equations. By training a single neural network as an operator learner, rather than a solution/operator approximator, we can not only get rid of retraining (even fine-tuning) the neural network for new problems but also leverage the commonalities shared across operators so that only a few examples in the prompt are needed when learning a new operator. Our numerical results show the capability of a single neural network as a few-shot operator learner for a diversified type of differential equation problems, including forward and inverse problems of ordinary differential equations, partial differential equations, and mean-field control problems, and also show that it can generalize its learning capability to operators beyond the training distribution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10523630PMC
http://dx.doi.org/10.1073/pnas.2310142120DOI Listing

Publication Analysis

Top Keywords

neural network
16
differential equation
8
equation problems
8
"in-context operator
8
single neural
8
operator learner
8
differential equations
8
operator
6
problems
5
in-context operator
4

Similar Publications

Enhanced brain tumor detection and segmentation using densely connected convolutional networks with stacking ensemble learning.

Comput Biol Med

January 2025

Emerging Technologies Research Lab (ETRL), College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia; Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia. Electronic address:

- Brain tumors (BT), both benign and malignant, pose a substantial impact on human health and need precise and early detection for successful treatment. Analysing magnetic resonance imaging (MRI) image is a common method for BT diagnosis and segmentation, yet misdiagnoses yield effective medical responses, impacting patient survival rates. Recent technological advancements have popularized deep learning-based medical image analysis, leveraging transfer learning to reuse pre-trained models for various applications.

View Article and Find Full Text PDF

SEPO-FI: Deep-learning based software to calculate fusion index of muscle cells.

Comput Biol Med

January 2025

School of Computer Science, Chungbuk National University, Cheongju 28644, Republic of Korea. Electronic address:

The fusion index is a critical metric for quantitatively assessing the transformation of in vitro muscle cells into myotubes in the biological and medical fields. Traditional methods for calculating this index manually involve the labor-intensive counting of numerous muscle cell nuclei in images, which necessitates determining whether each nucleus is located inside or outside the myotubes, leading to significant inter-observer variation. To address these challenges, this study proposes a three-stage process that integrates the strengths of pattern recognition and deep-learning to automatically calculate the fusion index.

View Article and Find Full Text PDF

Microscopic cell segmentation typically requires complex imaging, staining, and computational steps to achieve acceptable consistency. Here, we describe a protocol for the high-fidelity segmentation of the nucleus and cytoplasm in cell culture and apply it to monitor interferon-induced signal transducer and activator of transcription (STAT) signaling. We provide guidelines for sample preparation, image acquisition, and segmentation.

View Article and Find Full Text PDF

Municipal waste classification is significant for effective recycling and waste management processes that involve the classification of diverse municipal waste materials such as paper, glass, plastic, and organic matter using diverse techniques. Yet, this municipal waste classification process faces several challenges, such as high computational complexity, more time consumption, and high variability in the appearance of waste caused by variations in color, type, and degradation level, which makes an inaccurate waste classification process. To overcome these challenges, this research proposes a novel Channel and Spatial Attention-Based Multiblock Convolutional Network for accurately classifying municipal waste that utilizes a unique attention mechanism for enhancing feature learning and waste classification accuracy.

View Article and Find Full Text PDF

A multiscale molecular structural neural network for molecular property prediction.

Mol Divers

January 2025

Key Laboratory for Macromolecular Science of Shaanxi Province, School of Chemistry and Chemical Engineering, Shaanxi Normal University, Xi'an, 710119, People's Republic of China.

Molecular Property Prediction (MPP) is a fundamental task in important research fields such as chemistry, materials, biology, and medicine, where traditional computational chemistry methods based on quantum mechanics often consume substantial time and computing power. In recent years, machine learning has been increasingly used in computational chemistry, in which graph neural networks have shown good performance in molecular property prediction tasks, but they have some limitations in terms of generalizability, interpretability, and certainty. In order to address the above challenges, a Multiscale Molecular Structural Neural Network (MMSNet) is proposed in this paper, which obtains rich multiscale molecular representations through the information fusion between bonded and non-bonded "message passing" structures at the atomic scale and spatial feature information "encoder-decoder" structures at the molecular scale; a multi-level attention mechanism is introduced on the basis of theoretical analysis of molecular mechanics in order to enhance the model's interpretability; the prediction results of MMSNet are used as label values and clustered in the molecular library by the K-NN (K-Nearest Neighbors) algorithm to reverse match the spatial structure of the molecules, and the certainty of the model is quantified by comparing virtual screening results across different K-values.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!