The hypothesis that midbrain dopamine (DA) neurons broadcast a reward prediction error (RPE) is among the great successes of computational neuroscience. However, recent results contradict a core aspect of this theory: specifically that the neurons convey a scalar, homogeneous signal. While the predominant family of extensions to the RPE model replicates the classic model in multiple parallel circuits, we argue that these models are ill suited to explain reports of heterogeneity in task variable encoding across DA neurons. Instead, we introduce a complementary 'feature-specific RPE' model, positing that individual ventral tegmental area DA neurons report RPEs for different aspects of an animal's moment-to-moment situation. Further, we show how our framework can be extended to explain patterns of heterogeneity in action responses reported among substantia nigra pars compacta DA neurons. This theory reconciles new observations of DA heterogeneity with classic ideas about RPE coding while also providing a new perspective of how the brain performs reinforcement learning in high-dimensional environments.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41593-024-01689-1DOI Listing

Publication Analysis

Top Keywords

prediction error
8
neurons
5
feature-specific prediction
4
model
4
error model
4
model explains
4
explains dopaminergic
4
heterogeneity
4
dopaminergic heterogeneity
4
heterogeneity hypothesis
4

Similar Publications

Cement ball mills in the finishing stage of the cement industries consume the highest energy in the cement manufacturing stage. Therefore, suitable controllers that result in good productivity and product quality with reduced energy consumption are required for the cement ball mill grinding process to increase the profit margins. In this study, generalised predictive controllers (GPC)have been designed for the cement ball mill grinding operation using the model obtained from the step response data taken from the industrially recognized simulator.

View Article and Find Full Text PDF

Visible and Near-infrared hyperspectral imaging (VNIR-HSI) combined with machine learning has shown its effectiveness in various detection applications. Specifically, the quality of cigar tobacco leaves undergoes subtle changes due to environmental differences during the air-curing phase. This study aims to evaluate the feasibility of deep learning methods in overcoming data limitations to develop a VNIR-HSI prediction model for the quality of cigar tobacco leaves at different air-curing levels.

View Article and Find Full Text PDF

Evaluating compost maturity, e.g. via manual seed germination index (GI) measurement, is both time-consuming and costly during composting.

View Article and Find Full Text PDF

Despite the widespread use of voriconazole in antifungal treatment, its high pharmacokinetic and pharmacodynamic variability may lead to suboptimal efficacy, especially in intensive care unit (ICU) patients. Machine learning (ML), an artificial intelligence modeling approach, is increasingly being applied to personalized medicine. The effectiveness of ML models for predicting voriconazole blood concentrations in ICU patients, compared to traditional population pharmacokinetics (popPK) models, has been uncertain until now.

View Article and Find Full Text PDF

As consumers increasingly prioritize food safety and nutritional value, the dairy industry faces a pressing need for rapid and accurate methods to detect essential nutritional components in milk, such as fat, protein, and lactose. Hyperspectral imaging (HSI) technology, known for its non-destructive, fast, and precise nature, shows great promise in food quality assessment. However, the high dimensionality of HSI data poses challenges for effective band selection and model optimization.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!