Emergence of belief-like representations through reinforcement learning.

Jay A Hennig Sandra A Romero Pinto Takahiro Yamaguchi Scott W Linderman Naoshige Uchida Samuel J Gershman

bioRxiv

Department of Psychology, Harvard University, Cambridge, MA, USA.

Published: April 2023

Unlabelled: To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs"-optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.

Author Summary: Natural environments are full of uncertainty. For example, just because my fridge had food in it yesterday does not mean it will have food today. Despite such uncertainty, animals can estimate which states and actions are the most valuable. Previous work suggests that animals estimate value using a brain area called the basal ganglia, using a process resembling a reinforcement learning algorithm called TD learning. However, traditional reinforcement learning algorithms cannot accurately estimate value in environments with state uncertainty (e.g., when my fridge's contents are unknown). One way around this problem is if agents form "beliefs," a probabilistic estimate of how likely each state is, given any observations so far. However, estimating beliefs is a demanding process that may not be possible for animals in more complex environments. Here we show that an artificial recurrent neural network (RNN) trained with TD learning can estimate value from observations, without explicitly estimating beliefs. The trained RNN's error signals resembled the neural activity of dopamine neurons measured during the same task. Importantly, the RNN's activity resembled beliefs, but only when the RNN had enough capacity. This work illustrates how animals could estimate value in uncertain environments without needing to first form beliefs, which may be useful in environments where computing the true beliefs is too costly.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104054	PMC
http://dx.doi.org/10.1101/2023.04.04.535512	DOI Listing

Publication Analysis

Top Keywords

animals estimate

reinforcement learning

estimating beliefs

estimate

animals

animals learn

learn estimate

previous work

work suggests

suggests animals

Similar Publications

Risk of Nipah Virus Seroprevalence in Healthcare Workers: A Systematic Review with Meta-Analysis.

Viruses

January 2025

Pediatric Clinic, Department of Medicine and Surgery, University of Parma, 43126 Parma, Italy.

Matteo Riccò Antonio Cascio Claudio Costantino Silvia Corrado Ilaria Zanella

Nipah virus (NiV) is a zoonotic pathogen with the potential to cause human outbreaks with a high case fatality ratio. In this systematic review and meta-analysis, available evidence on NiV infections occurring in healthcare workers (HCWs) was collected and critically appraised. According to the PRISMA statement, four medical databases (PubMed, CINAHL, EMBASE, and Scopus) and the preprint repository medRixv were inquired through a specifically designed searching strategy.

View Article and Find Full Text PDF

Similar Publications

Serosurvey of Bovine Viral Diarrhea Virus in Cattle in Southern Japan and Estimation of Its Transmissibility by Transient Infection in Nonvaccinated Cattle.

Viruses

January 2025

Laboratory of Microbiology, Department of Disease Control, Faculty of Veterinary Medicine, Hokkaido University, Kita 18, Nishi 9, Kita-Ku, Sapporo 060-0818, Hokkaido, Japan.

Norikazu Isoda Satoshi Sekiguchi Chika Ryu Kosuke Notsu Maya Kobayashi

Bovine viral diarrhea (BVD) is caused by the BVD virus (BVDV) and has been reported worldwide in cattle. To estimate BVDV circulation among cattle where few BVD cases were reported in southern Japan, 1910 serum samples collected from 35 cattle farms without a BVD outbreak were investigated to detect antibodies against BVDV-1 and BVDV-2 using an indicator virus with a cytopathogenic effect and the luciferase gene, respectively. Neutralizing antibodies against BVDV-1 and BVDV-2 were detected more frequently in 18 vaccinated farms than in 17 nonvaccinated farms.

View Article and Find Full Text PDF

Similar Publications

Assessment of Vaccination Impact in PPR-Control Program Implemented in Southern States of India: A System Dynamics Model Approach.

Viruses

December 2024

International Livestock Research Institute (ILRI), Block-C, First Floor, NASC Complex, CG Centre, DPS Marg, Pusa, New Delhi 110012, India.

Govindaraj Gurrappanaidu Naveen Kumar Gajalavarahalli Subbanna Francis Wanyoike Sirak Bahta Yeswanth Raghuram Reddy

Mass vaccination against peste des petits ruminants (PPR) in two southern states of India, namely Andhra Pradesh and Karnataka, has reduced disease outbreaks significantly. The sporadic outbreaks reported now can be attributed in part to the recurring movement of sheep and goats between these contiguous states. This study assessed the present level of economic burden and impact of vaccination on the local system (one state), considering the exposure from the external system (neighboring state) using a system dynamic (SD) model.

View Article and Find Full Text PDF

Similar Publications

Posterior Positivity Distribution Analysis of Subclinical Bluetongue in the Eastern and North-Eastern States of India: A Wakeup Call for Outbreak Preparedness.

Viruses

December 2024

Centre for Cosmology, Astrophysics and Space Science (CCASS), GLA University, Mathura 281 406, Utter Pradesh, India.

Siddhartha Narayan Joardar Aritra Sanyal Ahmed Abd El Wahed Saibal Ray

Bluetongue (BT) is considered endemic in the southern states of India, with sporadic incidences reported from the northern, western and central parts of India. However, the eastern and north-eastern states of India have not experienced active disease so far. In the recent past, an extensive sero-epidemiological investigation was carried out in the eastern and north-eastern Indian states.

View Article and Find Full Text PDF

Similar Publications

Detection and Phylogenetic Characterization of Influenza D in Swedish Cattle.

Viruses

December 2024

Department of Microbiology, Swedish Veterinary Agency, Ulls väg 2B, 751 89 Uppsala, Sweden.

Ignacio Alvarez Fereshteh Banihashem Annie Persson Emma Hurri Hyeyoung Kim

Increased evidence suggests that cattle are the primary host of Influenza D virus (IDV) and may contribute to respiratory disease in this species. The aim of this study was to detect and characterise IDV in the Swedish cattle population using archived respiratory samples. This retrospective study comprised a collection of a total 1763 samples collected between 1 January 2021 and 30 June 2024.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!