Unlabelled: To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs"-optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.

Author Summary: Natural environments are full of uncertainty. For example, just because my fridge had food in it yesterday does not mean it will have food today. Despite such uncertainty, animals can estimate which states and actions are the most valuable. Previous work suggests that animals estimate value using a brain area called the basal ganglia, using a process resembling a reinforcement learning algorithm called TD learning. However, traditional reinforcement learning algorithms cannot accurately estimate value in environments with state uncertainty (e.g., when my fridge's contents are unknown). One way around this problem is if agents form "beliefs," a probabilistic estimate of how likely each state is, given any observations so far. However, estimating beliefs is a demanding process that may not be possible for animals in more complex environments. Here we show that an artificial recurrent neural network (RNN) trained with TD learning can estimate value from observations, without explicitly estimating beliefs. The trained RNN's error signals resembled the neural activity of dopamine neurons measured during the same task. Importantly, the RNN's activity resembled beliefs, but only when the RNN had enough capacity. This work illustrates how animals could estimate value in uncertain environments without needing to first form beliefs, which may be useful in environments where computing the true beliefs is too costly.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104054PMC
http://dx.doi.org/10.1101/2023.04.04.535512DOI Listing

Publication Analysis

Top Keywords

animals estimate
20
reinforcement learning
16
estimating beliefs
16
estimate
10
animals
9
animals learn
8
learn estimate
8
previous work
8
work suggests
8
suggests animals
8

Similar Publications

Nipah virus (NiV) is a zoonotic pathogen with the potential to cause human outbreaks with a high case fatality ratio. In this systematic review and meta-analysis, available evidence on NiV infections occurring in healthcare workers (HCWs) was collected and critically appraised. According to the PRISMA statement, four medical databases (PubMed, CINAHL, EMBASE, and Scopus) and the preprint repository medRixv were inquired through a specifically designed searching strategy.

View Article and Find Full Text PDF

Serosurvey of Bovine Viral Diarrhea Virus in Cattle in Southern Japan and Estimation of Its Transmissibility by Transient Infection in Nonvaccinated Cattle.

Viruses

January 2025

Laboratory of Microbiology, Department of Disease Control, Faculty of Veterinary Medicine, Hokkaido University, Kita 18, Nishi 9, Kita-Ku, Sapporo 060-0818, Hokkaido, Japan.

Bovine viral diarrhea (BVD) is caused by the BVD virus (BVDV) and has been reported worldwide in cattle. To estimate BVDV circulation among cattle where few BVD cases were reported in southern Japan, 1910 serum samples collected from 35 cattle farms without a BVD outbreak were investigated to detect antibodies against BVDV-1 and BVDV-2 using an indicator virus with a cytopathogenic effect and the luciferase gene, respectively. Neutralizing antibodies against BVDV-1 and BVDV-2 were detected more frequently in 18 vaccinated farms than in 17 nonvaccinated farms.

View Article and Find Full Text PDF

Mass vaccination against peste des petits ruminants (PPR) in two southern states of India, namely Andhra Pradesh and Karnataka, has reduced disease outbreaks significantly. The sporadic outbreaks reported now can be attributed in part to the recurring movement of sheep and goats between these contiguous states. This study assessed the present level of economic burden and impact of vaccination on the local system (one state), considering the exposure from the external system (neighboring state) using a system dynamic (SD) model.

View Article and Find Full Text PDF

Bluetongue (BT) is considered endemic in the southern states of India, with sporadic incidences reported from the northern, western and central parts of India. However, the eastern and north-eastern states of India have not experienced active disease so far. In the recent past, an extensive sero-epidemiological investigation was carried out in the eastern and north-eastern Indian states.

View Article and Find Full Text PDF

Detection and Phylogenetic Characterization of Influenza D in Swedish Cattle.

Viruses

December 2024

Department of Microbiology, Swedish Veterinary Agency, Ulls väg 2B, 751 89 Uppsala, Sweden.

Increased evidence suggests that cattle are the primary host of Influenza D virus (IDV) and may contribute to respiratory disease in this species. The aim of this study was to detect and characterise IDV in the Swedish cattle population using archived respiratory samples. This retrospective study comprised a collection of a total 1763 samples collected between 1 January 2021 and 30 June 2024.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!