Unlabelled: To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs"-optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.
Author Summary: Natural environments are full of uncertainty. For example, just because my fridge had food in it yesterday does not mean it will have food today. Despite such uncertainty, animals can estimate which states and actions are the most valuable. Previous work suggests that animals estimate value using a brain area called the basal ganglia, using a process resembling a reinforcement learning algorithm called TD learning. However, traditional reinforcement learning algorithms cannot accurately estimate value in environments with state uncertainty (e.g., when my fridge's contents are unknown). One way around this problem is if agents form "beliefs," a probabilistic estimate of how likely each state is, given any observations so far. However, estimating beliefs is a demanding process that may not be possible for animals in more complex environments. Here we show that an artificial recurrent neural network (RNN) trained with TD learning can estimate value from observations, without explicitly estimating beliefs. The trained RNN's error signals resembled the neural activity of dopamine neurons measured during the same task. Importantly, the RNN's activity resembled beliefs, but only when the RNN had enough capacity. This work illustrates how animals could estimate value in uncertain environments without needing to first form beliefs, which may be useful in environments where computing the true beliefs is too costly.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104054 | PMC |
http://dx.doi.org/10.1101/2023.04.04.535512 | DOI Listing |
Viruses
January 2025
Pediatric Clinic, Department of Medicine and Surgery, University of Parma, 43126 Parma, Italy.
Nipah virus (NiV) is a zoonotic pathogen with the potential to cause human outbreaks with a high case fatality ratio. In this systematic review and meta-analysis, available evidence on NiV infections occurring in healthcare workers (HCWs) was collected and critically appraised. According to the PRISMA statement, four medical databases (PubMed, CINAHL, EMBASE, and Scopus) and the preprint repository medRixv were inquired through a specifically designed searching strategy.
View Article and Find Full Text PDFViruses
January 2025
Laboratory of Microbiology, Department of Disease Control, Faculty of Veterinary Medicine, Hokkaido University, Kita 18, Nishi 9, Kita-Ku, Sapporo 060-0818, Hokkaido, Japan.
Bovine viral diarrhea (BVD) is caused by the BVD virus (BVDV) and has been reported worldwide in cattle. To estimate BVDV circulation among cattle where few BVD cases were reported in southern Japan, 1910 serum samples collected from 35 cattle farms without a BVD outbreak were investigated to detect antibodies against BVDV-1 and BVDV-2 using an indicator virus with a cytopathogenic effect and the luciferase gene, respectively. Neutralizing antibodies against BVDV-1 and BVDV-2 were detected more frequently in 18 vaccinated farms than in 17 nonvaccinated farms.
View Article and Find Full Text PDFViruses
December 2024
International Livestock Research Institute (ILRI), Block-C, First Floor, NASC Complex, CG Centre, DPS Marg, Pusa, New Delhi 110012, India.
Mass vaccination against peste des petits ruminants (PPR) in two southern states of India, namely Andhra Pradesh and Karnataka, has reduced disease outbreaks significantly. The sporadic outbreaks reported now can be attributed in part to the recurring movement of sheep and goats between these contiguous states. This study assessed the present level of economic burden and impact of vaccination on the local system (one state), considering the exposure from the external system (neighboring state) using a system dynamic (SD) model.
View Article and Find Full Text PDFViruses
December 2024
Centre for Cosmology, Astrophysics and Space Science (CCASS), GLA University, Mathura 281 406, Utter Pradesh, India.
Bluetongue (BT) is considered endemic in the southern states of India, with sporadic incidences reported from the northern, western and central parts of India. However, the eastern and north-eastern states of India have not experienced active disease so far. In the recent past, an extensive sero-epidemiological investigation was carried out in the eastern and north-eastern Indian states.
View Article and Find Full Text PDFViruses
December 2024
Department of Microbiology, Swedish Veterinary Agency, Ulls väg 2B, 751 89 Uppsala, Sweden.
Increased evidence suggests that cattle are the primary host of Influenza D virus (IDV) and may contribute to respiratory disease in this species. The aim of this study was to detect and characterise IDV in the Swedish cattle population using archived respiratory samples. This retrospective study comprised a collection of a total 1763 samples collected between 1 January 2021 and 30 June 2024.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!