Current studies suggest that individuals estimate the value of their choices based on observed feedback. Here, we ask whether individuals also update the value of their unchosen actions, even when the associated feedback remains unknown. One hundred seventy-eight individuals completed a multi-armed bandit task, making choices to gain rewards. We found robust evidence suggesting latent value updating of unchosen actions based on the chosen action's outcome. Computational modeling results suggested that this effect is mainly explained by a value updating mechanism whereby individuals integrate the outcome history for choosing an option with that of rejecting the alternative. Properties of the deliberation (i.e., duration/difficulty) did not moderate the latent value updating of unchosen actions, suggesting that memory traces generated during deliberation might take a smaller role in this specific phenomenon than previously thought. We discuss the mechanisms facilitating credit assignment to unchosen actions and their implications for human decision-making.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588947PMC
http://dx.doi.org/10.1126/sciadv.adi2704DOI Listing

Publication Analysis

Top Keywords

unchosen actions
20
latent updating
12
updating unchosen
12
unchosen
5
actions
5
computational mechanisms
4
mechanisms underlying
4
underlying latent
4
updating
4
actions current
4

Similar Publications

Cognitive psychology: computing the value of the choices we do not make.

Commun Psychol

March 2024

Donders Institute for Brain, Cognition, and Behaviour, Radboud University Nijmegen, Nijmegen, Netherlands.

Reflecting on choices we did make and those we could have made is very common. In a recent study in , researchers used a reinforcement learning paradigm together with computational modeling to study the processes underlying the value update of unchosen actions.

View Article and Find Full Text PDF

The frontopolar cortex (FPC) is, to date, one of the least understood regions of the prefrontal cortex. The current understanding of its function suggests that it plays a role in the control of exploratory behaviors by coordinating the activities of other prefrontal cortex areas involved in decision-making and exploiting actions based on their outcomes. Based on this hypothesis, FPC would drive fast-learning processes through a valuation of the different alternatives.

View Article and Find Full Text PDF

Current studies suggest that individuals estimate the value of their choices based on observed feedback. Here, we ask whether individuals also update the value of their unchosen actions, even when the associated feedback remains unknown. One hundred seventy-eight individuals completed a multi-armed bandit task, making choices to gain rewards.

View Article and Find Full Text PDF

A neurocognitive theory of flexible emotion control: The role of the lateral frontal pole in emotion regulation.

Ann N Y Acad Sci

July 2023

Donders Center for Cognitive Neuroimaging, Donders Institute for Brain Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands.

Emotion regulation is essential to survive in a world full of challenges with rapidly changing contextual demands. The ability to flexibly shift between different emotional control strategies is critical to successfully deal with these demands. Recently, decision neuroscience has shown the importance of monitoring alternative control strategies.

View Article and Find Full Text PDF

A Normative Account of Confirmation Bias During Reinforcement Learning.

Neural Comput

January 2022

MRC Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford OX3 9DU, U.K.

Reinforcement learning involves updating estimates of the value of states and actions on the basis of experience. Previous work has shown that in humans, reinforcement learning exhibits a confirmatory bias: when the value of a chosen option is being updated, estimates are revised more radically following positive than negative reward prediction errors, but the converse is observed when updating the unchosen option value estimate. Here, we simulate performance on a multi-arm bandit task to examine the consequences of a confirmatory bias for reward harvesting.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!