An algorithmic account for how humans efficiently learn, transfer, and compose hierarchically structured decision policies.

Cognition

Helen Wills Neuroscience Institute, University of California, Berkeley, United States of America; Department of Psychology, University of California, Berkeley, United States of America.

Published: January 2025

Learning structures that effectively abstract decision policies is key to the flexibility of human intelligence. Previous work has shown that humans use hierarchically structured policies to efficiently navigate complex and dynamic environments. However, the computational processes that support the learning and construction of such policies remain insufficiently understood. To address this question, we tested 1026 human participants, who made over 1 million choices combined, in a decision-making task where they could learn, transfer, and recompose multiple sets of hierarchical policies. We propose a novel algorithmic account for the learning processes underlying observed human behavior. We show that humans rely on compressed policies over states in early learning, which gradually unfold into hierarchical representations via meta-learning and Bayesian inference. Our modeling evidence suggests that these hierarchical policies are structured in a temporally backward, rather than forward, fashion. Taken together, these algorithmic architectures characterize how the interplay between reinforcement learning, policy compression, meta-learning, and working memory supports structured decision-making and compositionality in a resource-rational way.

Source
http://dx.doi.org/10.1016/j.cognition.2024.105967
