Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task.

Sci Rep

Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, 34141, Daejeon, Republic of Korea.

Published: December 2017

When making a choice with limited information, we explore new features through trial and error to learn how they are related. However, few studies have investigated exploratory behaviour when information is limited. In this study, we address, at both the behavioural and neural levels, how, when, and why humans explore new feature dimensions to learn a new policy for choosing a state-space. We designed a novel multi-dimensional reinforcement learning task to encourage participants to explore and learn new features, then used a reinforcement learning algorithm to model policy exploration and learning behaviour. Our results provide the first evidence that, when humans explore new feature dimensions, their values are transferred from the previous policy to the new online (active) policy, as opposed to being learned from scratch. We further demonstrated that exploration may be regulated by the level of cognitive ambiguity, and that this process might be controlled by the frontopolar cortex. This opens up new possibilities for understanding how humans explore new features in an open space with limited information.

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5732284
DOI: http://dx.doi.org/10.1038/s41598-017-17687-2
