Modular inverse reinforcement learning for visuomotor behavior.

Biol Cybern

Frankfurt Institute for Advanced Studies, Goethe University, 60438 , Frankfurt, Germany.

Published: August 2013

In a large variety of situations one would like to have an expressive and accurate model of observed animal or human behavior. While general purpose mathematical models may capture successfully properties of observed behavior, it is desirable to root models in biological facts. Because of ample empirical evidence for reward-based learning in visuomotor tasks, we use a computational model based on the assumption that the observed agent is balancing the costs and benefits of its behavior to meet its goals. This leads to using the framework of reinforcement learning, which additionally provides well-established algorithms for learning of visuomotor task solutions. To quantify the agent's goals as rewards implicit in the observed behavior, we propose to use inverse reinforcement learning, which quantifies the agent's goals as rewards implicit in the observed behavior. Based on the assumption of a modular cognitive architecture, we introduce a modular inverse reinforcement learning algorithm that estimates the relative reward contributions of the component tasks in navigation, consisting of following a path while avoiding obstacles and approaching targets. It is shown how to recover the component reward weights for individual tasks and that variability in observed trajectories can be explained succinctly through behavioral goals. It is demonstrated through simulations that good estimates can be obtained already with modest amounts of observation data, which in turn allows the prediction of behavior in novel configurations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3773182PMC
http://dx.doi.org/10.1007/s00422-013-0562-6DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
16
inverse reinforcement
12
learning visuomotor
12
observed behavior
12
modular inverse
8
based assumption
8
agent's goals
8
goals rewards
8
rewards implicit
8
implicit observed
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!