Modular inverse reinforcement learning for visuomotor behavior.

Biol Cybern

Frankfurt Institute for Advanced Studies, Goethe University, 60438 , Frankfurt, Germany.

Published: August 2013

In a large variety of situations one would like to have an expressive and accurate model of observed animal or human behavior. While general purpose mathematical models may capture successfully properties of observed behavior, it is desirable to root models in biological facts. Because of ample empirical evidence for reward-based learning in visuomotor tasks, we use a computational model based on the assumption that the observed agent is balancing the costs and benefits of its behavior to meet its goals. This leads to using the framework of reinforcement learning, which additionally provides well-established algorithms for learning of visuomotor task solutions. To quantify the agent's goals as rewards implicit in the observed behavior, we propose to use inverse reinforcement learning, which quantifies the agent's goals as rewards implicit in the observed behavior. Based on the assumption of a modular cognitive architecture, we introduce a modular inverse reinforcement learning algorithm that estimates the relative reward contributions of the component tasks in navigation, consisting of following a path while avoiding obstacles and approaching targets. It is shown how to recover the component reward weights for individual tasks and that variability in observed trajectories can be explained succinctly through behavioral goals. It is demonstrated through simulations that good estimates can be obtained already with modest amounts of observation data, which in turn allows the prediction of behavior in novel configurations.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3773182	PMC
http://dx.doi.org/10.1007/s00422-013-0562-6	DOI Listing

Publication Analysis

Top Keywords

reinforcement learning

inverse reinforcement

learning visuomotor

observed behavior

modular inverse

based assumption

agent's goals

goals rewards

rewards implicit

implicit observed

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!