Hyperbolically discounted temporal difference learning.

Neural Comput

Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN 47405, USA.

Published: June 2010

Hyperbolic discounting of future outcomes is widely observed to underlie choice behavior in animals. Additionally, recent studies (Kobayashi & Schultz, 2008) have reported that hyperbolic discounting is observed even in neural systems underlying choice. However, the most prevalent models of temporal discounting, such as temporal difference learning, assume that future outcomes are discounted exponentially. Exponential discounting has been preferred largely because it can be expressed recursively, whereas hyperbolic discounting has heretofore been thought not to have a recursive definition. In this letter, we define a learning algorithm, hyperbolically discounted temporal difference (HDTD) learning, which constitutes a recursive formulation of the hyperbolic model.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005720PMC
http://dx.doi.org/10.1162/neco.2010.08-09-1080DOI Listing

Publication Analysis

Top Keywords

temporal difference
12
hyperbolic discounting
12
hyperbolically discounted
8
discounted temporal
8
difference learning
8
future outcomes
8
discounting
5
temporal
4
learning
4
hyperbolic
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!