Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling.

A David Redish Steve Jensen Adam Johnson Zeb Kurth-Nelson

Psychol Rev

Graduate Program in Neuroscience, University of Minnesota.

Published: July 2007

Because learned associations are quickly renewed following extinction, the extinction process must include processes other than unlearning. However, reinforcement learning models, such as the temporal difference reinforcement learning (TDRL) model, treat extinction as an unlearning of associated value and are thus unable to capture renewal. TDRL models are based on the hypothesis that dopamine carries a reward prediction error signal; these models predict reward by driving that reward error to zero. The authors construct a TDRL model that can accommodate extinction and renewal through two simple processes: (a) a TDRL process that learns the value of situation-action pairs and (b) a situation recognition process that categorizes the observed cues into situations. This model has implications for dysfunctional states, including relapse after addiction and problem gambling.

Download full-text PDF	Source
http://dx.doi.org/10.1037/0033-295X.114.3.784	DOI Listing

Publication Analysis

Top Keywords

reinforcement learning

learning models

extinction renewal

problem gambling

tdrl model

extinction

reconciling reinforcement

models

models behavioral

behavioral extinction

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!