A Case of Divergent Predictions Made by Delta and Decay Rule Learning Models.

Cogsci

Department of Psychological Sciences, MS 2051 Psychology Building, Lubbock, TX 79409-2051 USA.

Published: July 2018

The Delta and Decay rules are two learning rules used to update expected values in reinforcement learning (RL) models. The delta rule learns rewards, whereas the decay rule learns rewards for each option. Participants learned to select between pairs of options that had reward probabilities of .65 (option A) versus .35 (option B) or .75 (option C) versus .25 (option D) on separate trials in a binary-outcome choice task. Crucially, during training there were twice as AB trials as CD trials, therefore participants experienced more cumulative reward from option A even though option C had a higher average reward rate (.75 versus .65). Participants then decided between novel combinations of options (e.g, A versus C). The Decay model predicted more A choices, but the Delta model predicted more C choices, because those respective options had higher cumulative versus average reward values. Results were more in line with the Decay model's predictions. This suggests that people may retrieve memories of cumulative reward to compute expected value instead of learning average rewards for each option.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8086699PMC

Publication Analysis

Top Keywords

delta decay
8
decay rule
8
learning models
8
models delta
8
rule learns
8
learns rewards
8
option
8
rewards option
8
option versus
8
versus option
8

Similar Publications

The oxidation of CHCO by (Δ) O has been investigated by means of high level quantum chemical and chemical kinetic calculations. The reaction was found to proceed through a four-membered cyclic transition state resulting from the addition of O to the CC bond of ketene. The reaction energetics has been calculated employing post-CCSD(T) corrections.

View Article and Find Full Text PDF

Dynamics of nanoparticle tracers in supercooled nanoparticle matrices.

Soft Matter

December 2024

William A. Brookshire Department of Chemical and Biomolecular Engineering, University of Houston, 4226 Martin Luther King Boulevard, Houston, Texas, 77204-4004, USA.

We investigate the dynamics of tracer nanoparticles in bulk supercooled nanoparticle matrices using confocal microscopy. We mix fluorescent (tracer) and undyed (matrix) charged-stabilized polystyrene nanoparticles with tracer-to-matrix particle size ratios = 0.34, 0.

View Article and Find Full Text PDF

This study explores the nonlinear optical (NLO) and photophysical properties of newly designed naphthyridine derivatives by density functional theory (DFT). The first hyperpolarizability (β), a key indicator of NLO activity, varies significantly depending on the substituent groups. N-substituted compounds (IUB-N series) generally show lower β values, while compounds with electron donor/acceptor groups (IUB-P series) demonstrate a broader range, with IUB-A-02 achieving the highest β value of 16,362 a.

View Article and Find Full Text PDF

It holds enormous significance for red thermally activated delayed fluorescence (TADF) emitters to develop organic light-emitting diodes (OLEDs) with high efficiency and high color purity, which remains challenging for highly efficient solution-processed red TADF emitters due to the limitation of severe nonradiative decays. Herein, a red TADF emitter containing space interactions, 4,4'-(9,10-bis(phenylethynyl)anthracene-1,8-diyl)bis(,-bis(4-methoxyphenyl)aniline) (DBP-2MOTPA), is designed and synthesized, composed of ethynyl as the acceptor and methoxytriarylamine (MOTPA) as the donor. The triphenylamine donor unit decorated with peripheral methoxy units not only improves the solubility for the solution-processed technology but also increases the electron-donating ability.

View Article and Find Full Text PDF

Anomalies in Hadronic B Decays.

Phys Rev Lett

November 2024

Physique des Particules, Université de Montréal, 1375 Avenue Thérèse-Lavoie-Roux, Montréal, Quebec H2V 0B3, Canada.

In this Letter, we perform fits to B→PP decays, where B={B^{0},B^{+},B_{s}^{0}} and the pseudoscalar P={π,K}, under the assumption of flavor SU(3) symmetry [SU(3)_{F}]. Although the fits to ΔS=0 or ΔS=1 decays individually are good, the combined fit is very poor: there is a 3.6σ disagreement with the SU(3)_{F} limit of the standard model (SM_{SU(3)_{F}}).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!