Global reinforcement learning in neural networks.

IEEE Trans Neural Netw

Published: March 2007

In this letter, we have found a more general formulation of the REward Increment = Nonnegative Factor x Offset Reinforcement x Characteristic Eligibility (REINFORCE) learning principle first suggested by Williams. The new formulation has enabled us to apply the principle to global reinforcement learning in networks with various sources of randomness, and to suggest several simple local rules for such networks. Numerical simulations have shown that for simple classification and reinforcement learning tasks, at least one family of the new learning rules gives results comparable to those provided by the famous Rules A(r-i) and A(r-p) for the Boltzmann machines.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TNN.2006.888376	DOI Listing

Publication Analysis

Top Keywords

reinforcement learning

global reinforcement

learning

learning neural

neural networks

networks letter

letter general

general formulation

formulation reward

reward increment

Similar Publications

Mechanisms of increased pain discrimination by contingent reinforcement: a perceptual decision-making and instrumental learning account.

Pain

January 2025

Integrative Spinal Research Group, Department of Chiropractic Medicine, Balgrist University Hospital, University of Zurich, Zurich, Switzerland.

Fabrice Hubschmid Melissa Luna Flury Martin Löffler Simon Desch Susanne Becker

Recent evidence highlights that monetary rewards can increase the precision at which healthy human volunteers can detect small changes in the intensity of thermal noxious stimuli, contradicting the idea that rewards exert a broad inhibiting influence on pain perception. This effect was stronger with contingent rewards compared with noncontingent rewards, suggesting a successful learning process. In the present study, we implemented a model comparison approach that aimed to improve our understanding of the mechanisms that underlie thermal noxious discrimination in humans.

View Article and Find Full Text PDF

Similar Publications

Honey bees rely on associative stimulus strength after training on an olfactory transitive inference task.

Front Psychol

January 2025

Sorbonne University, CNRS, INSERM, Institute of Biology Paris Seine, Neurosciences Paris Seine, Paris, France.

Martin Giurfa Silvia Lee Catherine Macri

Transitive inference, the ability to establish hierarchical relationships between stimuli, is typically tested by training with premise pairs (e.g., A + B-, B + C-, C + D-, D + E-), which establishes a stimulus hierarchy (A > B > C > D > E).

View Article and Find Full Text PDF

Similar Publications

Touch-driven advantages in reaction time but not in performance in a cross-sensory comparison of reinforcement learning.

Heliyon

January 2025

Centre for Tactile Internet with Human-in-the-Loop (CeTI), 6G Life, Technische Universität Dresden, Germany.

Wenhan Sun Isabelle Ripp Aylin Borrmann Maximilian Moll Merle Fairhurst

Recent research has highlighted a notable confidence bias in the haptic sense, yet its impact on learning relative to other senses remains unexplored. This online study investigated learning behaviour across visual, auditory, and haptic modalities using a probabilistic selection task on computers and mobile devices, employing dynamic and ecologically valid stimuli to enhance generalisability. We analysed reaction time as an indicator of confidence, alongside learning speed and task accuracy.

View Article and Find Full Text PDF

Similar Publications

The power of belief? Evidence of reduced fear extinction learning in Catholic God believers.

Front Public Health

January 2025

Dipartimento di Scienze Cognitive, Psicologiche, Pedagogiche e Degli Studi Culturali, Università di Messina, Messina, Italy.

Carmelo Mario Vicario Laura Culicetto Chiara Lucifora Francesca Ferraioli Simona Massimino

Religious beliefs can shape how people process fear. Yet the psychophysiological mechanisms underlying this phenomenon remain poorly understood. We investigated fear learning and extinction processes in a group of individuals who professed a belief in God, compared to non-believers.

View Article and Find Full Text PDF

Similar Publications

Neuroeconomically dissociable forms of mental accounting are altered in a mouse model of diabetes.

Commun Biol

January 2025

Nash Family Department of Neuroscience, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.

Chinonso A Nwakama Romain Durand-de Cuttoli Zainab M Oketokoun Samantha O Brown Jillian E Haller

Those with diabetes mellitus are at high-risk of developing psychiatric disorders, especially mood disorders, yet the link between hyperglycemia and altered motivation has not been thoroughly explored. Here, we characterized value-based decision-making behavior of a streptozocin-induced diabetic mouse model on Restaurant Row, a naturalistic neuroeconomic foraging paradigm capable of behaviorally capturing multiple decision systems known to depend on dissociable neural circuits. Mice made self-paced choices on a daily limited time-budget, accepting or rejecting reward offers based on cost (delays cued by tone pitch) and subjective value (flavors), in a closed-economy system tested across months.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!