Combining STDP and binary networks for reinforcement learning from images and sparse rewards.

Neural Netw

Centro de Informática - CIn, Universidade Federal de Pernambuco, Av. Jornalista Aníbal Fernandes, s/n, Cidade Universitária, 50.740-560, Brazil. Electronic address:

Published: December 2021

Spiking neural networks (SNNs) aim to replicate energy efficiency, learning speed and temporal processing of biological brains. However, accuracy and learning speed of such networks is still behind reinforcement learning (RL) models based on traditional neural models. This work combines a pre-trained binary convolutional neural network with an SNN trained online through reward-modulated STDP in order to leverage advantages of both models. The spiking network is an extension of its previous version, with improvements in architecture and dynamics to address a more challenging task. We focus on extensive experimental evaluation of the proposed model with optimized state-of-the-art baselines, namely proximal policy optimization (PPO) and deep Q network (DQN). The models are compared on a grid-world environment with high dimensional observations, consisting of RGB images with up to 256 × 256 pixels. The experimental results show that the proposed architecture can be a competitive alternative to deep reinforcement learning (DRL) in the evaluated environment and provide a foundation for more complex future applications of spiking networks.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2021.09.010DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
12
networks reinforcement
8
learning speed
8
learning
5
combining stdp
4
stdp binary
4
networks
4
binary networks
4
learning images
4
images sparse
4

Similar Publications

Background: Depression is being increasingly acknowledged as an important risk factor contributing to coronary heart disease (CHD). Currently, there is no predictive model specifically designed to evaluate the risk of coronary heart disease among individuals with depression. We aim to develop a machine learning (ML) model that will analyze risk factors and forecast the probability of coronary heart disease in individuals suffering from depression.

View Article and Find Full Text PDF

Reinforcement Learning is Impaired in the Sub-acute Post-stroke Period.

Neurorehabil Neural Repair

January 2025

Department of Physical Medicine and Rehabilitation, Johns Hopkins University, Baltimore, MD, USA.

Background: In humans, most spontaneous recovery from motor impairment after stroke occurs in the first 3 months. Studies in animal models show higher responsiveness to training over a similar time-period. Both phenomena are often attributed to a milieu of heightened plasticity, which may share some mechanistic overlap with plasticity associated with normal motor learning.

View Article and Find Full Text PDF

Rats and mice rapidly update timed behaviors.

Anim Cogn

January 2025

Neuroscience Department, Oberlin College, 173 Lorain St, Oberlin, OH, USA.

Keeping track of time intervals is a crucial aspect of behavior and cognition. Many theoretical models of how the brain times behavior make predictions for steady-state performance of well-learned intervals, but the rate of learning intervals in these models varies greatly, ranging from one-shot learning to learning over thousands of trials. Here, we explored how quickly rats and mice adapt to changes in interval durations using a serial fixed-interval task.

View Article and Find Full Text PDF

Expanding the brain's terrain for reward.

Science

January 2025

Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.

A previously unknown region in the brainstem controls dopamine activity.

View Article and Find Full Text PDF

Rewards are essential for motivation, decision-making, memory, and mental health. We identified the subventricular tegmental nucleus (SVTg) as a brainstem reward center. In mice, reward and its prediction activate the SVTg, and SVTg stimulation leads to place preference, reduced anxiety, and accumbal dopamine release.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!