Gradient-free training of recurrent neural networks using random perturbations.

Front Neurosci

Department of Machine Learning and Neural Computing, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands.

Published: July 2024

Recurrent neural networks (RNNs) hold immense potential for computations due to their Turing completeness and sequential processing capabilities, yet existing methods for their training encounter efficiency challenges. Backpropagation through time (BPTT), the prevailing method, extends the backpropagation (BP) algorithm by unrolling the RNN over time. However, this approach suffers from significant drawbacks, including the need to interleave forward and backward phases and store exact gradient information. Furthermore, BPTT has been shown to struggle to propagate gradient information for long sequences, leading to vanishing gradients. An alternative strategy to using gradient-based methods like BPTT involves stochastically approximating gradients through perturbation-based methods. This learning approach is exceptionally simple, necessitating only forward passes in the network and a global reinforcement signal as feedback. Despite its simplicity, the random nature of its updates typically leads to inefficient optimization, limiting its effectiveness in training neural networks. In this study, we present a new approach to perturbation-based learning in RNNs whose performance is competitive with BPTT, while maintaining the inherent advantages over gradient-based learning. To this end, we extend the recently introduced activity-based node perturbation (ANP) method to operate in the time domain, leading to more efficient learning and generalization. We subsequently conduct a range of experiments to validate our approach. Our results show similar performance, convergence time and scalability when compared to BPTT, strongly outperforming standard node perturbation and weight perturbation methods. These findings suggest that perturbation-based learning methods offer a versatile alternative to gradient-based methods for training RNNs which can be ideally suited for neuromorphic computing applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11267880PMC
http://dx.doi.org/10.3389/fnins.2024.1439155DOI Listing

Publication Analysis

Top Keywords

neural networks
12
recurrent neural
8
methods training
8
gradient-based methods
8
perturbation-based learning
8
node perturbation
8
methods
6
bptt
5
learning
5
gradient-free training
4

Similar Publications

Physical activity is a meaningful part of life, which starts before birth and lasts until death. There are many health benefits to be derived from physical activity, hence, regular engagement is recommended on a weekly basis. However, these recommendations are often not met.

View Article and Find Full Text PDF

A methodology is proposed, which addresses the caveat that line-of-sight emission spectroscopy presents in that it cannot provide spatially resolved temperature measurements in non-homogeneous temperature fields. The aim of this research is to explore the use of data-driven models in measuring temperature distributions in a spatially resolved manner using emission spectroscopy data. Two categories of data-driven methods are analyzed: (i) Feature engineering and classical machine learning algorithms, and (ii) end-to-end convolutional neural networks (CNN).

View Article and Find Full Text PDF

Organic fertilizers have been identified as a sustainable agricultural practice that can enhance productivity and reduce environmental impact. Recently, the European Union defined and accepted insect frass as an innovative and emerging organic fertilizer. In the wider domain of organic fertilizers, mathematical and computational models have been developed to optimize their production and application conditions.

View Article and Find Full Text PDF

The evolutionary origin of the vertebrate brain remains a major subject of debate, as its development from a dorsal tubular neuroepithelium is unique to chordates. To shed light on the evolutionary emergence of the vertebrate brain, we compared anterior neuroectoderm development across deuterostome species, using available single-cell datasets from sea urchin, amphioxus, and zebrafish embryos. We identified a conserved gene co-expression module, comparable to the anterior gene regulatory network (aGRN) controlling apical organ development in ambulacrarians, and spatially mapped it by multiplexed in situ hybridization to the developing retina and hypothalamus of chordates.

View Article and Find Full Text PDF

Anomaly detection is crucial in areas such as financial fraud identification, cybersecurity defense, and health monitoring, as it directly affects the accuracy and security of decision-making. Existing generative adversarial nets (GANs)-based anomaly detection methods overlook the importance of local density, limiting their effectiveness in detecting anomaly objects in complex data distributions. To address this challenge, we introduce a generative adversarial local density-based anomaly detection (GALD) method, which combines the data distribution modeling capabilities of GANs with local synthetic density analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!