Gradient-free training of recurrent neural networks using random perturbations.

Jesús García Fernández Sander Keemink Marcel van Gerven

Front Neurosci

Department of Machine Learning and Neural Computing, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands.

Published: July 2024

Recurrent neural networks (RNNs) hold immense potential for computations due to their Turing completeness and sequential processing capabilities, yet existing methods for their training encounter efficiency challenges. Backpropagation through time (BPTT), the prevailing method, extends the backpropagation (BP) algorithm by unrolling the RNN over time. However, this approach suffers from significant drawbacks, including the need to interleave forward and backward phases and store exact gradient information. Furthermore, BPTT has been shown to struggle to propagate gradient information for long sequences, leading to vanishing gradients. An alternative strategy to using gradient-based methods like BPTT involves stochastically approximating gradients through perturbation-based methods. This learning approach is exceptionally simple, necessitating only forward passes in the network and a global reinforcement signal as feedback. Despite its simplicity, the random nature of its updates typically leads to inefficient optimization, limiting its effectiveness in training neural networks. In this study, we present a new approach to perturbation-based learning in RNNs whose performance is competitive with BPTT, while maintaining the inherent advantages over gradient-based learning. To this end, we extend the recently introduced activity-based node perturbation (ANP) method to operate in the time domain, leading to more efficient learning and generalization. We subsequently conduct a range of experiments to validate our approach. Our results show similar performance, convergence time and scalability when compared to BPTT, strongly outperforming standard node perturbation and weight perturbation methods. These findings suggest that perturbation-based learning methods offer a versatile alternative to gradient-based methods for training RNNs which can be ideally suited for neuromorphic computing applications.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11267880	PMC
http://dx.doi.org/10.3389/fnins.2024.1439155	DOI Listing

Publication Analysis

Top Keywords

neural networks

recurrent neural

methods training

gradient-based methods

perturbation-based learning

node perturbation

methods

bptt

learning

gradient-free training

Similar Publications

Psycho-physiological foundations of human physical activity behavior and motivation: Theories, systems, mechanisms, evolution, and genetics.

Physiol Rev

January 2025

Department of Sport, Exercise and Health, University of Basel, Basel, Switzerland.

Markus Gerber Boris Cheval Robyn Cody Flora Colledge Vivien Hohberg

Physical activity is a meaningful part of life, which starts before birth and lasts until death. There are many health benefits to be derived from physical activity, hence, regular engagement is recommended on a weekly basis. However, these recommendations are often not met.

View Article and Find Full Text PDF

Similar Publications

Comparative analysis of data-driven models for spatially resolved thermometry using emission spectroscopy.

PLoS One

January 2025

Department of Computer Science, Khalifa University, Abu Dhabi, UAE.

Ruiyuan Kang Dimitrios C Kyritsis Panos Liatsis

A methodology is proposed, which addresses the caveat that line-of-sight emission spectroscopy presents in that it cannot provide spatially resolved temperature measurements in non-homogeneous temperature fields. The aim of this research is to explore the use of data-driven models in measuring temperature distributions in a spatially resolved manner using emission spectroscopy data. Two categories of data-driven methods are analyzed: (i) Feature engineering and classical machine learning algorithms, and (ii) end-to-end convolutional neural networks (CNN).

View Article and Find Full Text PDF

Similar Publications

Mathematical and computational modeling for organic and insect frass fertilizer production: A systematic review.

PLoS One

January 2025

Data Management, Modelling and Geo-Information Unit, International Centre of Insect Physiology and Ecology, Kenya.

Malontema Katchali Edward Richard Henri E Z Tonnang Chrysantus M Tanga Dennis Beesigamukama

Organic fertilizers have been identified as a sustainable agricultural practice that can enhance productivity and reduce environmental impact. Recently, the European Union defined and accepted insect frass as an innovative and emerging organic fertilizer. In the wider domain of organic fertilizers, mathematical and computational models have been developed to optimize their production and application conditions.

View Article and Find Full Text PDF

Similar Publications

An ancient apical patterning system sets the position of the forebrain in chordates.

Sci Adv

January 2025

Department of Zoology, University of Cambridge, Cambridge, UK.

Giacomo Gattoni Daniel Keitley Ashley Sawle Elia Benito-Gutiérrez

The evolutionary origin of the vertebrate brain remains a major subject of debate, as its development from a dorsal tubular neuroepithelium is unique to chordates. To shed light on the evolutionary emergence of the vertebrate brain, we compared anterior neuroectoderm development across deuterostome species, using available single-cell datasets from sea urchin, amphioxus, and zebrafish embryos. We identified a conserved gene co-expression module, comparable to the anterior gene regulatory network (aGRN) controlling apical organ development in ambulacrarians, and spatially mapped it by multiplexed in situ hybridization to the developing retina and hypothalamus of chordates.

View Article and Find Full Text PDF

Similar Publications

Generative adversarial local density-based unsupervised anomaly detection.

PLoS One

January 2025

School of Information Science and Engineering, Xinjiang University, Urumqi, China.

Xinliang Li Jianmin Peng Wenjing Li Zhiping Song Xusheng Du

Anomaly detection is crucial in areas such as financial fraud identification, cybersecurity defense, and health monitoring, as it directly affects the accuracy and security of decision-making. Existing generative adversarial nets (GANs)-based anomaly detection methods overlook the importance of local density, limiting their effectiveness in detecting anomaly objects in complex data distributions. To address this challenge, we introduce a generative adversarial local density-based anomaly detection (GALD) method, which combines the data distribution modeling capabilities of GANs with local synthetic density analysis.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!