Decentralization is a central characteristic of biological motor control that allows for fast responses relying on local sensory information. In contrast, the current trend of Deep Reinforcement Learning (DRL) based approaches to motor control follows a centralized paradigm using a single, holistic controller that has to untangle the whole input information space. This motivates to ask whether decentralization as seen in biological control architectures might also be beneficial for embodied sensori-motor control systems when using DRL. To answer this question, we provide an analysis and comparison of eight control architectures for adaptive locomotion that were derived for a four-legged agent, but with their degree of decentralization varying systematically between the extremes of fully centralized and fully decentralized. Our comparison shows that learning speed is significantly enhanced in distributed architectures-while still reaching the same high performance level of centralized architectures-due to smaller search spaces and local costs providing more focused information for learning. Second, we find an increased robustness of the learning process in the decentralized cases-it is less demanding to hyperparameter selection and less prone to becoming trapped in poor local minima. Finally, when examining generalization to uneven terrains-not used during training-we find best performance for an intermediate architecture that is decentralized, but integrates only local information from both neighboring legs. Together, these findings demonstrate beneficial effects of distributing control into decentralized units and relying on local information. This appears as a promising approach towards more robust DRL and better generalization towards adaptive behavior.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2021.09.017DOI Listing

Publication Analysis

Top Keywords

deep reinforcement
8
reinforcement learning
8
motor control
8
relying local
8
control architectures
8
decentralized
6
local
6
control
6
learning
5
decentralized control
4

Similar Publications

In cybersecurity, anomaly detection in tabular data is essential for ensuring information security. While traditional machine learning and deep learning methods have shown some success, they continue to face significant challenges in terms of generalization. To address these limitations, this paper presents an innovative method for tabular data anomaly detection based on large language models, called "Tabular Anomaly Detection via Guided Prompts" (TAD-GP).

View Article and Find Full Text PDF

Patients with type 1 diabetes and their physicians have long desired a fully closed-loop artificial pancreas (AP) system that can alleviate the burden of blood glucose regulation. Although deep reinforcement learning (DRL) methods theoretically enable adaptive insulin dosing control, they face numerous challenges, including safety and training efficiency, which have hindered their clinical application. This paper proposes a safe and efficient adaptive insulin delivery controller based on DRL.

View Article and Find Full Text PDF

Opioid reward and deep brain stimulation of the lateral hypothalamic area.

Vitam Horm

January 2025

Department of Physiology, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran. Electronic address:

Opioid use disorder (OUD) is considered a global health issue that affects various aspects of patients' lives and poses a considerable burden on society. Due to the high prevalence of remissions and relapses, novel therapeutic approaches are required to manage OUD. Deep brain stimulation (DBS) is one of the most promising clinical breakthroughs in translational neuroscience.

View Article and Find Full Text PDF

The oceanic dissolved organic matter (DOM) reservoir is one of Earth's largest carbon pools, yet the factors contributing to its recalcitrance and persistence remain poorly understood. Here, we employed ultra-high resolution mass spectrometry (UHRMS) to examine the molecular dynamics of DOM from terrestrial, marine and mixed sources during bio-incubation over weekly, monthly, and one year time spans. Using Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR MS), we classified DOM into three distinct categories (Consumed, Resistant and Product) based on their presence or absence at the start and end of the incubation.

View Article and Find Full Text PDF

Polyurethane (PU) grouting materials are widely used in underground engineering rehabilitation, particularly in reinforcement and waterproofing engineering in deep-water environments. The long-term effect of complex underground environments can lead to nanochannel formation within PU, weakening its repair remediation effect. However, the permeation behavior and microscopic mechanisms of water molecules within PU nanochannels remain unclear.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!