Neurocontrol for fixed-length trajectories in environments with soft barriers.

Neural Netw

Department of Electrical and Computer Engineering, University of Alabama, Tuscaloosa, 35401, AL, US.

Published: December 2024

In this paper we present three neurocontrol problems where the analytic policy gradient via back-propagation through time is used to train a simulated agent to maximise a polynomial reward function in a simulated environment. If the environment includes terminal barriers (e.g. solid walls) which terminate the episode whenever the agent touches them, then we show learning can get stuck in oscillating limit cycles, or local minima. Hence we propose to use fixed-length trajectories, and change these barriers into soft barriers, which the agent may pass through, while incurring a significant penalty cost. We demonstrate that the presence of soft barriers can have the drawback of causing exploding learning gradients. Furthermore, the strongest learning gradients often appear at inappropriate parts of the trajectory, where control of the system has already been lost. When combined with modern adaptive optimisers, this combination of exploding gradients and inappropriate learning often causes learning to grind to a halt. We propose ways to avoid these difficulties; either by careful gradient clipping, or by smoothly truncating the gradients of the soft barriers' polynomial cost functions. We argue that this enables the learning algorithm to avoid exploding gradients, and also to concentrate on the most important parts of the trajectory, as opposed to parts of the trajectory where control has already been irreversibly lost.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2024.107034DOI Listing

Publication Analysis

Top Keywords

soft barriers
12
parts trajectory
12
fixed-length trajectories
8
learning gradients
8
trajectory control
8
exploding gradients
8
learning
6
barriers
5
gradients
5
neurocontrol fixed-length
4

Similar Publications

Controversies in Myopia Control Treatment: What Does it Mean for Future Research?

Am J Ophthalmol

January 2025

Department of Ophthalmology, University of Warmia and Mazury, Olsztyn, Poland; Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, Poznan Poland. Electronic address:

Purpose: Treatment of myopia has been informed by more than 3 decades of clinical trials and other observations. However, controversies regarding myopia control remain, such as when to stop treatment and what is the long-term efficacy of treatment. This perspective aims to describe clinically relevant and current controversies regarding myopia treatment.

View Article and Find Full Text PDF

Recombinant collagen microneedles for transdermal delivery of antibacterial copper-DNA nanoparticles to treat skin and soft tissue infections.

J Control Release

January 2025

School of Pharmacy, Changzhou University, Changzhou 213164, China; School of Medical and Health Engineering, Changzhou University, Changzhou 213164, PR China. Electronic address:

Skin and soft tissue infections (SSTI) include bacterial infections of the skin, muscles, and connective tissue such as ligaments and tendons. SSTI in patients with immunocompromising diseases may lead to chronic, hard-to-heal infected wounds, resulting in disability, amputation, or even death. To treat SSTI and rebuild the defensive barrier of the skin, here we utilize recombinant type XVII collagen protein (rCol XVII) to construct biodegradable, regenerative collagen microneedles (rCol-MNs) for transdermal delivery of antibacterial agents.

View Article and Find Full Text PDF

The Helicobacter pylori flagellar motor contains several accessory structures that are not found in the archetypal Escherichia coli and Salmonella enterica motors. H. pylori hp0838 encodes a previously uncharacterized lipoprotein and is in an operon with flgP, which encodes a motor accessory protein.

View Article and Find Full Text PDF

Evaluate predictors for attendance and adherence to speech evaluations and determine factors that influence longitudinal care for patients with cleft palate and craniofacial differences. Retrospective, observational cohort study. Tertiary children's hospital.

View Article and Find Full Text PDF

Increasing attention to sustainability and cost-effectiveness in energy storage sector has catalyzed the rise of rechargeable Zinc-ion batteries (ZIBs). However, finding replacement for limited cycle-life Zn-anode is a major challenge. Molybdenum disulfide (MoS), an insertion-type 2D layered material, has shown promising characteristics as a ZIB anode.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!