In this paper we present three neurocontrol problems where the analytic policy gradient via back-propagation through time is used to train a simulated agent to maximise a polynomial reward function in a simulated environment. If the environment includes terminal barriers (e.g. solid walls) which terminate the episode whenever the agent touches them, then we show learning can get stuck in oscillating limit cycles, or local minima. Hence we propose to use fixed-length trajectories, and change these barriers into soft barriers, which the agent may pass through, while incurring a significant penalty cost. We demonstrate that the presence of soft barriers can have the drawback of causing exploding learning gradients. Furthermore, the strongest learning gradients often appear at inappropriate parts of the trajectory, where control of the system has already been lost. When combined with modern adaptive optimisers, this combination of exploding gradients and inappropriate learning often causes learning to grind to a halt. We propose ways to avoid these difficulties; either by careful gradient clipping, or by smoothly truncating the gradients of the soft barriers' polynomial cost functions. We argue that this enables the learning algorithm to avoid exploding gradients, and also to concentrate on the most important parts of the trajectory, as opposed to parts of the trajectory where control has already been irreversibly lost.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2024.107034 | DOI Listing |
Am J Ophthalmol
January 2025
Department of Ophthalmology, University of Warmia and Mazury, Olsztyn, Poland; Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, Poznan Poland. Electronic address:
Purpose: Treatment of myopia has been informed by more than 3 decades of clinical trials and other observations. However, controversies regarding myopia control remain, such as when to stop treatment and what is the long-term efficacy of treatment. This perspective aims to describe clinically relevant and current controversies regarding myopia treatment.
View Article and Find Full Text PDFJ Control Release
January 2025
School of Pharmacy, Changzhou University, Changzhou 213164, China; School of Medical and Health Engineering, Changzhou University, Changzhou 213164, PR China. Electronic address:
Skin and soft tissue infections (SSTI) include bacterial infections of the skin, muscles, and connective tissue such as ligaments and tendons. SSTI in patients with immunocompromising diseases may lead to chronic, hard-to-heal infected wounds, resulting in disability, amputation, or even death. To treat SSTI and rebuild the defensive barrier of the skin, here we utilize recombinant type XVII collagen protein (rCol XVII) to construct biodegradable, regenerative collagen microneedles (rCol-MNs) for transdermal delivery of antibacterial agents.
View Article and Find Full Text PDFPLoS Pathog
January 2025
Department of Microbiology, University of Georgia, Athens, Georgia, United States of America.
The Helicobacter pylori flagellar motor contains several accessory structures that are not found in the archetypal Escherichia coli and Salmonella enterica motors. H. pylori hp0838 encodes a previously uncharacterized lipoprotein and is in an operon with flgP, which encodes a motor accessory protein.
View Article and Find Full Text PDFCleft Palate Craniofac J
January 2025
Center for Craniofacial Disorders, Children's Healthcare of Atlanta, Atlanta, GA, USA.
Evaluate predictors for attendance and adherence to speech evaluations and determine factors that influence longitudinal care for patients with cleft palate and craniofacial differences. Retrospective, observational cohort study. Tertiary children's hospital.
View Article and Find Full Text PDFSmall
January 2025
Department of Physics, Malaviya National Institute of Technology Jaipur, Rajasthan, 302017, India.
Increasing attention to sustainability and cost-effectiveness in energy storage sector has catalyzed the rise of rechargeable Zinc-ion batteries (ZIBs). However, finding replacement for limited cycle-life Zn-anode is a major challenge. Molybdenum disulfide (MoS), an insertion-type 2D layered material, has shown promising characteristics as a ZIB anode.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!