Efficient Reinforcement Learning for 3D Jumping Monopods.

Sensors (Basel)

Dipartimento di Ingegneria e Scienza dell'Informazione (DISI), University of Trento, 38123 Trento, Italy.

Published: August 2024

We consider a complex control problem: making a monopod accurately reach a target with a single jump. The monopod can jump in any direction, and the terrain can be at different elevations. This problem is representative of a much larger class of problems that are extremely challenging and computationally expensive to solve using standard optimization-based techniques. Reinforcement learning (RL) is an interesting alternative, but an end-to-end approach, in which the controller must learn everything from scratch, is difficult to apply to a sparse-reward task like jumping. Our solution is to guide the learning process within an RL framework by leveraging nature-inspired heuristic knowledge. This brings several benefits, including a drastic reduction in learning time and the ability to learn and compensate for possible errors in the low-level execution of the motion. Our simulation results show a clear advantage of our solution over both optimization-based and end-to-end RL approaches.
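To make the jumping task concrete, the following is a minimal sketch of the drag-free ballistic relation that links a lift-off state to a landing point. It is not taken from the paper: the function names, the z-up frame, the fixed flight time, and the numeric values are all assumptions chosen for illustration. It only shows why, once the foot leaves the ground, the landing point is fully determined by the position and velocity at lift-off.

```python
import numpy as np

# Assumption: z axis points up, gravity is constant, no air drag.
G = np.array([0.0, 0.0, -9.81])  # m/s^2


def liftoff_velocity(p_liftoff, p_target, flight_time):
    """Velocity needed at lift-off to reach p_target after flight_time seconds
    of purely ballistic flight (hypothetical helper, not from the paper)."""
    if flight_time <= 0:
        raise ValueError("flight_time must be positive")
    p_liftoff = np.asarray(p_liftoff, dtype=float)
    p_target = np.asarray(p_target, dtype=float)
    # Solve p_target = p_liftoff + v*T + 0.5*G*T^2 for v.
    return (p_target - p_liftoff) / flight_time - 0.5 * G * flight_time


def ballistic_position(p_liftoff, v_liftoff, t):
    """Position after t seconds of ballistic flight from the lift-off state."""
    p_liftoff = np.asarray(p_liftoff, dtype=float)
    v_liftoff = np.asarray(v_liftoff, dtype=float)
    return p_liftoff + v_liftoff * t + 0.5 * G * t**2


# Illustrative values only: jump sideways and down onto lower terrain.
start = np.array([0.0, 0.0, 0.25])
target = np.array([0.3, -0.2, 0.10])
T = 0.4  # assumed flight time in seconds

v0 = liftoff_velocity(start, target, T)
landing = ballistic_position(start, v0, T)
landing_error = np.linalg.norm(landing - target)
```

In this idealized setting `landing_error` is zero up to floating-point rounding. On a real or simulated robot the lift-off state is reached only approximately, which is the kind of low-level execution error the abstract says the learned controller can compensate for.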

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11314636
DOI: http://dx.doi.org/10.3390/s24154981
