In this article, the pose regulation control problem of a robotic fish is investigated by formulating it as a Markov decision process (MDP). This task, which requires the robot to arrive at a desired position with a desired orientation, remains challenging because the two objectives (position and orientation) may conflict during optimization. To handle this challenge, we adopt a sparse reward scheme, i.e., the robot is rewarded if and only if it completes the pose regulation task. Although deep reinforcement learning (DRL) can solve such an MDP with sparse rewards, the absence of immediate rewards hinders the robot from learning efficiently. To this end, we propose a novel imitation learning (IL) method that learns DRL-based policies from demonstrations with inverse reward shaping to overcome the difficulty raised by extremely sparse rewards. Moreover, we design a demonstrator that generates various trajectory demonstrations from one simple example provided by a nonexpert helper, which greatly reduces the time spent collecting robot samples. The simulation results demonstrate the effectiveness of the proposed demonstrator and the state-of-the-art (SOTA) performance of the proposed IL method. Furthermore, we deploy the trained IL policy on a physical robotic fish to perform pose regulation in a swimming tank, both with and without external disturbances. The experimental results verify the effectiveness and robustness of the proposed methods in the real world. Therefore, we believe this article is a step forward in the field of biomimetic underwater robot learning.
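
To make the sparse reward scheme concrete, the following is a minimal sketch rather than the authors' implementation. It assumes a planar pose `(x, y, yaw)` and hypothetical tolerance parameters `pos_tol` and `yaw_tol`; the abstract specifies neither the pose representation nor the success thresholds. The reward is nonzero only when both the position and orientation objectives are met at the same time.

```python
import math

def angle_diff(a, b):
    """Smallest signed difference between two angles, in radians."""
    return (a - b + math.pi) % (2 * math.pi) - math.pi

def sparse_pose_reward(pose, goal, pos_tol=0.05, yaw_tol=math.radians(10)):
    """Return (reward, done) for a planar pose regulation task.

    pose, goal: (x, y, yaw) tuples; the layout is an assumption.
    pos_tol, yaw_tol: hypothetical success tolerances (metres, radians).
    """
    x, y, yaw = pose
    gx, gy, gyaw = goal
    pos_err = math.hypot(x - gx, y - gy)        # Euclidean position error
    yaw_err = abs(angle_diff(yaw, gyaw))        # wrapped orientation error
    done = pos_err <= pos_tol and yaw_err <= yaw_tol
    # Sparse scheme: reward 1 only on task completion, 0 otherwise.
    return (1.0 if done else 0.0), done
```

Because every intermediate step returns zero, a learner receives no gradient of progress toward the goal, which is the learning-efficiency problem the abstract attributes to sparse rewards.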

Source
http://dx.doi.org/10.1109/TNNLS.2022.3202075

