Robotic reinforcement learning is hampered by problems such as unspecified reward functions and high training costs. Many previous works have used cross-domain policy transfer to obtain a policy for the problem domain. However, these approaches require paired and aligned dynamics trajectories or additional interactions with the environment. We propose a cross-domain dynamics alignment framework that transfers a policy trained in the source domain to the problem domain. The framework learns dynamics alignment across two domains that differ in the agents' physical parameters (armature, rotation range, or torso mass) or in the agents' morphologies (limbs). Most importantly, it learns this alignment from unpaired and unaligned dynamics trajectories. For these two scenarios, we propose a cross-physics-domain policy adaptation algorithm (CPD) and a cross-morphology-domain policy adaptation algorithm (CMD), both based on our cross-domain dynamics alignment framework. To improve the source-domain policy so that a better policy can be transferred to the problem domain, we propose the Boltzmann TD3 (BTD3) algorithm. We conduct diverse experiments on continuous-control domains to demonstrate the performance of our approaches. The results show that our approaches obtain better policies and higher rewards for the agents in the problem domains, even when the problem-domain dataset is small.

Source
http://dx.doi.org/10.1016/j.neunet.2023.08.025
