Achieving human-level dexterity in robotics remains a critical open problem. Even simple dexterous manipulation tasks are difficult because of the high number of degrees of freedom and the need for cooperation among heterogeneous agents (e.g., finger joints). While some researchers have used reinforcement learning (RL) to control a single hand manipulating objects, tasks that require coordinated bimanual cooperation remain under-explored because few suitable environments exist, which makes training difficult and leads to sub-optimal performance. To address these challenges, we introduce Bi-DexHands, a simulator with two dexterous hands featuring 20 bimanual manipulation tasks and thousands of target objects, designed to match various levels of human motor skills based on cognitive science research. We developed Bi-DexHands in Isaac Gym, enabling highly efficient RL training at over 30,000 frames per second on a single NVIDIA RTX 3090. Based on Bi-DexHands, we present a comprehensive evaluation of popular RL algorithms in different settings, including single-agent/multi-agent RL, offline RL, multi-task RL, and meta RL. Our findings show that on-policy algorithms, such as PPO, can master simple manipulation tasks that correspond to those of 48-month-old babies, such as catching a flying object or opening a bottle. Furthermore, multi-agent RL can improve performance on manipulations that require skilled bimanual cooperation, such as lifting a pot or stacking blocks. Despite succeeding on individual tasks, current RL algorithms struggle to learn multiple manipulation skills in most multi-task and few-shot learning scenarios. This highlights the need for further research and development within the RL community.

Source
http://dx.doi.org/10.1109/TPAMI.2023.3339515

Publication Analysis

Top Keywords

manipulation tasks: 12
dexterous manipulation: 8
bimanual cooperation: 8
manipulation: 5
tasks: 5
bi-dexhands: 4
bi-dexhands human-level: 4
bimanual: 4
human-level bimanual: 4
bimanual dexterous: 4

Similar Publications

Purpose: Wearable electronic low vision enhancement systems (wEVES) improve visual function but are not widely adopted by people with vision impairment. Here, qualitative research methods were used to investigate the usefulness of wEVES for people with age-related macular degeneration (AMD) after an extended home trial.

Methods: Following a 12-week non-masked randomised crossover trial, semi-structured interviews were completed with 34 participants with AMD, 64.

This paper presents a "digital twin" (DT) approach for processing, reprocessing, and scrapping (P/R/S) technology running on a modular production system (MPS) assisted by a mobile cyber-physical robotic system (MCPRS). The main hardware architecture consists of four line-shaped workstations (WSs) and a wheeled mobile robot (WMR) equipped with a robotic manipulator (RM) and a mobile visual servoing system (MVSS) mounted on the end effector. The system architecture integrates a hierarchical control system in which each of the four WSs in the MPS is controlled by a Programmable Logic Controller (PLC), all connected via Profibus DP to a central PLC.

Body image concerns are key prognostic and pathogenic factors of anorexia nervosa (AN) and bulimia nervosa (BN). This study aimed to investigate the neural mechanisms underlying body image perception across its two domains, estimation and satisfaction, in patients with AN or BN and in healthy controls (HC). Systematic searches were conducted across eight databases (PubMed, Cochrane Library, Ovid, Google Scholar, Sage Journals, Scopus, PsycInfo, and ScienceDirect) from database inception until 23 April 2023.

An Efficient 3D Convolutional Neural Network for Dose Prediction in Cancer Radiotherapy from CT Images.

Diagnostics (Basel)

January 2025

Institute of Information Technology, Vietnam Academy of Science and Technology, Hoang Quoc Viet, Hanoi 10072, Vietnam.

Cancer is a highly lethal disease with a high mortality rate. One of the most commonly used methods for treatment is radiation therapy. However, cancer treatment using radiotherapy is a time-consuming process that requires significant manual work from planners and doctors.

Delay discounting (DD) describes the tendency of individuals to devalue the worth of a reward as a function of the delay in receiving it. DD is impaired in many clinical conditions and changes across development. Many existing automated DD tasks are built on copyrighted software and primarily designed for English speakers, which hinders content editing and accessibility.

