Achieving human-level dexterity in robotics remains a critical open problem. Even simple dexterous manipulation tasks are difficult because of the high number of degrees of freedom and the need for cooperation among heterogeneous agents (e.g., finger joints). While some researchers have used reinforcement learning (RL) to control a single hand manipulating objects, tasks that require coordinated bimanual cooperation remain under-explored because few suitable environments exist, which can hinder progress and lead to sub-optimal performance. To address these challenges, we introduce Bi-DexHands, a simulator with two dexterous hands featuring 20 bimanual manipulation tasks and thousands of target objects, designed to match various levels of human motor skills based on cognitive science research. We developed Bi-DexHands in Isaac Gym, enabling highly efficient RL training at over 30,000 frames per second on a single NVIDIA RTX 3090. Based on Bi-DexHands, we present a comprehensive evaluation of popular RL algorithms in different settings, including single-agent and multi-agent RL, offline RL, multi-task RL, and meta RL. Our findings show that on-policy algorithms, such as PPO, can master simple manipulation tasks that correspond to those of 48-month-old children, such as catching a flying object or opening a bottle. Furthermore, multi-agent RL can improve performance on manipulations that require skilled bimanual cooperation, such as lifting a pot or stacking blocks. Despite succeeding on individual tasks, current RL algorithms struggle to learn multiple manipulation skills in most multi-task and few-shot learning scenarios. This highlights the need for further research and development within the RL community.
DOI: http://dx.doi.org/10.1109/TPAMI.2023.3339515