Deep reinforcement learning trajectory planning for robotic manipulator based on simulation-efficient training.

Bin Zhao Yao Wu Chengdong Wu Ruohuai Sun

Sci Rep

School of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.

Published: March 2025

The paper proposes a new M2ACD(Multi-Actor-Critic Deep Deterministic Policy Gradient) algorithm to apply trajectory planning of the robotic manipulator in complex environments. First, the paper presents a general inverse kinematics algorithm that transforms the inverse kinematics problem into a general Newton-MP iterative method. The M2ACD algorithm based on multiple actors and critics is structured. The dual-actor network reduces the overestimation of action values, minimizes the correlation between the actor and value networks, and mitigates instability during the actor's selection process caused by excessively high Q-values. The dual-critic network reduces the estimation bias of Q-values, ensuring more reliable action selection and enhancing the stability of Q-value estimation. Secondly, The robotic manipulator's TSR (two-stage reward) strategy is designed and divided into the approach and close. Rewards in the approach phase focuses on safely and efficiently approaching the target, and rewards in the close phase involves final adjustments before contact is made with the target. Thirdly, to solve the position hopping jitter problem in traditional reinforcement learning trajectory planning, the NURBS(Non-Uniform Rational B-Splines) curve is used to smooth the hopping trajectory generated by M2ACD. Finally, the correctness of the M2ACD and the kinematics algorithm is verified by experiments. The M2ACD algorithm demonstrated superior curve smoothing, convergence stability and convergence speed compared to the TD3, DARC and DDPG algorithms. The M2ACD algorithm can be effectively applied to collaborative robots' trajectory planning, establishing a foundation for subsequent research.

Download full-text PDF	Source
http://dx.doi.org/10.1038/s41598-025-93175-2	DOI Listing

Publication Analysis

Top Keywords

trajectory planning

m2acd algorithm

reinforcement learning

learning trajectory

planning robotic

robotic manipulator

inverse kinematics

kinematics algorithm

network reduces

algorithm

Similar Publications

Deep reinforcement learning trajectory planning for robotic manipulator based on simulation-efficient training.

Sci Rep

March 2025

School of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.

Bin Zhao Yao Wu Chengdong Wu Ruohuai Sun

View Article and Find Full Text PDF

Similar Publications

Human-Robot Shared Control for Osteotomy Procedure.

IEEE Trans Biomed Eng

March 2025

Elisa Iovene Lorenzo Casadio Junling Fu Francesco Costa Giancarlo Ferrigno

Spinal intervention can benefit from advancements in robotic systems, particularly in the field of Human-Robot Interaction (HRI). Despite the promising potential of these technologies, their integration into spine surgeries remains relatively limited, comprising mainly only selected procedures. Meanwhile, complex and time-consuming procedures, such as osteotomy, continue to be performed manually, significantly impacting surgeon workload and stress.

View Article and Find Full Text PDF

Similar Publications

Comparative Analysis of Efficacy and Safety of Frame-Based, Frameless, and Robot-Assisted Stereotactic Brain Biopsies: A Systematic Review and Meta-Analysis.

Oper Neurosurg (Hagerstown)

November 2024

Department of Neurological Surgery, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, USA.

Neslihan Nisa Gecici N U Farrukh Hameed Ahmed Habib Hansen Deng L Dade Lunsford

Background And Objectives: For 50 years, frame-based stereotactic brain biopsy has been the "gold standard" for its high diagnostic yield and safety, especially for complex or deep-seated lesions. Over the past decade, frameless and robotic alternatives have emerged. This report evaluates and compares the outcomes, diagnostic yield, and safety of these methods.

View Article and Find Full Text PDF

Similar Publications

Developing capacity in identifying cost-effective interventions to prevent and reduce obesity in China.

Glob Health Action

December 2025

Health Science Center, Peking University, Beijing, China.

Angela M Jackson-Morris Suying Chang Christina L Meyer Guansheng Ma

Obesity is associated with multiple noncommunicable diseases and has increased rapidly worldwide. Population obesity in China grew fourfold between 1993 and 2015, increasing most rapidly among children and adolescents. Cost-effective policies and programs delivered over time and at scale are required to change this trajectory, yet application of methodologies to identify such interventions have been sparse.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!