A control system for bipedal walking in the sagittal plane was developed in simulation. The biped model was built based on anthropometric data for a 1.8 m tall male of average build. At the core of the controller is a deep deterministic policy gradient (DDPG) neural network that was trained in GAZEBO, a physics simulator, to predict the ideal foot placement to maintain stable walking despite external disturbances. The complexity of the DDPG network was decreased through carefully selected state variables and a distributed control system. Additional controllers for the hip joints during their stance phases and the ankle joint during toe-off phase help to stabilize the biped during walking. The simulated biped can walk at a steady pace of approximately 1 m/s, and during locomotion it can maintain stability with a 30 kg·m/s impulse applied forward on the torso or a 40 kg·m/s impulse applied rearward. It also maintains stable walking with a 10 kg backpack or a 25 kg front pack. The controller was trained on a 1.8 m tall model, but also stabilizes models 1.4-2.3 m tall with no changes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6477666PMC
http://dx.doi.org/10.3390/biomimetics4010028DOI Listing

Publication Analysis

Top Keywords

deep deterministic
8
deterministic policy
8
bipedal walking
8
control system
8
stable walking
8
kg·m/s impulse
8
impulse applied
8
walking
5
implementation deep
4
policy gradients
4

Similar Publications

Although deterministic analysis can provide initial insights into slope stability, there is no way to reflect the true distribution of soil properties within a slope. To further investigate the effects of the spatial variability of soil properties on the stability and failure mechanism of slope under different foundation types, this study develops a framework combining simple limit equilibrium method (LEM), Monte Carlo Simulation (MCS), and random field to incorporate these factors into slope probabilistic stability analysis. The slope models of two typical foundations (e.

View Article and Find Full Text PDF

The multi-parameter and nonlinear characteristics of the Smith Watson Topper (SWT) equation present considerable challenges for predicting the fatigue life of 2024-T3 clad Al alloy. To overcome these challenges, a novel model integrating traditional fatigue analysis methods with machine learning algorithms is introduced. An improved SWT fatigue life prediction equation is developed by incorporating key factors such as the mean stress effect, stress concentration factor, and surface roughness coefficient.

View Article and Find Full Text PDF

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning.

Front Neurorobot

January 2025

Hebi Institute of Engineering and Technology, Henan Polytechnic University, Hebi, Henan, China.

Introduction: Path planning in complex and dynamic environments poses a significant challenge in the field of mobile robotics. Traditional path planning methods such as genetic algorithms, Dijkstra's algorithm, and Floyd's algorithm typically rely on deterministic search strategies, which can lead to local optima and lack global search capabilities in dynamic settings. These methods have high computational costs and are not efficient for real-time applications.

View Article and Find Full Text PDF

Conventional scanned optical coherence tomography (OCT) suffers from the frame rate/resolution tradeoff, whereby increasing image resolution leads to decreases in the maximum achievable frame rate. To overcome this limitation, we propose two variants of machine learning (ML)-based adaptive scanning approaches: one using a ConvLSTM-based sequential prediction model and another leveraging a temporal attention unit (TAU)-based parallel prediction model for scene dynamics prediction. These models are integrated with a kinodynamic path planner based on the clustered traveling salesperson problem to create two versions of ML-based adaptive scanning pipelines.

View Article and Find Full Text PDF

Uncertainty-Aware Multimodal Trajectory Prediction via a Single Inference from a Single Model.

Sensors (Basel)

January 2025

Seamless Trans-X Lab (STL), School of Integrated Technology, Yonsei University, Incheon 21983, Republic of Korea.

In the domain of autonomous driving, trajectory prediction plays a pivotal role in ensuring the safety and reliability of autonomous systems, especially when navigating complex environments. Unfortunately, trajectory prediction suffers from uncertainty problems due to the randomness inherent in the driving environment, but uncertainty quantification in trajectory prediction is not widely addressed, and most studies rely on deep ensembles methods. This study presents a novel uncertainty-aware multimodal trajectory prediction (UAMTP) model that quantifies aleatoric and epistemic uncertainties through a single forward inference.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!