In this article, an optimized backstepping (OB) control scheme is proposed for a class of stochastic nonlinear strict-feedback systems with unknown dynamics by using reinforcement learning (RL) strategy of identifier-critic-actor architecture, where the identifier aims to compensate the unknown dynamic, the critic aims to evaluate the control performance and to give the feedback to the actor, and the actor aims to perform the control action. The basic control idea is that all virtual controls and the actual control of backstepping are designed as the optimized solution of corresponding subsystems so that the entire backstepping control is optimized. Different from the deterministic system, stochastic system control needs to consider not only the stochastic disturbance depicted by the Wiener process but also the Hessian term in stability analysis. If the backstepping control is developed on the basis of the published RL optimization methods, it will be difficult to be achieved because, on the one hand, RL of these methods are very complex in the algorithm thanks to their critic and actor updating laws deriving from the negative gradient of the square of approximation of Hamilton-Jacobi-Bellman (HJB) equation; on the other hand, these methods require persistence excitation and known dynamic, where persistence excitation is for training adaptive parameters sufficiently. In this research, both critic and actor updating laws are derived from the negative gradient of a simple positive function, which is yielded on the basis of a partial derivative of the HJB equation. As a result, the RL algorithm can be significantly simplified, meanwhile, two requirements of persistence excitation and known dynamic can be released. Therefore, it can be a natural selection for stochastic optimization control. Finally, from two aspects of theory and simulation, it is demonstrated that the proposed control can arrive at the desired system performance.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2021.3105176 | DOI Listing |
An Acad Bras Cienc
January 2025
University of Technology, Department of Control and System Engineering, Baghdad, 10066, Iraq.
Latency in flux observation has an adverse effect on the performance of observer-based field-oriented speed control for three-phase induction motor (IM). The reduction of the convergent rate of estimation errors could improve the performance of speed-controlled IM based on flux observers. The main contribution is to design a fast convergent flux observer, which provides bounded estimation error immediately after one instant of motor startup.
View Article and Find Full Text PDFISA Trans
January 2025
College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China. Electronic address:
This paper investigates the optimal fixed-time tracking control problem for a class of nonstrict-feedback large-scale nonlinear systems with prescribed performance. In the process of optimal control design, the new critic and actor neural network updating laws are proposed by adopting the fixed-time technique and the simplified reinforcement learning algorithm, which both guarantee the simplified optimal control algorithm and accelerate the convergence rate. Furthermore, the prescribed performance method is contemplated simultaneously, which ensures tracking errors can converge within the prescribed performance bounds in fixed time.
View Article and Find Full Text PDFISA Trans
December 2024
State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110169, China. Electronic address:
For Electro-Hydraulic Actuators (EHA) with parametric uncertainties and mismatched and matched disturbances, most existing robust adaptive control strategies can achieve only uniformly ultimately bounded tracking errors. An Extended-State-Observer (ESO) based asymptotic control scheme is proposed by incorporating the prescribed performance control into the backstepping framework to ensure satisfied tracking performance and anti-disturbance ability of EHA systems. A novel ESO is designed to acquire an asymptotic estimation without prior bounds of the mismatched disturbance and its derivatives.
View Article and Find Full Text PDFSensors (Basel)
December 2024
School of Mathematics and Statistics, Guangdong University of Technology, Guangzhou 510006, China.
This paper proposes the fixed-time prescribed performance optimal consensus control method for stochastic nonlinear multi-agent systems with sensor faults. The consensus error converges to the prescribed performance bounds in fixed-time by an improved performance function and coordinate transformation. Due to the unknown faults in sensors, the system states cannot be gained correctly; therefore, an adaptive compensation strategy is constructed based on the approximation capabilities of neural networks to solve the negative impact of sensor failures.
View Article and Find Full Text PDFPolymers (Basel)
December 2024
School of Mechanical and Electronic Engineering, Northeastern University, Shenyang 110819, China.
In this study, a fuzzy adaptive impedance control method integrating the backstepping control for the PAM elbow exoskeleton was developed to facilitate robot-assisted rehabilitation tasks. The proposed method uses fuzzy logic to adjust impedance parameters, thereby optimizing user adaptability and reducing interactive torque, which are major limitations of traditional impedance control methods. Furthermore, a repetitive learning algorithm and an adaptive control strategy were incorporated to improve the performance of position accuracy, addressing the time-varying uncertainties and nonlinear disturbances inherent in the exoskeleton.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!