This article is to propose an adaptive reinforcement learning (RL)-based optimized distributed formation control for the unknown stochastic nonlinear single-integrator dynamic multi-agent system (MAS). For solving the issue of unknown dynamic, an adaptive identifier neural network (NN) is developed to determine the stochastic MAS under expectation sense. And then, for deriving the optimized formation control, the RL is putted into effect via constructing a pair of critic and actor NNs.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
June 2022
In this article, the game-based backstepping control method is proposed for the high-order nonlinear multi-agent system with unknown dynamic and input saturation. Reinforcement learning (RL) is employed to get the saddle point solution of the tracking game between each agent and the reference signal for achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
March 2024
This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
March 2023
In this article, an optimized leader-following consensus control scheme is proposed for the nonlinear strict-feedback-dynamic multi-agent system by learning from the controlling idea of optimized backstepping technique, which designs the virtual and actual controls of backstepping to be the optimized solution of corresponding subsystems so that the entire backstepping control is optimized. Since this control needs to not only ensure the optimizing system performance but also synchronize the multiple system state variables, it is an interesting and challenging topic. In order to achieve this optimized control, the neural network approximation-based reinforcement learning (RL) is performed under critic-actor architecture.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
March 2023
In this article, an optimized backstepping (OB) control scheme is proposed for a class of stochastic nonlinear strict-feedback systems with unknown dynamics by using reinforcement learning (RL) strategy of identifier-critic-actor architecture, where the identifier aims to compensate the unknown dynamic, the critic aims to evaluate the control performance and to give the feedback to the actor, and the actor aims to perform the control action. The basic control idea is that all virtual controls and the actual control of backstepping are designed as the optimized solution of corresponding subsystems so that the entire backstepping control is optimized. Different from the deterministic system, stochastic system control needs to consider not only the stochastic disturbance depicted by the Wiener process but also the Hessian term in stability analysis.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
February 2022
In this article, an adaptive optimized control scheme based on neural networks (NNs) is developed for a class of perturbed strict-feedback nonlinear systems. An optimized backstepping (OB) technique is employed for breaking through the limitation of the matching condition. The disturbance of existing nonlinear systems may degrade system performance or even lead to instability.
View Article and Find Full Text PDFIn this article, a control scheme based on optimized backstepping (OB) technique is developed for a class of nonlinear strict-feedback systems with unknown dynamic functions. Reinforcement learning (RL) is employed for achieving the optimized control, and it is designed on the basis of the neural-network (NN) approximations under identifier-critic-actor architecture, where the identifier, critic, and actor are utilized for estimating the unknown dynamic, evaluating the system performance, and implementing the control action, respectively. OB control is to design all virtual controls and the actual control of backstepping to be the optimized solutions of corresponding subsystems.
View Article and Find Full Text PDFThis paper presents an adaptive neural network output feedback control method for stochastic nonlinear systems with full state constraints. The barrier Lyapunov functions are used to conquer the effect of state constraints to system performance. The neural network state observer is established to estimate the unmeasured states.
View Article and Find Full Text PDFIEEE Trans Cybern
September 2019
In this paper, a tracking control approach for surface vessel is developed based on the new control technique named optimized backstepping (OB), which considers optimization as a backstepping design principle. Since surface vessel systems are modeled by second-order dynamic in strict feedback form, backstepping is an ideal technique for finishing the tracking task. In the backstepping control of surface vessel, the virtual and actual controls are designed to be the optimized solutions of corresponding subsystems, therefore the overall control is optimized.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
August 2018
In this paper, a control technique named optimized backstepping is first proposed by implementing tracking control for a class of strict-feedback systems, which considers optimization as a design philosophy of the high-order system control. The basic idea is that designing the actual and virtual controls of backstepping is the optimized solutions of the corresponding subsystems so that overall control of the high-order system is optimized. In general, optimization control is designed based on the solution of Hamilton-Jacobi-Bellman equation, but solving the equation is very difficult due to the inherent nonlinearity and intractability.
View Article and Find Full Text PDFCompared with the existing neural network (NN) or fuzzy logic system (FLS) based adaptive consensus methods, the proposed approach can greatly alleviate the computation burden because it needs only to update a few adaptive parameters online. In the multiagent agreement control, the system uncertainties derive from the unknown nonlinear dynamics are counteracted by employing the adaptive NNs; the state delays are compensated by designing a Lyapunov-Krasovskii functional. Finally, based on Lyapunov stability theory, it is demonstrated that the proposed consensus scheme can steer a multiagent system synchronizing to the predefined reference signals.
View Article and Find Full Text PDFCombined with backstepping techniques, an observer-based adaptive consensus tracking control strategy is developed for a class of high-order nonlinear multiagent systems, of which each follower agent is modeled in a semi-strict-feedback form. By constructing the neural network-based state observer for each follower, the proposed consensus control method solves the unmeasurable state problem of high-order nonlinear multiagent systems. The control algorithm can guarantee that all signals of the multiagent system are semi-globally uniformly ultimately bounded and all outputs can synchronously track a reference signal to a desired accuracy.
View Article and Find Full Text PDFThis paper studies an adaptive tracking control for a class of nonlinear stochastic systems with unknown functions. The considered systems are in the nonaffine pure-feedback form, and it is the first to control this class of systems with stochastic disturbances. The fuzzy-neural networks are used to approximate unknown functions.
View Article and Find Full Text PDFThis brief studies an adaptive neural output feedback tracking control of uncertain nonlinear multi-input-multi-output (MIMO) systems in the discrete-time form. The considered MIMO systems are composed of n subsystems with the couplings of inputs and states among subsystems. In order to solve the noncausal problem and decouple the couplings, it needs to transform the systems into a predictor form.
View Article and Find Full Text PDFBackground: In China recruitment and retention of sufficient numbers of safe blood donors continues to be a challenge. Understanding who donates blood, particularly those who donate larger (>200 mL) whole blood (WB) units, will help blood centers to target more effective recruitment and retention strategies.
Study Design And Methods: Demographic characteristics of 226,489 allogeneic WB donors from January to December 2008 at five geographically and ethnically diverse, urban blood centers were analyzed.
Background: A multi-blood center study was conducted to evaluate a human immunodeficiency virus type 1 (HIV-1) and hepatitis C virus (HCV) multiplex nucleic acid testing (NAT) donor screening test and to determine the residual risk for HIV-1 and HCV infection.
Study Design And Methods: A commercially available HIV-1 and HCV assay (Procleix, Chiron Corp.) was used for simultaneous detection of HIV-1 RNA and HCV RNA on 89,647 unlinked donor samples.
Background: The recruitment and retention of voluntary, nonremunerated blood donors continues to be a challenge in China. Understanding donor demographics and donor characteristics is crucial for any blood center in developing strategies to recruit potential donors.
Study Design And Methods: The study population included all 29,784 whole blood donors from January 1 to December 31, 2003, at the Urumqi City Blood Center or one of its mobile blood collection buses.