Based on the neural network (NN) approximator, an online reinforcement learning algorithm is proposed for a class of affine multiple input and multiple output (MIMO) nonlinear discrete-time systems with unknown functions and disturbances. In the design procedure, two networks are provided where one is an action network to generate an optimal control signal and the other is a critic network to approximate the cost function. An optimal control signal and adaptation laws can be generated based on two NNs. In the previous approaches, the weights of critic and action networks are updated based on the gradient descent rule and the estimations of optimal weight vectors are directly adjusted in the design. Consequently, compared with the existing results, the main contributions of this paper are: 1) only two parameters are needed to be adjusted, and thus the number of the adaptation laws is smaller than the previous results and 2) the updating parameters do not depend on the number of the subsystems for MIMO systems and the tuning rules are replaced by adjusting the norms on optimal weight vectors in both action and critic networks. It is proven that the tracking errors, the adaptation laws, and the control inputs are uniformly bounded using Lyapunov analysis method. The simulation examples are employed to illustrate the effectiveness of the proposed algorithm.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2014.2360724DOI Listing

Publication Analysis

Top Keywords

adaptation laws
12
reinforcement learning
8
nonlinear discrete-time
8
mimo systems
8
optimal control
8
control signal
8
optimal weight
8
weight vectors
8
learning design-based
4
design-based adaptive
4

Similar Publications

Bioprinting, an innovative combination of biotechnology and additive manufacturing, has emerged as a transformative technology in healthcare, enabling the fabrication of functional tissues, organs, and patient-specific implants. The implementation of the aforementioned, however, introduces unique intellectual property (IP) challenges that extend beyond conventional biotechnology. The study explores three critical areas of concern: IP protection for bioprinting hardware and bioinks, ownership and ethical management of digital files derived from biological data, and the implications of commercializing bioprinted tissues and organs.

View Article and Find Full Text PDF

. Stringent regulations restricting tobacco access to those under 21 are in place, yet young people continue accessing tobacco products. This study aimed to assess the knowledge, opinions, resource utilization, and training needs of tobacco retailers in terms of preventing underage tobacco sales.

View Article and Find Full Text PDF

Baseline dependent differences in the perception of changes in visuomotor delay.

Front Hum Neurosci

January 2025

Center for Tactile Internet With Human-in-the-Loop, Technical University of Dresden, Dresden, Germany.

Introduction: The detection of, and adaptation to delayed visual movement feedback has been extensively studied. One important open question is whether the Weber-Fechner Laws hold in the domain of visuomotor delay; i.e.

View Article and Find Full Text PDF

The finite-element method (FEM) is a well-established procedure for computing approximate solutions to deterministic engineering problems described by partial differential equations. FEM produces discrete approximations of the solution with a discretisation error that can be quantified with a posteriori error estimates. The practical relevance of error estimates for biomechanics problems, especially for soft tissue where the response is governed by large strains, is rarely addressed.

View Article and Find Full Text PDF

Adaptive Fixed-time tracking control for large-scale nonlinear systems based on improved simplified optimized backstepping strategy.

ISA Trans

January 2025

College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China. Electronic address:

This paper investigates the optimal fixed-time tracking control problem for a class of nonstrict-feedback large-scale nonlinear systems with prescribed performance. In the process of optimal control design, the new critic and actor neural network updating laws are proposed by adopting the fixed-time technique and the simplified reinforcement learning algorithm, which both guarantee the simplified optimal control algorithm and accelerate the convergence rate. Furthermore, the prescribed performance method is contemplated simultaneously, which ensures tracking errors can converge within the prescribed performance bounds in fixed time.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!