This article focuses on solving the secure control problem by developing a novel resilient hybrid learning scheme for discrete-time Markov jump cyber-physical systems with malicious attacks. Within the zero-sum game framework, the secure control problem is converted into solving a set of game coupled algebraic Riccati equations. However, it contains the coupling terms arising from the Markov jump parameters, which are difficult to solve. To address this issue, we propose a framework for parallel reinforcement learning. Thereafter, a model-based resilient hybrid learning scheme is first designed to obtain the optimal policies, where the system dynamics are required during the learning process. Furthermore, a novel online model-free resilient hybrid learning scheme combining the advantages of value iteration and policy iteration is proposed without using the system dynamics. Besides, the convergence of the proposed hybrid learning schemes is discussed. Eventually, the effectiveness of the designed algorithms is demonstrated with the inverted pendulum model.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCYB.2024.3448407 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!