We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ the kernelized Lipschitz estimation to learn multiplier matrices that are used in semidefinite programming frameworks for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initialization and constraint enforcement while providing exponential stability of the equilibrium of the closed-loop system.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2020.2978805DOI Listing

Publication Analysis

Top Keywords

approximate dynamic
8
dynamic programming
8
kernelized lipschitz
8
lipschitz estimation
8
safe approximate
4
programming kernelized
4
estimation develop
4
develop method
4
method obtaining
4
obtaining safe
4

Similar Publications

Today, huge amounts of time series data are sensed continuously by AIoT devices, transmitted to edge nodes, and to data centers. It costs a lot of energy to transmit these data, store them, and process them. Data compression technologies are commonly used to reduce the data size and thus save energy.

View Article and Find Full Text PDF

Investigating the physiological mechanisms in the motor cortex during rehabilitation exercises is crucial for assessing stroke patients' progress. This study developed a single-channel Jansen neural mass model to explore the relationship between model parameters and motor cortex mechanisms. Firstly, EEG signals were recorded from 11 healthy participants under 20%, 40%, and 60% maximum voluntary contraction, and alpha rhythm power spectral density characteristics were extracted using the Welch power spectrum method.

View Article and Find Full Text PDF

Analysis of the internal motions of thermoresponsive polymers and single chain nanoparticles.

Soft Matter

January 2025

Department of Macromolecular Science and Engineering, Case Western Reserve University, 10900 Euclid Ave., Cleveland, OH 44122, USA.

Data-driven techniques, such as proper orthogonal decomposition (POD) and uniform manifold approximation & projection (UMAP), are powerful methods for understanding polymer behavior in complex systems that extend beyond ideal conditions. They are based on the principle that low-dimensional behaviors are often embedded within the structure and dynamics of complex systems. Here, the internal motions of a thermoresponsive, LCST polymer are investigated for two cases: (1) the coil-to-globule transition that occurs as the system is heated above its critical temperature and (2) intramolecularly crosslinked, single chain nanoparticles (SCNPs) both above and below the critical temperature ().

View Article and Find Full Text PDF

Emergent symmetries in prethermal phases of periodically driven quantum systems.

J Phys Condens Matter

January 2025

School of Physical Sciences, Indian Association for the Cultivation of Science, 2A & 2B Raja S.C. Mullick Road, Jadavpur, Kolkata, Kolkata, West Bengal, 700032, INDIA.

Periodically driven closed quantum systems are expected to eventually heat up to infinite temperature ; reaching a steady state described by a circular orthogonal ensemble (COE). However, such finite driven systems may exhibit sufficiently long prethermal regimes; their properties in these regimes are qualitatively different from that of their corresponding infinite temperature steady states. These, often experimentally relevant, prethermal regimes host a wide range of phenomena; they may exhibit dynamical localization and freezing, host Floquet scars, display signatures of Hilbert space fragmentation, and exhibit time crystalline phases.

View Article and Find Full Text PDF

Selecting Initial Conditions for Trajectory-Based Nonadiabatic Simulations.

Acc Chem Res

January 2025

Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol BS8 1TS, United Kingdom.

ConspectusPhotochemical reactions have always been the source of a great deal of mystery. While classified as a type of chemical reaction, no doubts are allowed that the general tenets of ground-state chemistry do not directly apply to photochemical reactions. For a typical chemical reaction, understanding the critical points of the ground-state potential (free) energy surface and embedding them in a thermodynamics framework is often enough to infer reaction yields or characteristic time scales.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!