We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ the kernelized Lipschitz estimation to learn multiplier matrices that are used in semidefinite programming frameworks for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initialization and constraint enforcement while providing exponential stability of the equilibrium of the closed-loop system.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2020.2978805 | DOI Listing |
Sensors (Basel)
December 2024
Department of Information Management, Tunghai University, Taichung 407224, Taiwan.
Today, huge amounts of time series data are sensed continuously by AIoT devices, transmitted to edge nodes, and to data centers. It costs a lot of energy to transmit these data, store them, and process them. Data compression technologies are commonly used to reduce the data size and thus save energy.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, Beijing 100032, China.
Investigating the physiological mechanisms in the motor cortex during rehabilitation exercises is crucial for assessing stroke patients' progress. This study developed a single-channel Jansen neural mass model to explore the relationship between model parameters and motor cortex mechanisms. Firstly, EEG signals were recorded from 11 healthy participants under 20%, 40%, and 60% maximum voluntary contraction, and alpha rhythm power spectral density characteristics were extracted using the Welch power spectrum method.
View Article and Find Full Text PDFSoft Matter
January 2025
Department of Macromolecular Science and Engineering, Case Western Reserve University, 10900 Euclid Ave., Cleveland, OH 44122, USA.
Data-driven techniques, such as proper orthogonal decomposition (POD) and uniform manifold approximation & projection (UMAP), are powerful methods for understanding polymer behavior in complex systems that extend beyond ideal conditions. They are based on the principle that low-dimensional behaviors are often embedded within the structure and dynamics of complex systems. Here, the internal motions of a thermoresponsive, LCST polymer are investigated for two cases: (1) the coil-to-globule transition that occurs as the system is heated above its critical temperature and (2) intramolecularly crosslinked, single chain nanoparticles (SCNPs) both above and below the critical temperature ().
View Article and Find Full Text PDFJ Phys Condens Matter
January 2025
School of Physical Sciences, Indian Association for the Cultivation of Science, 2A & 2B Raja S.C. Mullick Road, Jadavpur, Kolkata, Kolkata, West Bengal, 700032, INDIA.
Periodically driven closed quantum systems are expected to eventually heat up to infinite temperature ; reaching a steady state described by a circular orthogonal ensemble (COE). However, such finite driven systems may exhibit sufficiently long prethermal regimes; their properties in these regimes are qualitatively different from that of their corresponding infinite temperature steady states. These, often experimentally relevant, prethermal regimes host a wide range of phenomena; they may exhibit dynamical localization and freezing, host Floquet scars, display signatures of Hilbert space fragmentation, and exhibit time crystalline phases.
View Article and Find Full Text PDFAcc Chem Res
January 2025
Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol BS8 1TS, United Kingdom.
ConspectusPhotochemical reactions have always been the source of a great deal of mystery. While classified as a type of chemical reaction, no doubts are allowed that the general tenets of ground-state chemistry do not directly apply to photochemical reactions. For a typical chemical reaction, understanding the critical points of the ground-state potential (free) energy surface and embedding them in a thermodynamics framework is often enough to infer reaction yields or characteristic time scales.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!