Publications by Boris Belousov

Continuous-Time Fitted Value Iteration for Robust Policies.

Michael Lutter, Boris Belousov, Shie Mannor, Dieter Fox, Animesh Garg

IEEE Trans Pattern Anal Mach Intell

May 2023

Solving the Hamilton-Jacobi-Bellman equation is important in many domains, including control, robotics, and economics. Especially for continuous control, solving this differential equation and its extension, the Hamilton-Jacobi-Isaacs equation, is important because it yields the optimal policy that achieves the maximum reward on a given task. In the case of the Hamilton-Jacobi-Isaacs equation, which includes an adversary that controls the environment and minimizes the reward, the obtained policy is also robust to perturbations of the dynamics.
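
For orientation, the two equations named in this abstract can be written in one common infinite-horizon, discounted form. The notation here is an assumption made for illustration and is not taken from the listing: state x, action u, adversarial disturbance ξ, dynamics f, reward r, discount rate ρ, and optimal value function V*.

```latex
% Hamilton-Jacobi-Bellman equation (agent maximizes the reward):
\rho \, V^{*}(x) = \max_{u} \left[ r(x, u) + f(x, u)^{\top} \nabla_{x} V^{*}(x) \right]

% Hamilton-Jacobi-Isaacs equation (adversary \xi minimizes the reward):
\rho \, V^{*}(x) = \max_{u} \min_{\xi} \left[ r(x, u) + f(x, u, \xi)^{\top} \nabla_{x} V^{*}(x) \right]
```

In both cases the optimal policy is the maximizing action u at each state x, which is why solving the equation yields the policy directly.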

Entropic Regularization of Markov Decision Processes.

Boris Belousov, Jan Peters

Entropy (Basel)

July 2019

An optimal feedback controller for a given Markov decision process (MDP) can in principle be synthesized by value or policy iteration. However, if the system dynamics and the reward function are unknown, a learning agent must discover an optimal controller via direct interaction with the environment. Such interactive data gathering commonly leads to divergence towards dangerous or uninformative regions of the state space unless additional regularization measures are taken.
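
As a point of reference for the first sentence of this abstract, the following is a minimal sketch of tabular value iteration for a small MDP whose dynamics and rewards are known. It is not the method of the paper; the function name `value_iteration`, the array layout `P[s][a][t]` and `R[s][a]`, and the two-state example are assumptions chosen for illustration.

```python
def value_iteration(P, R, gamma=0.9, tol=1e-10, max_iter=10_000):
    """Tabular value iteration for a finite MDP with known dynamics.

    P[s][a][t] is the probability of moving from state s to state t
    under action a; R[s][a] is the immediate reward (hypothetical layout).
    Returns the value function V and a greedy policy.
    """
    n_states = len(R)
    n_actions = len(R[0])
    V = [0.0] * n_states

    def q_value(s, a, values):
        # One-step lookahead: reward plus discounted expected next value.
        expected_next = sum(P[s][a][t] * values[t] for t in range(n_states))
        return R[s][a] + gamma * expected_next

    for _ in range(max_iter):
        # Bellman optimality backup applied to every state.
        V_new = [
            max(q_value(s, a, V) for a in range(n_actions))
            for s in range(n_states)
        ]
        delta = max(abs(new - old) for new, old in zip(V_new, V))
        V = V_new
        if delta < tol:
            break

    # Greedy policy with respect to the final value function.
    policy = [
        max(range(n_actions), key=lambda a: q_value(s, a, V))
        for s in range(n_states)
    ]
    return V, policy


# Toy example: action 0 stays in place, action 1 switches state;
# the only reward is for staying in state 1.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # from state 0: stay, switch
    [[0.0, 1.0], [1.0, 0.0]],  # from state 1: stay, switch
]
R = [
    [0.0, 0.0],
    [1.0, 0.0],
]
V, policy = value_iteration(P, R, gamma=0.9)
```

In this toy problem the greedy policy switches out of state 0 and stays in state 1. When P and R are unknown, as in the setting the abstract goes on to describe, this backup cannot be computed directly and the agent has to estimate it from interaction data.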
