Exploration in environments with continuous control and sparse rewards remains a key challenge in reinforcement learning (RL). One of the approaches to encourage more systematic and efficient exploration relies on surprise as an intrinsic reward for the agent. We introduce a new definition of surprise and its RL implementation named variational assorted surprise exploration (VASE). VASE uses a Bayesian neural network as a model of the environment dynamics and is trained using variational inference, alternately updating the accuracy of the agent's model and policy. Our experiments show that in continuous control sparse reward environments, VASE outperforms other surprise-based exploration techniques.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2021.3105140DOI Listing

Publication Analysis

Top Keywords

variational assorted
8
assorted surprise
8
surprise exploration
8
reinforcement learning
8
continuous control
8
control sparse
8
exploration
5
vase
4
vase variational
4
surprise
4

Similar Publications

Exploration in environments with continuous control and sparse rewards remains a key challenge in reinforcement learning (RL). One of the approaches to encourage more systematic and efficient exploration relies on surprise as an intrinsic reward for the agent. We introduce a new definition of surprise and its RL implementation named variational assorted surprise exploration (VASE).

View Article and Find Full Text PDF

Bacterial gliding fluid dynamics on a layer of non-Newtonian slime: Perturbation and numerical study.

J Theor Biol

May 2016

Theoretical Physics Division, PINSTECH, P.O. Nilore, Islamabad 44000, Pakistan.

Gliding bacteria are an assorted group of rod-shaped prokaryotes that adhere to and glide on certain layers of ooze slime attached to a substratum. Due to the absence of organelles of motility, such as flagella, the gliding motion is caused by the waves moving down the outer surface of these rod-shaped cells. In the present study we employ an undulating surface model to investigate the motility of bacteria on a layer of non-Newtonian slime.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!