Action-driven contrastive representation for reinforcement learning.

Minbeom Kim, Kyeongha Rho, Yong-Duk Kim, Kyomin Jung

PLoS One

Graduate School of Artificial Intelligence, Seoul National University, Seoul, Republic of Korea.

Published: May 2022

In reinforcement learning, reward-driven feature learning directly from high-dimensional images faces two challenges: sample efficiency in solving control tasks and generalization to unseen observations. Prior work has addressed these issues by learning representations from pixel inputs. However, those representations are either vulnerable to the high diversity inherent in environments or fail to account for the characteristics that matter for solving control tasks. To mitigate these limitations, we propose a novel contrastive representation method, Action-Driven Auxiliary Task (ADAT), which forces a representation to concentrate on the features essential for deciding actions and to ignore control-irrelevant details. Using ADAT's augmented state-action dictionary, the agent learns a representation that maximizes agreement between observations sharing the same action. The proposed method significantly outperforms model-free and model-based algorithms on Atari and OpenAI ProcGen, widely used benchmarks for sample efficiency and generalization.
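The idea of maximizing agreement between observations that share an action can be illustrated with a minimal sketch. It assumes a supervised-contrastive (InfoNCE-style) objective in which the action serves as the label; this is an illustrative stand-in, not the exact ADAT loss from the paper. The function name `action_contrastive_loss`, the `temperature` value, and the use of plain NumPy arrays for encoder outputs are all assumptions made for the example.

```python
import numpy as np

def action_contrastive_loss(embeddings, actions, temperature=0.1):
    """Hypothetical action-labelled contrastive loss (not the paper's exact objective).

    embeddings: (N, D) array of encoder outputs for N observations, N >= 2, nonzero rows.
    actions:    (N,) array of discrete actions taken at those observations.
    Observations sharing an action are treated as positives for each other.
    """
    z = np.asarray(embeddings, dtype=np.float64)
    a = np.asarray(actions)
    n = z.shape[0]

    # Cosine similarity between every pair of embeddings, scaled by temperature.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    logits = (z @ z.T) / temperature

    # An observation is never its own positive or negative.
    self_mask = np.eye(n, dtype=bool)
    logits = np.where(self_mask, -np.inf, logits)

    # Row-wise log-softmax over all other observations in the batch.
    row_max = logits.max(axis=1, keepdims=True)
    shifted = logits - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

    # Positives: other observations with the same action.
    pos_mask = (a[:, None] == a[None, :]) & ~self_mask
    pos_count = pos_mask.sum(axis=1)
    valid = pos_count > 0  # skip anchors whose action is unique in the batch
    if not valid.any():
        return 0.0

    pos_log_prob = np.where(pos_mask, log_prob, 0.0).sum(axis=1)
    return float((-pos_log_prob[valid] / pos_count[valid]).mean())

# Toy batch: two clusters of embeddings.
z = np.array([[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0]])
loss_aligned = action_contrastive_loss(z, [0, 0, 1, 1])   # clusters match actions
loss_mixed = action_contrastive_loss(z, [0, 1, 0, 1])     # clusters ignore actions
print(loss_aligned < loss_mixed)
```

In this sketch the loss is low when embeddings already cluster by action and high when they do not, so minimizing it would push an encoder toward action-relevant features. In practice such a term would be added as an auxiliary loss alongside the reinforcement learning objective.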

Full text:
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8932622
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0265456
