The 34th Annual Conference of the Japanese Society for Artificial Intelligence, 2020

Presentation information

International Session: E-2 Machine learning

[2K6-ES-2] Machine learning: Modeling

Wed. June 10, 2020, 17:50 - 19:30, Room K (jsai2020online-11)

Chair: Junichiro Mori (The University of Tokyo)

19:10 - 19:30

[2K6-ES-2-05] Combining Local and Global Exploration via Intrinsic Rewards

〇Nicolas Bougie [1,2], Ryutaro Ichise [2,1] (1. The Graduate University for Advanced Studies, SOKENDAI; 2. National Institute of Informatics)

Keywords: Deep Reinforcement Learning, Exploration, Curiosity, Autonomous Exploration

Reinforcement learning methods rely on well-designed rewards provided by the environment. However, rewards are often sparse in the real world, so exploration remains one of the key challenges of reinforcement learning. While prior work on intrinsic motivation holds promise for better local exploration, discovering global exploration strategies is beyond the reach of current methods. We propose a novel end-to-end intrinsic reward formulation that introduces high-level exploration into reinforcement learning. Our technique decomposes the exploration bonus into a fast reward that handles local exploration and a slow reward that incentivizes long-horizon exploration. We formulate curiosity as the error in an agent's ability to reconstruct observations given their contexts. We further propose to balance local and high-level strategies by estimating state diversity. Experimental results show that this long-horizon exploration bonus enables our agents to outperform prior work in most tasks, including Minigrid and Atari games.
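
The abstract gives no formulas, so the following is only an illustrative sketch of how a fast (local) and a slow (long-horizon) reconstruction-error bonus might be blended using a state-diversity estimate. Every name here (`reconstruction_error`, `state_diversity`, `intrinsic_reward`), the use of mean squared error, the distinct-state ratio as a diversity measure, and the linear blending rule are assumptions made for illustration and are not taken from the paper.

```python
def reconstruction_error(observation, reconstruction):
    """Mean squared error between an observation and its reconstruction.

    Assumption: curiosity is measured as MSE; the paper may use another error.
    """
    if len(observation) != len(reconstruction):
        raise ValueError("observation and reconstruction must have equal length")
    squared = [(o - r) ** 2 for o, r in zip(observation, reconstruction)]
    return sum(squared) / len(squared)


def state_diversity(recent_states):
    """Fraction of distinct states in a recent window (0.0 for an empty window).

    Assumption: states are hashable (e.g. tuples of grid coordinates).
    """
    if not recent_states:
        return 0.0
    return len(set(recent_states)) / len(recent_states)


def intrinsic_reward(fast_error, slow_error, diversity):
    """Blend the fast (local) and slow (long-horizon) bonuses.

    Hypothetical rule: when recent states are diverse, local exploration is
    working, so the fast bonus gets weight `diversity`; when the agent keeps
    revisiting the same states, weight shifts to the slow bonus.
    """
    if not 0.0 <= diversity <= 1.0:
        raise ValueError("diversity must lie in [0, 1]")
    return diversity * fast_error + (1.0 - diversity) * slow_error


# Usage with made-up numbers: both reconstructions come from hypothetical models.
observation = [1.0, 2.0]
fast_bonus = reconstruction_error(observation, [1.0, 0.0])  # short-context model
slow_bonus = reconstruction_error(observation, [3.0, 4.0])  # long-context model
diversity = state_diversity([(0, 0), (0, 0), (0, 1), (1, 1)])
bonus = intrinsic_reward(fast_bonus, slow_bonus, diversity)
```

In practice the two reconstructions would come from learned models conditioned on short and long contexts, and the resulting bonus would be added to the environment reward; those components are outside the scope of this sketch.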
