JSAI2020

Presentation information

General Session

General Session » J-2 Machine learning

[2J5-GS-2] Machine learning: Advancement reinforcement learning (1)

Wed. Jun 10, 2020 3:50 PM - 5:30 PM Room J (jsai2020online-10)

座長:内部英治(ATR)

4:30 PM - 4:50 PM

[2J5-GS-2-03] Policy Transfer in Reinforcement Learning with Domain Adaptation using Transition Probability

〇Rei Sato1,2, Kazuto Fukuchi1,2, Jun Sakuma1,2, Youhei Akimoto1,2 (1. University of Tsukuba, 2. RIKEN Center for Advanced Intelligence Project)

Keywords:Reinforcement Learning, Transfer Learning, Domain Adaptation, Representation Learning

Reinforcement learning is drawing increasing attentions in real world applications.Since it often takes enormous cost to learn the agent in the real world environment (called target task), pre-training in a low-cost environment such as a simulator (called source task) is gathering attention.
In this paper, we focus on the situation where the source and target tasks are different only in the form of state observation.
Our proposed method trains encoders mapping state observation to latent representations, and trains a policy that receives a latent representation and output an action.We utilize the transition probability to learn latent representations robust to changes in the form of state observation.This enables transferring the policy learned in the source task to improve the performance in the target task.Experiments show that our method can achieve higher performance when the number of interactions in the target task is limited.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password