JSAI2020

Presentation information

General Session

[2J6-GS-2] Machine learning: Advancement reinforcement learning (2)

Wed. Jun 10, 2020 5:50 PM - 7:30 PM Room J (jsai2020online-10)

Chair: Tadahiro Taniguchi (Ritsumeikan University)

5:50 PM - 6:10 PM

[2J6-GS-2-01] Transferable Inverse Reinforcement Learning with Demonstrations in Multiple Dynamics

〇Yuki Nakaguchi (NEC)

Keywords: Inverse Reinforcement Learning, Reinforcement Learning, Maximum Entropy

Reinforcement learning (RL) has recently achieved increasingly high performance on a variety of complex decision-making and control tasks, but it requires careful engineering of reward functions to solve real-world tasks. Inverse reinforcement learning (IRL) is a framework for constructing reward functions by learning from demonstrations, but the estimated reward function cannot be transferred to other dynamics because of its dynamics-dependent indefiniteness. To obtain transferable reward functions, we propose a novel mathematical formulation that fixes this dynamics-dependent indefiniteness by using demonstrations generated under multiple dynamics. We also show that the existing discussion of reward-function indefiniteness can be generalized from standard RL to maximum entropy RL, which serves as the forward-solver subroutine in typical IRL algorithms based on maximum entropy IRL.
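
The dynamics-dependent indefiniteness mentioned in the abstract can be illustrated with a small tabular experiment. The sketch below is not the author's formulation. It uses the standard potential-based shaping construction as an assumed stand-in: a state-action reward r and a shaped reward r + gamma * E[phi(s')] - phi(s), with the expectation taken under one dynamics P1, give the same maximum entropy policy under P1 but generally different policies under another dynamics P2. All names (`soft_policy`, `random_dynamics`, `phi`, `P1`, `P2`) and the random MDP are hypothetical choices made for illustration.

```python
import numpy as np


def soft_policy(P, r, gamma=0.9, iters=2000):
    """Maximum entropy (soft) value iteration on a tabular MDP.

    P: transition probabilities, shape (S, A, S).
    r: state-action rewards, shape (S, A).
    Returns the soft-optimal policy pi(a|s), shape (S, A).
    """
    V = np.zeros(P.shape[0])
    for _ in range(iters):
        Q = r + gamma * P @ V                      # soft Bellman backup, (S, A)
        m = Q.max(axis=1)                          # stabilise the log-sum-exp
        V = m + np.log(np.exp(Q - m[:, None]).sum(axis=1))
    Q = r + gamma * P @ V
    m = Q.max(axis=1)
    V = m + np.log(np.exp(Q - m[:, None]).sum(axis=1))
    return np.exp(Q - V[:, None])                  # rows sum to 1


def random_dynamics(rng, S, A):
    """Draw a random transition tensor whose last axis sums to 1."""
    P = rng.random((S, A, S))
    return P / P.sum(axis=2, keepdims=True)


rng = np.random.default_rng(0)
S, A, gamma = 4, 3, 0.9
P1 = random_dynamics(rng, S, A)    # dynamics in which demonstrations are observed
P2 = random_dynamics(rng, S, A)    # a different dynamics to transfer to
r = rng.normal(size=(S, A))        # assumed "true" reward
phi = rng.normal(size=S)           # arbitrary potential over states

# Shaped reward: indistinguishable from r under P1, but it depends on P1.
r_shaped = r + gamma * P1 @ phi - phi[:, None]

same_in_P1 = np.allclose(soft_policy(P1, r, gamma), soft_policy(P1, r_shaped, gamma))
same_in_P2 = np.allclose(soft_policy(P2, r, gamma), soft_policy(P2, r_shaped, gamma))
print("same policy under P1:", same_in_P1)
print("same policy under P2:", same_in_P2)
```

Under these assumptions the two rewards cannot be told apart from demonstrations in P1 alone, while they would typically disagree once the dynamics change, which is the motivation the abstract gives for using demonstrations from multiple dynamics.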
