Presentation information

General Session

[1G2-GS-2a] Machine Learning: Reinforcement Learning

Tue. Jun 8, 2021 1:20 PM - 3:00 PM Room G (GS room 2)

Chair: Yoshihiro Ichikawa (National Institute of Technology, Nara College)

1:20 PM - 1:40 PM

[1G2-GS-2a-01] Theoretical Evaluation of Performance of Maximum Entropy Inverse Reinforcement Learning

〇Yuki Nakaguchi¹ (1. NEC Data Science Research Laboratories)

Keywords: Inverse Reinforcement Learning, Reinforcement Learning, Maximum Entropy

Recently, reinforcement learning (RL) has shown increasingly high performance on a variety of complex decision-making and control tasks, but it requires careful engineering of reward functions to solve real tasks. Inverse reinforcement learning (IRL) is a framework for constructing reward functions by learning from demonstrations, but maximum entropy IRL, the mainstream approach to IRL, offers no way to guarantee the performance of the learned reward functions. It is therefore unclear how reliable its results are. To provide a theoretical guarantee on the performance of maximum entropy IRL, we evaluate and discuss its performance theoretically.

