Learning Optimal Polices through Interactive Imitation Learning

Yuki Nakaguchi

9:20 AM - 9:40 AM

[3D1-GS-2-02] Learning Optimal Polices through Interactive Imitation Learning

〇Yuki Nakaguchi¹, Dai Kubota¹ (1. NEC)

Keywords:Reinforcement Learning, Imitation Learning, Interactive Imitation Learning

Imitation learning solves reinforcement learning problems with reference to some teacher information. While the typical method of behavioral cloning could not be applied to long-term tasks due to covariate shifts, interactive imitation learning solves this problem by obtaining online feedback from a teacher model. On the other hand, in the existing methods of interactive imitation learning, students could not learn the optimal policies when the teacher differed from the optimal for the student. In this study, we propose a novel method to solve this problem while providing an organized review of interactive imitation learning.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[3D1-GS-2] Machine learning

[3D1-GS-2-02] Learning Optimal Polices through Interactive Imitation Learning

Password