JSAI2024

Presentation information

General Session

[1B4-GS-2] Machine learning: Expression learning

Tue. May 28, 2024 3:00 PM - 4:40 PM Room B (Concert hall)

Chair: Masahiko Osawa (Nihon University)

4:00 PM - 4:20 PM

[1B4-GS-2-04] Interactive Imitation Learning Based on Teachers' Offline Data

〇Yuki Nakaguchi (NEC)

Keywords: Reinforcement Learning, Imitation Learning, Interactive Imitation Learning

Imitation learning solves reinforcement learning problems by drawing on information from a teacher. The standard method, behavior cloning, cannot be applied to long-horizon tasks because covariate shift accumulates over time; interactive imitation learning addresses this by obtaining online feedback from a teacher model. Moreover, even when the teacher is suboptimal, for example when the teacher's task differs slightly from the student's, the student's own reward information makes it possible to learn faster than reinforcement learning and even to surpass the teacher. However, interactive imitation learning requires a teacher that can respond online, which restricts the set of usable teachers. In particular, efficient interactive imitation learning requires the teacher's value function, so usable teachers are limited to models trained by reinforcement learning. In this study, we propose a method that extends efficient interactive imitation learning, which normally requires a value function, to teachers for which only offline trajectory data is available.
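
To make the idea of a value function derived from offline trajectories concrete, the sketch below computes every-visit Monte Carlo value estimates from logged teacher episodes. This is only an illustration and is not necessarily the method proposed in the paper: it assumes that the offline data contains per-step rewards and that states are discrete and hashable. The names `returns_to_go` and `monte_carlo_values` and the discount factor `gamma` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Hashable, List, Sequence, Tuple

# Assumed data layout: one trajectory is a list of (state, reward) pairs.
Trajectory = Sequence[Tuple[Hashable, float]]


def returns_to_go(rewards: Sequence[float], gamma: float) -> List[float]:
    """Discounted return from each time step to the end of the episode."""
    out = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so that each step reuses the return of the next step.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


def monte_carlo_values(
    trajectories: Sequence[Trajectory], gamma: float
) -> Dict[Hashable, float]:
    """Every-visit Monte Carlo estimate of the teacher's state values."""
    totals: Dict[Hashable, float] = defaultdict(float)
    counts: Dict[Hashable, int] = defaultdict(int)
    for traj in trajectories:
        states = [s for s, _ in traj]
        rewards = [r for _, r in traj]
        for s, g in zip(states, returns_to_go(rewards, gamma)):
            totals[s] += g
            counts[s] += 1
    # Average the observed returns for each visited state.
    return {s: totals[s] / counts[s] for s in totals}


if __name__ == "__main__":
    # Two toy teacher episodes over states "a" and "b" (made-up data).
    data = [[("a", 1.0), ("b", 1.0)], [("a", 0.0), ("b", 1.0)]]
    print(monte_carlo_values(data, gamma=0.5))
```

A table like this covers only states the teacher actually visited; continuous or unseen states would need a function approximator fitted to the same return targets.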
