JSAI2020

Presentation information

General Session

General Session » J-11 Robot and real worlds

[1Q4-GS-11] Robot and real worlds: Machine learning

Tue. Jun 9, 2020 3:20 PM - 5:00 PM Room Q (jsai2020online-17)

座長:堀井隆斗(大阪大学)

3:40 PM - 4:00 PM

[1Q4-GS-11-02] Learning Alignment Tasks Based on Residual Reinforcement Learning with Multiple Expert Policies

〇Kazuki Yaginuma1, Tomoaki Nakamura1, Yusuke Kato2,3, Takayuki Nagai1,4, Jun Ozawa2,3 (1. The University of Electro-Communications, 2. Advanced Industrial Science and Technology, 3. Panasonic Corporation, 4. Osaka University)

Keywords:Residual Reinforcement Learning, Reinforcement Learning, Gaussian process-hidden semi-Markov model (GP-HSMM)

Reinforcement learning (RL) enables robots to flexibly learn skills from the interaction with the environment.

However, robots move randomly to explore valuable actions at an early stage, which can be unsafe.

Moreover, it takes a long time to learn motion from scratch.

To learn more efficiently and safely, residual reinforcement learning (RRL) has been proposed.

In RRL, skills are learned by correcting expert policy that can be obtained from an expert's demonstration.

However, conventional RRL assumes a single expert policy, whereas we consider multiple policies for more complex tasks.

In this paper, we propose RRL with multiple expert policies, where a selection of a suitable expert policy in the current state is also learned based on RL.

Experimentally, we show that the agent can learn more accurate skills in the object alignment task.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password