Presentation information

General Session

General Session » [GS] J-2 Machine learning

[1Q2-J-2] Machine learning: reinforcement learning and its advances

Tue. Jun 4, 2019 1:20 PM - 3:00 PM Room Q (6F Meeting room, Bandaijima bldg.)

Chair:Koichiro Yoshino Reviewer:Kohei Miyaguchi

2:40 PM - 3:00 PM

[1Q2-J-2-05] On/off-policy Hybrid Deep Reinforcement Learning and Simulation in Control Tasks

〇Bonan Wang1, Shin Kawai1, Hajime Nobuhara1 (1. University of Tsukuba)

Keywords:Reinforcement Learning, Deep Learning, Hybrid, LSTM

Recently, deep reinforcement learning with neural network shows great performance in tasks such as game AI and robotics control tasks. However, on-policy and off-policy reinforcement learning methods proposed in related works have problems such as slow exploration speed. To solve these problems, we propose a hybrid deep reinforcement learning method which combines on-policy and off-policy reinforcement learning in this paper. The comparison experiment shows that the proposed method outperforms classic DDPG and DPPO method with an obvious advantage.