JSAI2022

Presentation information

Organized Session

Organized Session » OS-19

[2M5-OS-19c] 世界モデルと知能(3/4)

Wed. Jun 15, 2022 3:20 PM - 5:00 PM Room M (Room B-2)

オーガナイザ:鈴木 雅大(東京大学)、岩澤 有祐(東京大学)[現地]、河野 慎(東京大学)、熊谷 亘(東京大学)、森 友亮(スクウェア・エニックス)、松尾 豊(東京大学)

4:20 PM - 4:40 PM

[2M5-OS-19c-04] Effect of Temporal Discretization in Control Policy Learning Algorithms and Their Extensions to Continuous Time

〇Tatsuya Matsushima1, Jumpei Arima2, Kaito Suzuki3, Yusuke Iwasawa1, Yutaka Matsuo1 (1. The University of Tokyo, 2. Matsuo Institute, 3. Tohoku University)

Keywords:Robot Learning, Reinforcement Learning, Deep Reinforcement Learning, Temporal Discretization

There have been many attempts, mainly in deep reinforcement learning domain, to learn end-to-end control policies using high-dimensional inputs like images, and their effectiveness has been verified mainly with video games and simple robot simulators.
Many of the environments used in these studies are designed with the assumption that time is discretized and that observations and actions operate synchronously.
This is different from the nature of real robotic systems, where various sensors and actuators operate asynchronously at different frequencies and computation time needs to be taken into account, which can be the reasons why learning end-to-end policies is difficult in the real world.
After summarizing and discussing previous works on time discretization, this paper presents experiments using algorithms and robot simulation environments frequently used in deep reinforcement learning to verify and discuss the effect of time discretization on the performance of policy learning.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password