JSAI2022

Presentation information

Organized Session

Organized Session » OS-19

[2M5-OS-19c] 世界モデルと知能(3/4)

Wed. Jun 15, 2022 3:20 PM - 5:00 PM Room M (Room B-2)

オーガナイザ:鈴木 雅大(東京大学)、岩澤 有祐(東京大学)[現地]、河野 慎(東京大学)、熊谷 亘(東京大学)、森 友亮(スクウェア・エニックス)、松尾 豊(東京大学)

3:40 PM - 4:00 PM

[2M5-OS-19c-02] Learning Egg Drilling Task by using Multimodal RSSM with a real robot

〇Yuki Toramatsu1, Pedro Miguel Uriguen Eljuri1, Katsuyoshi Maeyama1, Tadahiro Taniguchi1 (1. Ritsumeikan University)

Keywords:World Model, Imitation Learning, Multimodal Learning, Manipulation

In this paper, we train a Multimodal Recurrent State-Space Model (MRSSM) for an egg drilling task, analyze the composed state space and control a real robot. One of the methods of biological experiments is to create a cranial window in a rat, and there is an egg task as a mock task. It is difficult to distinguish contact and non-contact states between the egg and the drill in the Egg Task by using only image information. However, if we use the image and audio information, the state space that separates contact and non-contact states can be composed. In the experiments, we analyzed the transitions of the latent states, and the real robot was controlled by MRSSM. The results of the trained MRSSM with images and audio information show different transitions between the contact and non-contact states. In addition, we confirmed that the MRSSM could control a real robot.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password