JSAI2023

Presentation information

Organized Session

Organized Session » OS-21

[2G4-OS-21d] 世界モデルと知能

Wed. Jun 7, 2023 1:30 PM - 3:10 PM Room G (A4)

オーガナイザ:鈴木 雅大、岩澤 有祐、河野 慎、熊谷 亘、松嶋 達也、森 友亮、松尾 豊

1:30 PM - 1:50 PM

[2G4-OS-21d-01] Switching Head-Tail Funnel UNITER: Multimodal Instruction Comprehension for Object Manipulation Tasks

〇Ryosuke Korekata1, Motonari Kambara1, Yu Yoshida1, Shintaro Ishikawa1, Yosuke Kawasaki1, Masaki Takahashi1, Komei Sugiura1 (1. Keio University)

Keywords:Multimodal Language Understanding, Vision & Language, fetch-and-carry, Object Manipulation, Domestic Service Robot

This paper describes a domestic service robot (DSR) that fetches everyday objects and carries them to specified destinations according to free-form natural language instructions. We propose Switching Head-Tail Funnel UNITER, which solves the task by predicting the target object and the destination individually using a single model. We conduct physical experiments in which a DSR delivers standardized everyday objects in a standardized domestic environment as requested by instructions with referring expressions. The experimental results show that our method outperforms the baseline method in terms of language comprehension accuracy and the object grasping and placing actions are achieved with success rates of more than 90%.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password