JSAI2021

Presentation information

General Session » GS-8 Robot and real worlds

[2J3-GS-8b] Robot and real worlds: Component technologies

Wed. Jun 9, 2021 1:20 PM - 3:00 PM Room J (GS room 5)

Chair: Eiji Uchibe (ATR)

2:00 PM - 2:20 PM

[2J3-GS-8b-03] Improving the Robustness to Variations of Objects and Instructions with A Neuro-Symbolic Approach for Interactive Instruction Following

〇Kazutoshi Shinoda¹, Yuki Takezawa², Masahiro Suzuki¹, Yusuke Iwasawa¹, Yutaka Matsuo¹ (1. The University of Tokyo, 2. Kyoto University)

Keywords: instruction following, multimodal learning, vision, language

Instruction following is a task for learning to transform natural language instructions into a sequence of actions in visual environments.
Recently, an interactive instruction following task has been proposed to encourage research in following natural language instructions that require interactions with objects.
We observe that an existing model for this task is not robust to variations of objects and instructions, which may cause a serious problem in real-world applications.
We hypothesize that this is due to the high sensitivity of neural feature extraction to small perturbations in vision and language.
We propose a neuro-symbolic approach to mitigate this lack of robustness.
Concretely, we introduce object detection and semantic parsing modules to this task and make reasoning over symbolic features feasible.
Our experiments on the ALFRED dataset show that our approach significantly improves the performance on subtasks that require object interactions.
