2:00 PM - 2:20 PM
[2J3-GS-8b-03] Improving the Robustness to Variations of Objects and Instructions with A Neuro-Symbolic Approach for Interactive Instruction Following
Keywords:instruction following, multimodal learning, vision, language
Instruction following is a task for learning to transform natural language instructions into a sequence of actions in visual environments.
Recently, an interactive instruction following task has been proposed to encourage research in following natural language instructions that require interactions with objects.
We observe that an existing model for this task is not robust to variations of objects and instructions, which may cause a serious problem in real-world applications.
We assume that this is due to the high sensitiveness of neural feature extraction to small perturbations in vision and language.
We propose a Neuro-Symbolic approach to mitigate the lack of robustness.
Concretely, we introduce object detection and semantic parsing modules to this task and make reasoning over symbolic features feasible.
Our experiments on the ALFRED dataset show that our approach significantly improves the performance on subtasks that require object interactions.
Recently, an interactive instruction following task has been proposed to encourage research in following natural language instructions that require interactions with objects.
We observe that an existing model for this task is not robust to variations of objects and instructions, which may cause a serious problem in real-world applications.
We assume that this is due to the high sensitiveness of neural feature extraction to small perturbations in vision and language.
We propose a Neuro-Symbolic approach to mitigate the lack of robustness.
Concretely, we introduce object detection and semantic parsing modules to this task and make reasoning over symbolic features feasible.
Our experiments on the ALFRED dataset show that our approach significantly improves the performance on subtasks that require object interactions.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.