JSAI2023

Presentation information

International Session


[2U1-IS-1b] Knowledge engineering

Wed. Jun 7, 2023 9:00 AM - 10:40 AM Room U (Online)

Chair: Katsutoshi Yada (Kansai University)

9:20 AM - 9:40 AM

[2U1-IS-1b-02] Towards Commonsense Reasoning in Outdoor Visual Linguistic Navigation

〇Anirudh Reddy Kondapally1,2, Kentaro Yamada2, Hitomi Yanaka1 (1. The University of Tokyo, 2. Honda R&D Co., Ltd., Tokyo, Japan)

[[Online, Regular]]

Keywords: Commonsense, AI reasoning, Visual Linguistic Navigation

Deep learning models have brought considerable progress in real-world navigation tasks such as object detection and path planning. They have also enabled the more complex task of visual-linguistic navigation (VLN), i.e., dialogue for navigation. Among VLN variations, outdoor scenes are significantly more difficult than indoor ones because of the randomness inherent in an uncontrolled environment. Outdoor VLN is also said to be closer to the reasoning required in the real world. However, the datasets available for outdoor VLN tasks have focused mainly on judging spatial reasoning abilities. This is not enough to create systems that work in real life, since such systems also need commonsense reasoning abilities, i.e., social and event-based reasoning. We create a small benchmark dataset based on commonsense reasoning and evaluate the performance of state-of-the-art VLN models on it. Our findings show that there is a need for commonsense reasoning-based datasets.
