2023年度 人工知能学会全国大会(第37回)

講演情報

国際セッション

国際セッション » IS-4 Robots and real worlds

[3U5-IS-4] Robots and real worlds

2023年6月8日(木) 15:30 〜 17:10 U会場 (遠隔)

Chair: Nihan Karatas (Nagoya University)

15:30 〜 15:50

[3U5-IS-4-01] A neuro-symbolic approach for multimodal reference expression comprehension

〇Aman Jain1,2, Anirudh Reddy Kondapally1,2, Kentaro Yamada2, Hitomi Yanaka1 (1. The University of Tokyo, 2. Honda R&D Co.,Ltd., Tokyo, Japan)

[[Online, Regular]]

キーワード:Human-Machine Interaction, Reference Expression Comprehension, Neuro-symbolic Models

Human-Machine Interaction (HMI) systems have gained huge interest in recent years, with reference expression comprehension being one of the main challenges. Traditionally human-machine interaction has been mostly limited to speech and visual modalities. However, to allow for more freedom in interaction, recent works have proposed the integration of additional modalities, such as gestures in HMI systems. We consider such an HMI system with pointing gestures and construct a table-top object picking scenario inside a simulated virtual reality (VR) environment to collect data. Previous works for such a task have used deep neural networks to classify the referred object, which lacks transparency. In this work, we propose an interpretable and compositional model, crucial to building robust HMI systems for real-world application, based on a neuro-symbolic approach to tackle this task. Finally we also show the generalizability of our model on unseen environments and report the results.

講演PDFパスワード認証
論文PDFの閲覧にはログインが必要です。参加登録者の方は「参加者用ログイン」画面からログインしてください。あるいは論文PDF閲覧用のパスワードを以下にご入力ください。

パスワード