JSAI2023

Presentation information

Organized Session

[2G1-OS-21c] World Models and Intelligence

Wed. Jun 7, 2023 9:00 AM - 10:40 AM Room G (A4)

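Organizers: Masahiro Suzuki (鈴木 雅大), Yusuke Iwasawa (岩澤 有祐), Makoto Kawano (河野 慎), Wataru Kumagai (熊谷 亘), Tatsuya Matsushima (松嶋 達也), Yusuke Mori (森 友亮), Yutaka Matsuo (松尾 豊)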

9:40 AM - 10:00 AM

[2G1-OS-21c-03] Multimodal Information Integration with Iterative Amortized Inference

〇Yuta Oshima, Masahiro Suzuki, Yutaka Matsuo (Graduate School of Engineering, The University of Tokyo)

Keywords: multimodality, iterative amortized inference

Multimodal variational autoencoders (VAEs) learn an inference model that yields a latent representation integrating information from all modalities. However, when the shared representation must be obtained from a single, arbitrary modality, the inputs of the other modalities are missing, which prevents the representation from being inferred properly. In this study, we recast the missing-modality problem as part of the amortization gap between amortized inference from an arbitrary modality and the multimodal evidence lower bound (ELBO), and propose using iterative amortized inference to obtain an appropriate shared representation from a single-modality input. Iterative amortized inference, however, must evaluate the multimodal ELBO during its iterations and therefore still requires the missing modality inputs. We therefore prepare an additional inference model that takes only the modality of interest as input and train it by distillation, with iterative amortized inference as the teacher and the new inference model as the student. We verify that this yields an inference model that can acquire a shared representation from a single modality.
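
The teacher-student pipeline described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation. It assumes a linear-Gaussian model with two modalities in place of a trained multimodal VAE, uses plain gradient ascent on the variational mean as a stand-in for the learned iterative inference model, and fits the student as a linear map by least squares. All names (`W1`, `W2`, `SIGMA2`, `teacher_refine`, and so on) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup: linear-Gaussian decoders stand in for a trained multimodal VAE.
D_Z, D_X1, D_X2, N = 2, 5, 4, 512
SIGMA2 = 0.1                                   # assumed observation-noise variance
W1 = rng.normal(size=(D_X1, D_Z))              # decoder weights for modality 1
W2 = rng.normal(size=(D_X2, D_Z))              # decoder weights for modality 2

z_true = rng.normal(size=(N, D_Z))             # latents drawn from the N(0, I) prior
x1 = z_true @ W1.T + np.sqrt(SIGMA2) * rng.normal(size=(N, D_X1))
x2 = z_true @ W2.T + np.sqrt(SIGMA2) * rng.normal(size=(N, D_X2))


def elbo_mean_terms(mu, x1, x2):
    """Batch average of the multimodal ELBO terms that depend on the variational mean mu."""
    rec1 = -np.sum((x1 - mu @ W1.T) ** 2, axis=1) / (2 * SIGMA2)
    rec2 = -np.sum((x2 - mu @ W2.T) ** 2, axis=1) / (2 * SIGMA2)
    prior = -0.5 * np.sum(mu ** 2, axis=1)
    return float(np.mean(rec1 + rec2 + prior))


def elbo_grad(mu, x1, x2):
    """Per-example gradient of those ELBO terms with respect to mu."""
    return ((x1 - mu @ W1.T) @ W1 + (x2 - mu @ W2.T) @ W2) / SIGMA2 - mu


def teacher_refine(x1, x2, n_steps=200):
    """Teacher: iteratively refine mu using BOTH modalities (plain gradient ascent)."""
    hessian = (W1.T @ W1 + W2.T @ W2) / SIGMA2 + np.eye(D_Z)
    step = 1.0 / np.linalg.eigvalsh(hessian).max()   # 1/L step keeps the ascent monotone
    mu = np.zeros((x1.shape[0], D_Z))
    for _ in range(n_steps):
        mu = mu + step * elbo_grad(mu, x1, x2)
    return mu


# Teacher targets require both modalities.
mu_teacher = teacher_refine(x1, x2)

# Student: a linear encoder that sees only modality 1, fitted to the teacher's outputs.
A, *_ = np.linalg.lstsq(x1, mu_teacher, rcond=None)
mu_student = x1 @ A
```

In this sketch the second modality is used only to build the teacher's targets; once `A` is fitted, inference needs `x1 @ A` alone, which mirrors the single-modality inference goal stated in the abstract.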
