JSAI2023

Presentation information

Organized Session

Organized Session » OS-21

[2G1-OS-21c] 世界モデルと知能

Wed. Jun 7, 2023 9:00 AM - 10:40 AM Room G (A4)

オーガナイザ:鈴木 雅大、岩澤 有祐、河野 慎、熊谷 亘、松嶋 達也、森 友亮、松尾 豊

9:20 AM - 9:40 AM

[2G1-OS-21c-02] Multi-modal NewtonianVAE: High-precision reaching method for autonomous suturing

〇Mai Terashima1, Pedro Miguel Uriguen Eljuri1, Yuanyuan Jia1, Hironobu Shibata1, Masaki Ito1, Tadahiro Taniguchi1 (1. Ritsumeikan University)

Keywords:World model, Multi-modal information

This study focuses on NewtonianVAE, a world model that can learn a proportionally controllable latent space. To achieve precise control in a physical world, it is necessary to construct a latent space of NewtonianVAE representing a physical world from multi-modal observations. However, learning from multi-modal observations using NewtonianVAE has not been studied.
To address this issue, we discuss methods for learning multi-modal observations using NewtonianVAE.
In this paper, we propose Multi-modal NewtonianVAE (MNVAE), which uses Mixture-of-Products-of-Experts (MoPoE) to integrate multi-modal observations.
MNVAE learns a latent space representing a physical environment and it has the potential for precise control in a physical world.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password