9:20 AM - 9:40 AM
[2G1-OS-21c-02] Multi-modal NewtonianVAE: High-precision reaching method for autonomous suturing
Keywords:World model, Multi-modal information
This study focuses on NewtonianVAE, a world model that can learn a proportionally controllable latent space. To achieve precise control in a physical world, it is necessary to construct a latent space of NewtonianVAE representing a physical world from multi-modal observations. However, learning from multi-modal observations using NewtonianVAE has not been studied.
To address this issue, we discuss methods for learning multi-modal observations using NewtonianVAE.
In this paper, we propose Multi-modal NewtonianVAE (MNVAE), which uses Mixture-of-Products-of-Experts (MoPoE) to integrate multi-modal observations.
MNVAE learns a latent space representing a physical environment and it has the potential for precise control in a physical world.
To address this issue, we discuss methods for learning multi-modal observations using NewtonianVAE.
In this paper, we propose Multi-modal NewtonianVAE (MNVAE), which uses Mixture-of-Products-of-Experts (MoPoE) to integrate multi-modal observations.
MNVAE learns a latent space representing a physical environment and it has the potential for precise control in a physical world.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.