JSAI2023

Presentation information

Organized Session

Organized Session » OS-21

[1G5-OS-21b] 世界モデルと知能

Tue. Jun 6, 2023 5:00 PM - 6:40 PM Room G (A4)

オーガナイザ:鈴木 雅大、岩澤 有祐、河野 慎、熊谷 亘、松嶋 達也、森 友亮、松尾 豊

5:40 PM - 6:00 PM

[1G5-OS-21b-03] World models using latent diffusion model

Eitaro Yamatsuta1, 〇Fumiya Uchiyama2, Reiya Sekido3, Yuto Kawahara4, Masahiro Suzuki5, Yutaka Matsuo5 (1. Graduate School of Engineering, Osaka University, 2. The University of Tokyo, 3. Hokkaido University of Science, 4. Santa Moica College, 5. Graduate School of Engineering, the University of Tokyo)

Keywords:World models, Latent diffusion model, Rainforced Learning

World models model the external world from limited information and can be used to predict future external states and observations for learning. In spatio-temporal prediction, reinforcement learning methods using deep generative models have attracted attention. In generative models, Imagen and Stable-diffusion based on diffusion models are known for their high image generation capability. In this study, we propose a method to generate a better latent representation from the hidden states of LSTM by changing the vision part of World Models from conventional β-VAE to latent diffusion model, and compare these methods.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password