JSAI2023

Presentation information

Organized Session

Organized Session » OS-21

[2G4-OS-21d] 世界モデルと知能

Wed. Jun 7, 2023 1:30 PM - 3:10 PM Room G (A4)

オーガナイザ:鈴木 雅大、岩澤 有祐、河野 慎、熊谷 亘、松嶋 達也、森 友亮、松尾 豊

2:30 PM - 2:50 PM

[2G4-OS-21d-04] A World Model for Learning Policies Utilizing Prior Knowledge in Similar Graph Environments

〇Kazuki Kawamura1, Hayato Ikenochi3, Shunya Ishikawa2, Ayana Murakami4, Makoto Kawano1, Yutaka Matsuo1 (1. The University of Tokyo, 2. The University of Electro-Communications, 3. Ehime University, 4. Ochanomizu University)

Keywords:World Models, Deep Learning, Reinforcement Learning, Graph Neural Network, Game AI

In this paper, we introduce a reinforcement learning method based on a world model that finds the optimal policy in an environment represented by a graph. There are many environments in virtual and real worlds that are represented by graphs, such as games, transportation networks, knowledge graphs, social networks, and communication networks. Although there are several methods for finding the optimal policy in these environments, existing research has not been able to utilize prior knowledge from similar environments when learning new policies. Therefore, in this study, we propose a method for learning better policies in environments represented by graphs when knowledge of the environment is acquired. We also show that the proposed method outperforms a simple search method without prior knowledge by simulating a maze game represented by a graph.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password