JSAI2025

Presentation information

Organized Session

[3R5-OS-31] OS-31

Thu. May 29, 2025 3:40 PM - 5:20 PM Room R (Room 805)

Organizers: 三宅 陽一郎 (Rikkyo University), 濱田 直希 (KLab)

4:00 PM - 4:20 PM

[3R5-OS-31-02] Combining IMPALA and Demonstrations to Solve Problems with Hard Exploration and Large State-Action Spaces

Application examples in Yu-Gi-Oh! MASTER DUEL

〇Sora Satake1, Soichiro Hattori1, Naoya Kihara1 (1. Konami Digital Entertainment Co., Ltd.)

Keywords: Digital Game, AI, Reinforcement Learning, Game AI

Applying deep reinforcement learning to current digital games often runs into two obstacles: exploration is difficult, and the state-action space is vast. When a large number of play logs is available, imitation learning can mitigate both. However, when sufficient logs are hard to collect, for example while a game is still in development or during events with different regulations, imitation learning may not be feasible. In this study, we propose a method for efficient deep reinforcement learning on the IMPALA architecture that guides exploration with small-scale demonstrations that developers can create manually. We show that modifying the correction calculation in V-trace makes it possible to learn hard exploration problems quickly. We also used the proposed method to train an AI for current competitive digital games; the resulting AI is as strong as the existing rule-based AI.
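
For readers unfamiliar with the correction calculation mentioned above, the following is a minimal sketch of the standard V-trace value target as used in IMPALA. It is not the modified correction proposed in this presentation, which the abstract does not specify. The function name `vtrace_targets`, its parameters, and the default clipping thresholds are assumptions made for illustration only.

```python
import math


def vtrace_targets(behaviour_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Standard V-trace value targets for a single trajectory.

    All sequence arguments are lists of floats of equal length n.
    behaviour_logp / target_logp hold log-probabilities of the taken
    actions under the behaviour policy and the learner (target) policy.
    This is an illustrative sketch, not the presenters' modified version.
    """
    n = len(rewards)
    # Importance ratios pi(a|x) / mu(a|x), computed from log-probabilities.
    ratios = [math.exp(t - b) for t, b in zip(target_logp, behaviour_logp)]
    rhos = [min(rho_bar, r) for r in ratios]  # clipped weights for TD errors
    cs = [min(c_bar, r) for r in ratios]      # clipped trace coefficients
    next_values = list(values[1:]) + [bootstrap_value]
    # Importance-weighted temporal-difference errors.
    deltas = [rhos[t] * (rewards[t] + gamma * next_values[t] - values[t])
              for t in range(n)]
    # Backward recursion: v_t - V(x_t) = delta_t + gamma * c_t * (v_{t+1} - V(x_{t+1})).
    targets = [0.0] * n
    acc = 0.0
    for t in reversed(range(n)):
        acc = deltas[t] + gamma * cs[t] * acc
        targets[t] = values[t] + acc
    return targets
```

When the behaviour and target policies agree, every ratio equals 1 and the targets reduce to ordinary n-step returns. When the target policy assigns almost no probability to the taken actions, the targets stay close to the current value estimates. A demonstration-guided variant would presumably change how these weights are computed for demonstration data, but that detail is not given here.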
