JSAI2024

Presentation information

General Session

[2B5-GS-2] Machine learning: Reinforcement learning

Wed. May 29, 2024 3:30 PM - 5:10 PM Room B (Concert hall)

Chair: Tadahiro Taniguchi (Kyoto University)

3:50 PM - 4:10 PM

[2B5-GS-2-02] Goal-specific state space reduction in incomplete information games

〇Kazuki Takahashi¹, Tomoki Fukai², Yutaka Sakai³, Takashi Takekawa¹ (1. Kogakuin University, 2. Okinawa Institute of Science and Technology, 3. Tamagawa University)

Keywords: Reinforcement learning, Bayesian inference, State reduction

In incomplete information games, it is difficult to predict the opponent's strategy, so much research has focused on finding a Nash equilibrium, a strategy that is likely to win regardless of the opponent's strategy. In poker, which has a huge observation space of 10^16, Deep Neural Networks (DNNs) have been used to find Nash equilibrium strategies and have achieved performance superior to that of humans. On the other hand, with such a complex state space it is difficult to explain why a selected action is appropriate. In this study, we propose a Bayesian model that reduces a huge observation space to a concise state space, and we evaluate its performance using the incomplete information game "Vulture Culture" as a subject. As a result, the proposed method reduces an observation space of about 10^4 to a near-optimal state space. It is also shown that appropriate state space reduction facilitates prediction of the opponent's strategy and improves the learning speed of the optimal strategy.
