Multiplicative Weight Update with Mutation in Two-Player Zero-Sum Extensive-Form Games

Mitsuki Sakamoto

1:50 PM - 2:10 PM

[2T4-GS-5-02] Multiplicative Weight Update with Mutation in Two-Player Zero-Sum Extensive-Form Games

〇Mitsuki Sakamoto¹, Kenshi Abe², Kaito Ariu², Atsushi Iwasaki¹ (1. The University of Electro-Communications, 2. CyberAgent, Inc.)

[[Online]]

Keywords:game theory, Extensive-form games, equilibrium, learning

In this study, we propose a multiplicative weight update algorithm that utilizes mutations in two-player zero-sum extensive-form games. These games are important models for decision-making under imperfect information. While equilibria in these games can be computed using linear programming, it becomes challenging to handle large-scale games such as poker. To address this issue, learning algorithms for finding an (approximate) equilibrium have been proposed. However, most of the existing algorithms converge to Nash equilibrium through time-averaged strategies. In normal-form games, it has been shown that introducing mutations allows for learning equilibrium strategies without taking time averages. Inspired by that, we propose the Dilated Mutant Multiplicative Weight Update with the introduction of mutations in extensive-form games. The experimental results show that the proposed method can learn equilibrium strategies without computing time averages for multiple settings.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[2T4-GS-5] Agents

[2T4-GS-5-02] Multiplicative Weight Update with Mutation in Two-Player Zero-Sum Extensive-Form Games

Password