JSAI2025

Presentation information

General Session

General Session » GS-5 Agents

[3J4-GS-5] Agents:

Thu. May 29, 2025 1:40 PM - 3:20 PM Room J (Room 1005)

座長:林 宏樹(雲雀丘学園中高等学校)

1:40 PM - 2:00 PM

[3J4-GS-5-01] The Power of Perturbation under Sampling in Solving Extensive-Form Games

〇Wataru Masaka1, Mitsuki Sakamoto2, Kenshi Abe2, Kaito Ariu2, Atsushi Iwasaki1 (1. The University of Electro-Communications, 2. CyberAgent,Inc.)

Keywords:Imperfect-Information Extensive-Form Games, Equilibrium Computation, Follow the Regularized Leader

This paper investigates how perturbation does and does not improve the Follow-the-Regularized-Leader (FTRL) algorithm in imperfect-information extensive-form games. Perturbing the expected payoffs guarantees that the FTRL dynamics reach an approximate equilibrium, and proper adjustments of the magnitude of the perturbation lead to a Nash equilibrium (last-iterate convergence). This approach is robust even when payoffs are estimated using sampling, which is common in large games.
Building upon those insights, we first develop a general framework for perturbed FTRL algorithms under sampling. We then empirically show that in the last-iterate sense, the perturbed FTRL consistently outperforms the non-perturbed FTRL. We further identify a divergence function that reduces the variance of the estimates for perturbed payoffs, with which it significantly outperforms the prior algorithms on Leduc poker (whose structure is more asymmetric in a sense than that of the other benchmark games) and consistently performs smooth convergence behavior on all the benchmark games.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password