The Power of Perturbation under Sampling in Solving Extensive-Form Games

Wataru Masaka

1:40 PM - 2:00 PM

[3J4-GS-5-01] The Power of Perturbation under Sampling in Solving Extensive-Form Games

〇Wataru Masaka¹, Mitsuki Sakamoto², Kenshi Abe², Kaito Ariu², Atsushi Iwasaki¹ (1. The University of Electro-Communications, 2. CyberAgent,Inc.)

Keywords:Imperfect-Information Extensive-Form Games, Equilibrium Computation, Follow the Regularized Leader

This paper investigates how perturbation does and does not improve the Follow-the-Regularized-Leader (FTRL) algorithm in imperfect-information extensive-form games. Perturbing the expected payoffs guarantees that the FTRL dynamics reach an approximate equilibrium, and proper adjustments of the magnitude of the perturbation lead to a Nash equilibrium (last-iterate convergence). This approach is robust even when payoffs are estimated using sampling, which is common in large games.
Building upon those insights, we first develop a general framework for perturbed FTRL algorithms under sampling. We then empirically show that in the last-iterate sense, the perturbed FTRL consistently outperforms the non-perturbed FTRL. We further identify a divergence function that reduces the variance of the estimates for perturbed payoffs, with which it significantly outperforms the prior algorithms on Leduc poker (whose structure is more asymmetric in a sense than that of the other benchmark games) and consistently performs smooth convergence behavior on all the benchmark games.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[3J4-GS-5] Agents:

[3J4-GS-5-01] The Power of Perturbation under Sampling in Solving Extensive-Form Games

Password