Numerical analysis of the Nash equilibria of Iterated Prisoner's Dilemma between reinforcement learning players

Takuma Torii

6:00 PM - 6:20 PM

[1P5-GS-7-03] Numerical analysis of the Nash equilibria of Iterated Prisoner's Dilemma between reinforcement learning players

〇Takuma Torii¹, Shohei Hidaka¹ (1. Japan Advanced Institute of Science and Technology)

Keywords: Reinforcement Learning, Prisoner's Dilemma, Game Theory, Mutual Cooperation

The Iterated Prisoner's Dilemma (IPD) has been a standard tool for studying social dilemmas. Because classic game-theoretic analyses of the IPD have ended in mutual defection, another class of IPDs, played between reinforcement learners, has been explored. However, the basic nature of this class of games is not yet well understood. In the present paper, we analyzed the Nash equilibria of the IPD between reinforcement learners. In the standard IPD, the only Nash equilibrium resulting from rational choices is known to be the worst outcome for both players. In contrast to both previous lines of research, our analysis showed that in the IPD with reinforcement learners, the individually rational choices correspond to the mutually beneficial outcome for both players. This result suggests that the social dilemma is dissolved between learning agents of this type.
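
As background for the claim about the standard game, the following is a minimal sketch that enumerates the pure-strategy Nash equilibria of a single Prisoner's Dilemma stage game. The payoff values (T=5, R=3, P=1, S=0) and the function name pure_nash_equilibria are assumptions chosen for illustration and are not taken from the paper. The sketch covers only the classic stage game, not the authors' analysis of reinforcement learning players.

```python
from itertools import product

# Hypothetical payoff values satisfying T > R > P > S; not from the paper.
T, R, P, S = 5, 3, 1, 0
ACTIONS = ("C", "D")  # C = cooperate, D = defect

# PAYOFF[(a1, a2)] = (payoff to player 1, payoff to player 2)
PAYOFF = {
    ("C", "C"): (R, R),
    ("C", "D"): (S, T),
    ("D", "C"): (T, S),
    ("D", "D"): (P, P),
}


def pure_nash_equilibria(payoff, actions):
    """Return action pairs where neither player gains by deviating alone."""
    equilibria = []
    for a1, a2 in product(actions, repeat=2):
        u1, u2 = payoff[(a1, a2)]
        # Player 1 cannot improve by switching action while a2 is fixed.
        best1 = all(u1 >= payoff[(b, a2)][0] for b in actions)
        # Player 2 cannot improve by switching action while a1 is fixed.
        best2 = all(u2 >= payoff[(a1, b)][1] for b in actions)
        if best1 and best2:
            equilibria.append((a1, a2))
    return equilibria


if __name__ == "__main__":
    print(pure_nash_equilibria(PAYOFF, ACTIONS))
```

Under these assumed payoffs, mutual defection is the only pair that survives the check, even though both players would receive more from mutual cooperation. This is the dilemma that the abstract reports as dissolved for reinforcement learners.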

Presentation information

[1P5-GS-7] Agents: Cooperation and game theory
