The Implicit Reward of the Entropy Regularizer in Signaling Games

Ryo Ueda

12:00 PM - 12:20 PM

[4H2-OS-6a-01] The Implicit Reward of the Entropy Regularizer in Signaling Games

〇Ryo Ueda¹ (1. The University of Tokyo)

Keywords:Emergent Language, Emergent Communication

This paper focuses on the auxiliary objective function, the entropy regularizer, used in signaling game optimization, and to show its implicit reward function. The signaling game is a very simple communication model used in the field of language emergence. The entropy regularizer is used to aid the agents' search when optimizing signaling games via reinforcement learning techniques. However, this auxiliary function is introduced ad hoc, and thus the reward function implicitly assumed therein is unclear. It may also hinder mathematical discussions in this research field. We clarify the implicit reward function of the entropy regularization term to make the agent's optimization target more explicit. In addition, we discuss the entropy maximizer which is a similar auxiliary objective to the entropy regularizer. We hope that our paper will trigger mathematical discussions in the field of language emergence.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[4H2-OS-6a] 言語とコミュニケーションの創発～記号創発システムから共創的言語進化まで～

[4H2-OS-6a-01] The Implicit Reward of the Entropy Regularizer in Signaling Games

Password

Presentation information

[4H2-OS-6a] 言語とコミュニケーションの創発 ～記号創発システムから共創的言語進化まで～

[4H2-OS-6a-01] The Implicit Reward of the Entropy Regularizer in Signaling Games

Password

[4H2-OS-6a] 言語とコミュニケーションの創発～記号創発システムから共創的言語進化まで～