JSAI2023

Presentation information

General Session

General Session » GS-2 Machine learning

[3R5-GS-2] Machine learning

Thu. Jun 8, 2023 3:30 PM - 5:10 PM Room R (602)

座長:漥澤 駿平(NEC) [オンライン]

3:30 PM - 3:50 PM

[3R5-GS-2-01] AlphaZeRS that Efficiently Target-oriented Search

〇Ryoji Sakuraoka1, Syuichi Arimura2, Yu Kono1, Tatsuji Takahasi1 (1. Tokyo Denki University, 2. Gradueate school of Tokyo Denki University)

Keywords:Reinforcement Learning, Neural Network, Deep Reinforcement Learning

Tree search is still important in the field of AI for Player versus Player, and AlphaZero combines tree search with machine learning.
On the other hand, AI is not only pursues simple performance but
also adjusts the difficulty level according to the opponent, is also considered important in servces. What is the most important a fighting style always achieves a desired win rate against the opponent, so AI is needed to achieve a natural objective win rate level
Risk-sensitive Satisficing (RS) is algorithm for target-oriented exploration.
we proposed AlphaZeRS, which changes the evaluation function of AlphaZero from PUCT to RS. RS feature quick search and discovery to the objective level, which may reduce the number of nodes. In this paper, we tested AlphaZeRS in terms of achieving the target win probability level against opponents of different strengths and saving node deployment through simulations of two-player games.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password