JSAI2023

Presentation information

General Session

General Session » GS-2 Machine learning

[1B4-GS-2] Machine learning

Tue. Jun 6, 2023 3:00 PM - 4:40 PM Room B (Civic hall B)

座長:井田 安俊(NTT) [現地]

4:00 PM - 4:20 PM

[1B4-GS-2-04] Natural reinforcement learning based on Stochastic policy

〇Takumi Suzuki1, Shunpei Koshikawa2, Tatsuji Takahashi1, Yu Kono1 (1. Tokyo Denki University, 2. Graduate School of Tokyo Denki University)

Keywords:Reinforcement Learning, Machine Learning

In recent years, much attention has been given to deep reinforcement learning, which is one of the artificial intelligence technologies that combines reinforcement learning and deep learning. Deep reinforcement learning, for example, has already shown better performance than humans in games such as Go and Atari video games. Whereas, the progress of its application to real-world tasks beyond artificially limited environments has been slow, and this fact may mean the necessity of other approaches. We focused, in this study, on natural reinforcement learning, which sets an aspiration level and finds quality in rewards. Risk-sensitive Satisficing (RS), an algorithm for natural reinforcement learning, has already demonstrated certain target-oriented exploration and its efficiency in table-based reinforcement learning. However, the current RS employs a Deterministic policy, meaning the difficulty of its application to using probability distributions which deep reinforcement learning draws on. In this study, we extended the Deterministic policy to a Stochastic policy, and verified whether its performances are as good as those of existing table-based reinforcement learning tasks.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password