JSAI2020

Presentation information

General Session » J-2 Machine learning

[2I5-GS-2] Machine learning: Cognition and decision support

Wed. Jun 10, 2020 3:50 PM - 5:30 PM Room I (jsai2020online-9)

Chair: Atsushi Keyaki (Denso IT Laboratory, Inc.)

4:10 PM - 4:30 PM

[2I5-GS-2-02] Function approximation of Cognitive Satisficing Value Function

〇Yuki Yoshii (1), Yu Kono (1), Tatsuji Takahashi (1) (1. Tokyo Denki University)

Keywords: reinforcement learning, contextual bandit problem, decision making

Humans show a decision-making tendency called satisficing: they stop exploring once they find an option that exceeds a criterion (the aspiration level). The Risk-sensitive Satisficing (RS) model is a value function that enables efficient non-random exploration and realizes satisficing in reinforcement learning (Tamatsukuri & Takahashi, 2019). To apply RS to continuous state spaces, we extended it to Linear RS (LinRS), which uses function approximation, and tested its performance on contextual bandit problems. The results showed that LinRS outperformed existing algorithms in probabilistic environments. They also showed that the aspiration level needs to be corrected to compensate for approximation error.
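
The abstract does not give the RS formula or the LinRS update, so the following is only a minimal sketch of the tabular idea, assuming the commonly cited bandit form RS_i = (n_i / N)(E_i - aspiration), where n_i is the number of pulls of arm i, N the total number of pulls, and E_i the mean reward of arm i. The function names, the Bernoulli reward setup, and the tie-breaking rule are illustrative assumptions, not the authors' implementation, and the sketch does not cover the linear function approximation used in LinRS.

```python
import random


def rs_values(counts, means, aspiration):
    """RS value per arm under the assumed form (n_i / N) * (E_i - aspiration)."""
    total = sum(counts)
    if total == 0:
        # No pulls yet: every arm is equally (un)reliable.
        return [0.0 for _ in counts]
    return [(n / total) * (e - aspiration) for n, e in zip(counts, means)]


def run_rs_bandit(probs, aspiration, steps, seed=0):
    """Greedy selection on RS values for a Bernoulli bandit (hypothetical setup).

    probs: success probability of each arm (assumed environment).
    Returns the pull counts and the empirical mean reward of each arm.
    """
    rng = random.Random(seed)
    k = len(probs)
    counts = [0] * k
    means = [0.0] * k
    for _ in range(steps):
        values = rs_values(counts, means, aspiration)
        # No random exploration: always take the arm with the largest RS value
        # (ties go to the lowest index).
        arm = max(range(k), key=lambda i: values[i])
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the empirical mean reward.
        means[arm] += (reward - means[arm]) / counts[arm]
    return counts, means


if __name__ == "__main__":
    counts, means = run_rs_bandit([0.3, 0.8], aspiration=0.6, steps=1000)
    print(counts, means)
```

Under this assumed form, an arm whose mean reward is below the aspiration level gets a more negative value the more often it is pulled, which pushes the agent toward less-tried arms; once an arm exceeds the aspiration level, its value grows with its pull share and the agent settles on it.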
