2:20 PM - 2:40 PM
[1G2-GS-2a-04] Simulation study of Stochastic Risk-sensitive Satisificing policy which is based on non-satisfaction equilibrium
Keywords:Reinforcement learning, Machine learning, Bandit Problem, Satisficing
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.