JSAI2023

Presentation information

Poster Session

General Session » Poster session

[3Xin4] Poster session 1

Thu. Jun 8, 2023 1:30 PM - 3:10 PM Room X (Exhibition hall B)

[3Xin4-28] Extended Bayesian Inference in the Multi-armed Bandit Problem

〇DAIKI MORITA1, Nobuhito Manome2, Tatsuji Takahashi1, Shuji Shinohara1,2 (1.Tokyo Denki University, 2.Graduate School of Engineering, The University of Tokyo)

Keywords:Reinforcement Learning, Bayesian Inference, Bandit problem

Bayesian inference is a statistical inference method that can probabilistically infer the process of data generation from observed data, and is one of the fundamental technologies that play an important role in machine learning models. In Bayesian inference, however, past information is evaluated in the same way as current information in the estimation process. As a consequence of this fact, in non-stationary environments where the state of the target changes during the estimation process, conventional Bayesian inference may not be so effective. For example, in the bandit problem, which is a kind of reinforcement learning problem, the Thompson sampling algorithm, a Bayesian inference-based policy, performs particularly well in steady-state environments, but performs extremely poorly in non-stationary environments. In a previous study, the extended Bayesian inference method was proposed that introduces forgetting and learning rate parameters into Bayesian inference. In this study, we use the extended Bayesian inference-based algorithm for the Bandit problem and conduct experiments in steady-state and non-stationary environments to validate the algorithm.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password