The 82nd JSAP Autumn Meeting 2021

Presentation information

Oral presentation

FS Focused Session "AI Electronics" » FS.1 Focused Session "AI Electronics"

[12p-S101-1~15] FS.1 Focused Session "AI Electronics"

Sun. Sep 12, 2021 1:00 PM - 5:30 PM S101 (Oral)

Takao Marukame(Toshiba), Megumi Akai(北大)

1:30 PM - 1:45 PM

[12p-S101-2] Analysis of time discount in photonic reinforcement learning

Honoka Shiratori1, 〇Takashi Urushibara2, Nicolas Chauvet1,2, Satoshi Sunada3, Kazutaka Kanno4, Atsushi Uchida4, Ryoichi Horisaki1,2, Makoto Naruse1,2 (1.Faculty of Eng., Univ. Tokyo,, 2.Grad. School of Info. Sci. & Technol., Univ., 3.Kanazawa Univ., 4.Saitama Univ.)

Keywords:Laser chaos, Reinforcement learning, Time discount

In the proceeding study, we proposed a method for multi-state reinforcement learning utilizing chaotic laser time series and demonstrated the fastness and smartness of our method, comparing it with a conventional method of Q-learning in the Cart-pole balancing situation. At that time, we used exponential time discount for the penalty of failure of the Cart-Pole. In this study, we explore other functional types for time discount and find that the type of function is not so important and what is more critical is the timing for the discount value to converge to 0, in terms of fastness of learning.