Presentation information

International Session

[2A4-E-2] Machine learning: method extensions

Wed. Jun 5, 2019 3:20 PM - 5:00 PM Room A (2F Main hall A)

This room is connected to Room B.

3:40 PM - 4:00 PM

[2A4-E-2-02] Attention-masking extended deep Q network (AME-DQN) reinforcement learning algorithm for combinatory optimization of smart-grid energy

〇Dinesh Bahadur Malla3, Hioki Tomoyuki2, Kei Takahashi2, Masaru Sogabe3, Katsuyoshi Sakamoto1,2, Koichi Yamaguchi1,2, Tomah Sogabe1,2,3 (1. i-PERC, The University of Electro-Communications, 2. The University of Electro-Communications, 3. Grid Inc.)

Keywords: Attention-masking, deep Q network, combinatory optimization

Recently, deep neural network-based reinforcement learning methods, which have demonstrated unprecedented success in games and robotic control, have been gaining attention as a way to solve combinatory optimization problems. However, effective operation of a smart-grid system is subject to various constraints, such as the power demand-supply relationship, the lower and upper bounds on battery electricity, and the market price. Because of these constraints, deep reinforcement learning (DRL) algorithms are not efficient at obtaining an optimized result. In this paper, we address this issue by developing an attention-masking extended deep Q network (AME-DQN) reinforcement learning algorithm. Special focus was placed on the prediction ability of the trained AME-DQN model under various weather conditions and demand profiles. These results were further compared with mixed-integer linear programming (MILP) results, and we demonstrate that AME-DQN is able to predict optimized actions that satisfy all the constraints, whereas MILP failed to meet the conditions in most cases.
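
The abstract does not describe how the attention mask is built, so the following is only a minimal sketch of one common related idea, invalid-action masking in a Q-learning agent, and should not be read as the authors' AME-DQN. It assumes a hypothetical battery with a discrete set of charge/discharge actions and shows how actions that would violate the battery's lower and upper bounds can be excluded before the greedy action is chosen. The action set, the function names (`feasible_mask`, `masked_greedy_action`), and all numeric values are illustrative assumptions.

```python
import numpy as np

# Hypothetical discrete battery actions in kW (negative = discharge). Assumed values.
ACTIONS_KW = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])


def feasible_mask(soc_kwh, capacity_kwh, dt_h=1.0, soc_min_kwh=0.0):
    """Boolean mask of actions that keep the state of charge within its bounds."""
    next_soc = soc_kwh + ACTIONS_KW * dt_h
    return (next_soc >= soc_min_kwh) & (next_soc <= capacity_kwh)


def masked_greedy_action(q_values, mask):
    """Greedy action over feasible actions only; infeasible ones are set to -inf."""
    if not mask.any():
        raise ValueError("no feasible action for this state")
    masked_q = np.where(mask, q_values, -np.inf)
    return int(np.argmax(masked_q))


# Illustrative Q-values (in practice these would come from the trained network).
q = np.array([0.1, 0.4, 0.2, 0.9, 1.5])

# Battery almost full: charging at 2 kW for one hour would exceed the capacity.
mask = feasible_mask(soc_kwh=9.0, capacity_kwh=10.0)
action = masked_greedy_action(q, mask)
```

In this sketch the unmasked greedy choice would be the 2 kW charge action, which overshoots the assumed 10 kWh capacity; with the mask applied, the agent falls back to the best action that respects the bound. Other constraints named in the abstract, such as the demand-supply relationship, would need their own feasibility checks.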