JSAI2018

Presentation information

Oral presentation

General Session » [General Session] 2. Machine Learning

[1Z1] [General Session] 2. Machine Learning

Tue. Jun 5, 2018 1:20 PM - 3:00 PM Room Z (3F Matsu Take)

座長:大塚 琢馬(NTT)

2:00 PM - 2:20 PM

[1Z1-03] Expert-based reward function training: the novel method to train sequence generators

〇Joji Toyama1, Yusuke Iwasawa1, Yutaka Matsuo1 (1. The University of Tokyo)

Keywords:Sequence modelling

The training methods of sequence generator with a combination of GAN and policy gradient has shown good performance. In this paper, we propose expert-based reward function training: the novel method to train sequence generator. Different from previous studies of sequence generation, expert-based reward function training does not utilize GAN's framework. Still, our model outperforms SeqGAN and a strong baseline, RankGAN.