JSAI2022

Presentation information

General Session

General Session » GS-5 Language media processing

[1P4-GS-6] Language media processing: basic theory

Tue. Jun 14, 2022 2:20 PM - 4:00 PM Room P (Online P)

座長:岡嶋 穣(NEC)[遠隔]

3:20 PM - 3:40 PM

[1P4-GS-6-04] A Study of Reward Functions Suitable for Reinforcement Learning in Machine Translation

〇Yuki Nakatani1, Tomoyuki Kajiwara1, Takashi Ninomiya1 (1. Ehime University)

[[Online]]

Keywords:Machine Translation, Reinforcement Learning

In text generation tasks such as machine translation, models are generally trained using cross-entropy loss.However, mismatches between the loss function and the evaluation metric are often problematic.It is known that this problem can be addressed by direct optimization to the evaluation metric with reinforcement learning.In machine translation, previous studies have used BLEU to calculate rewards for reinforcement learning, but BLEU is not well correlated with human evaluation.In this study, we investigate the impact on machine translation quality through reinforcement learning based on evaluation metrics that are more highly correlated with human evaluation.Experimental results show that reinforcement learning based on BERT trained on the STS task can improve various evaluation metrics.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password