JSAI2022

Presentation information

General Session

[2B5-GS-6] Language media processing: learning / inference

Wed. Jun 15, 2022 3:20 PM - 5:00 PM Room B (Room C-1)

Chair: 竹岡 邦紘 (NEC) [On-site]

4:20 PM - 4:40 PM

[2B5-GS-6-04] Transfer Learning from Other Languages to Japanese Dialogue Response Generation

〇Yusaku Yanase1, Itsugun Cho1, Hiroaki Saito1 (1. Keio University)

[Online]

Keywords: Transfer Learning, Non-task oriented dialogue response generation, ChatBot

Compared with English and Chinese, few high-quality, publicly available Japanese corpora exist for chat dialogue response generation. To achieve sufficiently high performance in chat dialogue generation from small, low-quality data, this study applied transfer learning from Chinese and English to a Transformer-based model. Three Japanese corpora and a corpus collected from Twitter were used as the dataset for generating the input sentences. As an automatic evaluation metric, the average distinct-1 score of the generated responses was 0.368 without transfer learning and 0.412 with transfer learning. In human evaluation on a small training set of 9,343 dialogue sentences, the model with transfer learning scored significantly better than the model without it on all three criteria: sentence connection, informativeness, and humanness.
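
For readers unfamiliar with the distinct-1 metric cited in the abstract, the following is a minimal sketch of one common way to compute it: the ratio of unique unigrams to total unigrams in a response, averaged over responses. The abstract does not say whether the authors averaged per response or computed the ratio over the whole output, nor which tokenizer they used, so the per-response averaging, the function names `distinct_n` and `mean_distinct_n`, and the pre-tokenized sample responses are all assumptions made for illustration.

```python
from typing import Sequence


def distinct_n(tokens: Sequence[str], n: int = 1) -> float:
    """Ratio of unique n-grams to total n-grams in one token sequence."""
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    if not ngrams:
        # No n-grams (empty or too-short response): define the score as 0.
        return 0.0
    return len(set(ngrams)) / len(ngrams)


def mean_distinct_n(responses: Sequence[Sequence[str]], n: int = 1) -> float:
    """Average of per-response distinct-n scores (assumed averaging scheme)."""
    scores = [distinct_n(tokens, n) for tokens in responses]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical, already-tokenized responses. Japanese text would first need
# word segmentation; that step is outside the scope of this sketch.
responses = [
    ["そう", "です", "ね"],
    ["はい", "はい", "そう", "です"],
]

if __name__ == "__main__":
    print(mean_distinct_n(responses, n=1))
```

A higher score means the generated responses repeat fewer tokens, which is why the metric is used as a rough proxy for lexical diversity.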
