Improving Neural Machine Translation by Incorporating Hierarchical Subword Features

Makoto Morishita

9:00 AM - 10:40 AM

[4Pin1-09] Improving Neural Machine Translation by Incorporating Hierarchical Subword Features

〇Makoto Morishita¹, Jun Suzuki¹, Masaaki Nagata¹ (1. NTT Communication Science Laboratories)

Keywords:Machine Translation, Natural Language Processing, Deep Learning

This paper focuses on the subword-based neural machine translation (NMT).
We hypothesize that the appropriate subword-units for three modules in the NMT model, namely, (1) encoder embedding layer, (2) decoder embedding layer, and (3) decoder output layer, can differ each other.
We empirically investigate our assumption and find that incorporating several different subword-units for input and output embedding layers can consistently improve the BLEU score on the IWSLT 2012, 2013 and 2014 evaluation datasets for De-En, En-De, Fr-En, and En-Fr translation tasks.

Presentation information

[4Pin1] インタラクティブ(2)

[4Pin1-09] Improving Neural Machine Translation by Incorporating Hierarchical Subword Features