9:00 AM - 10:40 AM
[4Pin1-25] Neural Machine Translation with Kanji Decomposition
Keywords:Neural Machine Translation, Kanji Decomposition
This paper proposes a method for neural machine translation (NMT) with kanji decomposition of Japanese text. NMT models have restrictions of the vocabulary size, which can be solved by applying subword, character-level, or byte-based models. In Japanese text, the vocabulary size would not be minimized even in a character level because of kanji varieties. We report an experimental result of NMT model using Japanese text with kanji decomposition that is expected to satisfy both of decreasing vocabulary size and keeping kanji information.