Presentation information

Interactive Session

[3Rin4] Interactive 1

Thu. Jun 11, 2020 1:40 PM - 3:20 PM Room R01 (jsai2020online-2-33)

[3Rin4-09] Text normalization based on deep learning considering similarities in terms of strings and sounds

〇Riku Kawamura1, Tatsuya Aoki1, Hidetaka Kamigaito1, Hiroya Takamura1,2, Manabu Okumura1 (1.Tokyo Institute of Technology, 2.National Institute of Advanced Industrial Science and Technology)

Keywords: Text Normalization, Slang, Edit Distance

In this paper, we propose deep learning-based models that normalize text by considering the similarity between words in terms of both their strings and their sounds. In the experiments, we compare a model that considers both string and sound similarity, models that consider only string similarity or only sound similarity, and a baseline model that uses neither similarity. All of the proposed models achieved a higher F1 score than the baseline model.
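
As a rough illustration of the string-similarity idea suggested by the "Edit Distance" keyword, the sketch below computes a Levenshtein distance and scales it to a score between 0 and 1. The function names, the unit edit costs, and the length-based scaling are assumptions made for this example; the abstract does not specify how the proposed models compute or use similarity. A sound similarity could presumably be obtained in the same way over phonetic transcriptions, but that is also an assumption.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions, substitutions each cost 1 (assumed costs)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb, or match
            ))
        prev = curr
    return prev[-1]


def string_similarity(a: str, b: str) -> float:
    """Hypothetical similarity in [0, 1]: 1 minus distance divided by the longer length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - edit_distance(a, b) / longest
```

For example, `string_similarity("tmrw", "tomorrow")` scores a slang form against a candidate normalized word; a higher score means the two strings are closer.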

