JSAI2019

Presentation information

General Session

General Session » [GS] J-9 Natural language processing, information retrieval

[3C4-J-9] Natural language processing, information retrieval: correction of documents

Thu. Jun 6, 2019 3:50 PM - 4:50 PM Room C (4F International conference hall)

Chair:Yasutomo Kimura Reviewer:Yoko Nishihara

4:30 PM - 4:50 PM

[3C4-J-9-03] Misspelling Detection by using Multiple Bidirectional LSTM Networks

〇Ryo Takahashi1, Kazuma Minoda1, Akihiro Masuda2, Nobuyuki Ishikawa1 (1. Recruit Technologies Co.,Ltd., 2. PE-BANK, Inc)

Keywords:Natural Language Processing, Deep Learning, Outlier Detection

Companies in the RECRUIT Group provide matching business between clients and customers, and create lots of manuscripts every day in order to tell the attractiveness of our clients. In this paper, we propose a method for detecting misspelling in manuscripts with machine learning. That system mainly consists of two parts. One
is the multiple Bidirectional LSTM networks to estimate the probabilities of correctness in each characters. The other is the random forests algorithm to decide what sentence is correct or not by using outputs of these networks. The efficacy of our approach is demonstrated on two datasets: artificial sentences and real manuscripts created in our services.