JSAI2022

Presentation information

General Session

General Session » GS-5 Language media processing

[4D3-GS-6] Language media processing: applications

Fri. Jun 17, 2022 2:00 PM - 3:40 PM Room D (Room D)

座長:伊藤 友貴(三井物産)[現地]

3:20 PM - 3:40 PM

[4D3-GS-6-05] Predicting Numerals in Text Using Nearest Neighbor Language Models

〇Taku Sakamoto1, Akiko Aizawa2,1 (1. The University of Tokyo, 2. National Institute of Informatics)

[[Online]]

Keywords:neural language model, k-nearest neighbor search, masked numeral prediction

The k-nearest neighbor language model (kNN-LM) is a language model that extends pre-trained neural language models with the idea of kNN search. In this study, we apply the kNN-LM to the masked numeral prediction task (MNP), which requires predicting masked numerals from their contexts, to improve the accuracy. In our experiments, we confirmed that the kNN-LM achieves better accuracy than the base language model in the MNP. We also compared different context spans used for calculating context representations for the kNN search to improve the search accuracy by using only the words that are closely related to the masked numeral: only the mask and its surrounding words, and only the mask and its following words. As a result, we confirmed that the kNN search using only the embeddings of mask tokens for numerals is the most effective for the MNP.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password