JSAI2021

Presentation information

General Session

General Session » GS-5 Language media processing

[3J4-GS-6c] 言語メディア処理:言語モデル

Thu. Jun 10, 2021 3:20 PM - 5:00 PM Room J (GS room 5)

座長:人見 雄太(Insight Edge)

3:40 PM - 4:00 PM

[3J4-GS-6c-02] Detection of the citation-worthiness using BERT and its error analysis

〇Kohji Dohsaka1, Hiromi Narimatsu2, Kohei Koyama3, Ryuichiro Higashinaka2, Yasuhiro Minami3, Daigo Tamori4, Hirotoshi Taira4 (1. Akita Prefectural University, 2. NTT Communication Science Laboratories, 3. The University of Electro-Communications, 4. Osaka Institute of Technology)

Keywords:Thesis writing support, Citation worthy determination, Language model

Due to the explosive increase in academic papers and the need to cite appropriate references in writing papers, research on paper writing support has been conducted. In this paper, we focus on the citation-worthiness task of detecting which sentences need a citation. First, we developed a detection model based on transfer learning of the large-scale language model BERT that uses the existing Citation Worthiness dataset, and we obtained a significant performance improvement over the conventional method using convolutional neural networks. Next, we developed a detection model for each citation function using the Citation Function dataset. The evaluation results showed that the detection performance of citation-worthiness varies by citation functions. The citation functions like ``Background,'' expressed in various expressions, tended to lower performance than those like ``Compare & Contrast,'' expressed in limited surface forms. The error analysis indicated the necessity of a detection model that allows for the citation context.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password