JSAI2023

Presentation information

General Session

General Session » GS-5 Language media processing

[2E6-GS-6] Language media processing

Wed. Jun 7, 2023 5:30 PM - 7:10 PM Room E (A2)

座長:中山 英樹(東京大学) [現地]

5:30 PM - 5:50 PM

[2E6-GS-6-01] Usage classification of te-form subordinate clauses in Japanese

〇Sakiho Noguchi1, Ribeka Tanaka1, Daisuke Bekki1 (1. Ochanomizu University)

Keywords:subordinate clauses, Usage classification, Natural Language Processing

Te-form subordinate clauses are common expressions in Japanese that show multiple usages; thus, it is an important task in natural language processing to automatically determine the usage of these clauses. In this study, we designed an annotation guideline and manually classified the usage of te-form subordinate clauses. Various classifications have been proposed regarding te-form subordinate clauses. However, in attempts to create annotation guidelines, it tends to be difficult for non-linguist to make consistent judgments. Therefore, we design an annotation guideline using "linguistic tests." Linguistic tests include operations such as determining whether a target expression can be paraphrased, which we claim to reduce the variations of judgments.
Moreover, we implement and train a neural classifier based on the BERT language model using the annotated corpus, which automatically classify the usage of te-forms.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password