JSAI2023

Presentation information

General Session


[3A5-GS-6] Language media processing

Thu. Jun 8, 2023 3:30 PM - 5:10 PM Room A (Main hall)

Chair: Hitomi Yanaka (University of Tokyo) [On-site]

4:50 PM - 5:10 PM

[3A5-GS-6-05] Verification of Applicability of a Japanese Corpus containing Information on Social Situations to Machine Learning Models

〇Muxuan Liu1,2, Tatsuya Ishigaki2, Yui Uehara2, Yusuke Miyao3,2, Hiroya Takamura2, Ichiro Kobayashi1,2 (1. Ochanomizu University, 2. National Institute of Advanced Industrial Science and Technology, 3. Univ. of Tokyo)

Keywords: Social Situation, Multi-label Classification

This study proposes a straightforward way of capturing language use in social situations and shows that it enables more accurate linguistic analysis. Using our corpus, which was constructed based on Systemic Functional Linguistics, we built a highly accurate model that classifies text according to the social situation it reflects. Specifically, we used a business email corpus to perform multi-label classification of annotation labels based on social context, and evaluated the performance of the resulting classification model. By measuring the accuracy of the classifier, we discuss the impact of the corpus annotation labels on model performance. The results are expected to provide useful insights for social-situation-based linguistic analysis and natural language processing.
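
As an illustration of how the accuracy of a multi-label classifier can be measured, the following is a minimal sketch and not the authors' evaluation code. The label names and the example predictions are hypothetical, and the two metrics (exact-match accuracy and micro-averaged F1) are common choices assumed here; the abstract does not state which metrics were used.

```python
from typing import List, Set


def subset_accuracy(gold: List[Set[str]], pred: List[Set[str]]) -> float:
    """Fraction of texts whose predicted label set equals the gold set exactly."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def micro_f1(gold: List[Set[str]], pred: List[Set[str]]) -> float:
    """Micro-averaged F1: pool true/false positives and false negatives over all texts."""
    tp = sum(len(g & p) for g, p in zip(gold, pred))  # labels correctly predicted
    fp = sum(len(p - g) for g, p in zip(gold, pred))  # labels predicted but not in gold
    fn = sum(len(g - p) for g, p in zip(gold, pred))  # gold labels that were missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical annotations for three emails (label names are invented for illustration).
gold = [{"request", "formal"}, {"apology"}, {"request"}]
pred = [{"request", "formal"}, {"apology", "formal"}, {"thanks"}]

print(f"subset accuracy: {subset_accuracy(gold, pred):.3f}")
print(f"micro F1:        {micro_f1(gold, pred):.3f}")
```

Exact-match accuracy is strict, since a text counts as correct only if every label matches, whereas micro F1 gives partial credit per label, so reporting both gives a fuller picture of multi-label performance.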
