JSAI2023

Presentation information

General Session

General Session » GS-5 Language media processing

[1E4-GS-6] Language media processing

Tue. Jun 6, 2023 3:00 PM - 4:40 PM Room E (A2)

座長:長谷川 拓(NTT) [現地]

3:20 PM - 3:40 PM

[1E4-GS-6-02] Controllable Japanese Temporal Inference Dataset

〇Tomoki Sugimoto1, Yasumasa Onoe2, Hitomi Yanaka1 (1. The University of Tokyo, 2. The University of Texas at Austin)

Keywords:Natural Language Inference, Recognizing Textual Entailment, Temporal Relation Recognition, Dataset Construction

Natural Language Inference (NLI) tasks that require temporal inference remain challenging for pre-trained language models (LMs). Although various datasets have been created for this task, they primarily focus on English and do not address the need for resources in other languages. In this paper, we present a Japanese NLI benchmark for temporal inference. To begin the data annotation process, we create inference templates consisting of various inference patterns based on the formal semantics test suites. We then automatically generate diverse NLI examples by assigning nouns, verbs, and temporal expressions to the templates using the Japanese case frame dictionary. We evaluate the generalization capacities of monolingual/multilingual LMs by using controlled splits of our dataset. Our findings demonstrate that LMs struggle with handling specific linguistic phenomena such as habituality.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password