JSAI2023

Presentation information

General Session

General Session » GS-5 Language media processing

[4A3-GS-6] Language media processing

Fri. Jun 9, 2023 2:00 PM - 3:40 PM Room A (Main hall)

座長:庵 愛(NTT) [現地]

2:00 PM - 2:20 PM

[4A3-GS-6-01] Analyzing Character-level representations for Multilingual DRS Semantic Parsing

〇Tomoya Kurosawa1, Hitomi Yanaka1 (1. The University of Tokyo)

Keywords:Discourse Representation Structures, Semantic Parsing, Character-level Information, Neural Models, Multilingual Tasks

Even in the era of massive language models, it has been suggested that character-level representations improve the performance of neural models. The state-of-the-art neural semantic parser for Discourse Representation Structures (DRSs) uses character-level representations, improving performance in all four languages on the Parallel Meaning Bank dataset. However, how and why character-level information improves the parser's performance remains unclear. This study provides in-depth analyses of performance changes by order of character sequences. In the experiments, we compare F1-scores by shuffling the order and randomizing character sequences. Our results indicate that the neural DRS parser is not sensitive to correct character order in English, German, and Dutch. Although we observe overall improvements by incorporating character-level tokens in German, Dutch, and Italian, we find hundreds of cases in which character-level tokens decrease performance.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password