JSAI2024

Presentation information

International Session

International Session » IS-2 Machine learning

[3Q5-IS-2b] Machine learning

Thu. May 30, 2024 3:30 PM - 5:10 PM Room Q (Room 402)

Chair: Rafal Rzepka (Hokkaido University)

3:50 PM - 4:10 PM

[3Q5-IS-2b-02] Evaluating Japanese Language Proficiency in Large Language Models through Definition Modeling Techniques

〇Ran Li1, Edison Marrese-Taylor1,2, Yutaka Matsuo1 (1. Graduate School of Engineering, The University of Tokyo, 2. National Institute of Advanced Industrial Science and Technology)

Keywords:LLM, Japanese, Definition Modelling

With the rapid advancement of Large Language Models(LLMs), a critical issue has been to develop methods and dataset for the evaluation of their language proficiency. Among these, the task of definition modelling has recently been proposed to assess proficiency of language models in certain domains, like finance. By asking the model to generate dictionary-like definitions of a given term under controlled conditions, definition modelling evaluates the capability of lexical understanding of a given model. So far, most of such efforts have focused on English. Japanese, with a complicated writing system and vague grammatical rules, is less explored. In this paper, we propose to use the task of definition modelling to evaluate the proficiency of LLM in the Japanese language. We collect dictionary data in Japanese and use our corpus to explore the effects of different techniques of prompting in various settings.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password