JSAI2023

Presentation information

General Session


[3T1-GS-6] Language media processing

Thu. Jun 8, 2023 9:00 AM - 10:40 AM Room T (Online)

Chair: Tomoyuki Kajiwara (Ehime University) [On-site]

9:20 AM - 9:40 AM

[3T1-GS-6-02] Verification of Chain-of-Thought Prompting in Japanese

〇Kaito Horio (1), Eiki Murata (1), Hao Wang (1), Tatuya Ide (1), Daisuke Kawahara (1), Takato Yamazaki (2), Kenta Shinzato (2), Akifumi Nakamachi (2), Shengzhe Li (2), Toshinori Sato (2) (1. Waseda University, 2. LINE Corporation)

[[Online]]

Keywords: NLP

Foundation models can be adapted to various tasks through few-shot learning, in which a small number of examples are supplied as a prompt. Chain-of-Thought (CoT) prompting, which breaks the reasoning process into intermediate steps, has been proposed to improve few-shot learning. Although CoT has been shown to be effective on English tasks that require logical reasoning, its effectiveness has not been verified in Japanese. We examine the effectiveness of CoT in Japanese using HyperCLOVA JP, a Japanese foundation model. We first construct Japanese datasets for three tasks: arithmetic, commonsense, and symbolic reasoning. We then conduct experiments using HyperCLOVA models of four different sizes. The results show that CoT prompts yield higher accuracy than standard prompts and that the performance of CoT prompts correlates with model size.
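
For readers unfamiliar with the distinction drawn in the abstract, the following is a minimal sketch of how a standard few-shot prompt and a CoT few-shot prompt might differ. It is not the authors' implementation: the `Example` class, the `build_prompt` function, the "Q:/A:" template, and the sample arithmetic problems are all hypothetical, written in English for readability, and not taken from the paper's Japanese datasets.

```python
from dataclasses import dataclass


@dataclass
class Example:
    """One few-shot demonstration (hypothetical structure, for illustration only)."""
    question: str
    reasoning: str  # intermediate steps; used only when building a CoT prompt
    answer: str


def build_prompt(examples: list[Example], question: str, use_cot: bool) -> str:
    """Build a few-shot prompt that ends with an unanswered question.

    With use_cot=True, each demonstration shows its reasoning steps before
    the final answer; otherwise only the final answer is shown.
    """
    parts = []
    for ex in examples:
        if use_cot:
            parts.append(f"Q: {ex.question}\nA: {ex.reasoning} The answer is {ex.answer}.")
        else:
            parts.append(f"Q: {ex.question}\nA: The answer is {ex.answer}.")
    # The model is expected to continue the text after the final "A:".
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)


# Invented demonstration data (assumption: not from the paper).
EXAMPLES = [
    Example(
        question="Taro has 3 apples and buys 2 more. How many apples does he have?",
        reasoning="Taro starts with 3 apples. He buys 2 more, so 3 + 2 = 5.",
        answer="5",
    ),
]
NEW_QUESTION = "Hanako has 10 pencils and gives away 4. How many pencils does she have?"

standard_prompt = build_prompt(EXAMPLES, NEW_QUESTION, use_cot=False)
cot_prompt = build_prompt(EXAMPLES, NEW_QUESTION, use_cot=True)

print(standard_prompt)
print("-----")
print(cot_prompt)
```

The two prompts differ only in whether the demonstrations spell out intermediate steps, which is the variable the abstract describes comparing across model sizes.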
