JSAI2024

Presentation information

General Session

General Session » GS-5 Language media processing

[2G5-GS-6] Language media processing:

Wed. May 29, 2024 3:30 PM - 5:10 PM Room G (Room 22+23)

座長:牧田光晴(LINEヤフー株式会社/SB Intuitions株式会社)

4:30 PM - 4:50 PM

[2G5-GS-6-04] A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization

〇KuanChao Chu1, Yi-Pei Chen1, Hideki Nakayama1 (1. The University of Tokyo)

Keywords:Large Language Models, Prompt, Conversation Evaluation

This research investigates prompt designs of evaluating generated texts using large language models (LLMs). While LLMs are increasingly used for scoring various inputs, creating effective prompts for open-ended text evaluation remains challenging due to model sensitivity and subjectivity in evaluation of text generation. Our study experimented with different prompt structures, altering the sequence of output instructions and including explanatory reasons. We found that the order of presenting reasons and scores significantly influences LLMs' scoring, with a "reason-first" approach yielding more comprehensive evaluations. This insight is crucial for enhancing the accuracy and consistency of LLM-based evaluations.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password