JSAI2024

Presentation information

Organized Session

Organized Session » OS-10

[1J4-OS-10b] OS-10

Tue. May 28, 2024 3:00 PM - 4:40 PM Room J (Room 43)

オーガナイザ:砂山 渡(滋賀県立大学)、森 辰則(横浜国立大学)、高間 康史(東京都立大学)、笹嶋 宗彦(兵庫県立大学)、西原 陽子(立命館大学)

3:40 PM - 4:00 PM

[1J4-OS-10b-03] Pseudo Training Data Generation for Automatic Aggregation of Open-Ended Questionnaire Responses by Large Language Models

〇Ryo Hasegawa1, Yuki Zenimoto1, Takehito Utsuro1, Hiromitsu Nishizaki2, Masaharu Yoshioka3, Noriko Kando4 (1. University of Tsukuba, 2. University of Yamanashi, 3. Hokkaido University, 4. National Institute of Informatics)

Keywords:large language models, open-ended questionnaire, pseudo data, automatic aggregation

Analyzing surveys utilizing open-ended responses to questionnaires is a
valuable approach to elucidating respondents' perspectives and opinions,
thereby gaining insights. However, the analysis of responses on a large
scale necessitates a considerable amount of manual labor. Thus, this
paper takes an approach of automating the analysis of open-ended
responses using large language models. We have generated several types
of pseudo data for training category classification models and evaluated
the performance of the models trained on each dataset. Through this
process, we examine the performance improvements of category
classification models using the pseudo datasets automatically generated
and annotated by large language models. Evaluation results show that,
through several operations of pseudo open-ended responses, we improved
the category classification performance against real open-ended
responses from 77% to 83%.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password