JSAI2024

Presentation information

General Session

General Session » GS-5 Language media processing

[4A1-GS-6] Language media processing:

Fri. May 31, 2024 9:00 AM - 10:40 AM Room A (Main hall)

座長:田中 駿(JX通信社)

9:20 AM - 9:40 AM

[4A1-GS-6-02] Learning Methods for LLMs on Game Data Using RLHF

〇Tomoya Murata1, Naoki Mori2, Makoto Okada2 (1. Osaka Prefecture University, 2. Osaka Metropolitan University)

Keywords:LLM, Alignment, RLHF, BERT

Recent advancements in Large Language Models (LLMs) within the artificial intelligence domain have shown exceptional performance across various natural language processing tasks. Amidst these developments, aligning the values and objectives of LLMs with human perspectives has become increasingly important. Reinforcement Learning from Human Feedback (RLHF) has gained notable interest as a method for such alignment adjustments.
This study explored a learning approach for LLMs using RLHF, employing scenarios from the romance simulation game 'Tokimeki Memorial 3' as the game scenario data. Specifically, the research involved an experiment where sentences were generated following five Japanese characters, tailored to align with the personalities of the game characters. While subjective, this evaluation demonstrated the capability of producing sentences that appropriately matched the distinct characters in the game.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password