JSAI2025

Presentation information

Organized Session

Organized Session » OS-45

[3R1-OS-45] OS-45

Thu. May 29, 2025 9:00 AM - 10:40 AM Room R (Room 805)

オーガナイザ:稲葉 通将(電気通信大学),東中 竜一郎(名古屋大学),徳久 良子(愛知工業大学/理化学研究所)

10:20 AM - 10:40 AM

[3R1-OS-45-05] Joint Optimization of Post-Processing for All Modules in Task-Oriented Dialogue Systems

〇Atsumoto Ohashi1, Ryuichiro Higashinaka1 (1. Nagoya University)

Keywords:Task-oriented dialogue, Language model, Reinforcement learning

Post-Processing Networks (PPNs) serve as components that modify outputs from modules in task-oriented dialogue systems, with the goal of enhancing the system's overall task completion capabilities. Traditional PPN approaches, however, have been restricted to handling only a subset of modules within a system, which has significantly constrained potential performance improvements. In this paper, we introduce a method for simultaneously optimizing the post-processing of the outputs of all modules using Universal Post-Processing Networks (UniPPNs). The UniPPN utilizes a single language model capable of processing outputs from any module as a sequence transformation task. We provide a detailed explanation of the UniPPN reinforcement learning algorithm and demonstrate, through both simulation experiments using the MultiWOZ dataset and human evaluation studies, that our approach achieves superior performance compared to conventional PPN approaches.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password