JSAI2024

Presentation information

Organized Session

[3O1-OS-16b] OS-16

Thu. May 30, 2024 9:00 AM - 10:40 AM Room O (Music studio hall)

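Organizers: Masahiro Suzuki (University of Tokyo), Yusuke Iwasawa (University of Tokyo), Makoto Kawano (University of Tokyo), Wataru Kumagai (University of Tokyo), Tatsuya Matsushima (University of Tokyo), Yusuke Mori (Square Enix Co., Ltd.), Yutaka Matsuo (University of Tokyo)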

10:20 AM - 10:40 AM

[3O1-OS-16b-05] Improving Learning Efficiency in Compositional Robot Tasks Using Prior Knowledge of Large Language Model

〇Shota Takashiro1, Tatsuya Matsushima1, Yusuke Iwasawa1, Yutaka Matsuo1 (1. University of Tokyo)

Presented in the [4O1-OS-16d-02] time slot: Fri. May 31, 9:20 AM - 9:40 AM

Keywords: RL, LLM, IL

Large language models have shown strong general performance across a wide range of tasks, and their applications are expanding beyond natural language processing into many other fields. Many existing studies apply large language models to robot control, but most use them only for action planning in compositional tasks, and they fail when the selected action is one the robot has not been equipped with in advance. In other words, the prior knowledge in large language models can be used for policy selection during inference, but it cannot be used during actual policy learning. In this paper, we aim to decompose a task using prior knowledge from a large language model and to apply reinforcement learning intensively to only the failed steps, so that the robot can acquire a new policy with minimal interaction with the environment.
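
The loop described in the abstract (decompose the task, try each step, train only the steps that fail) might be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: `decompose_task`, `train_step`, and the `skills` table are hypothetical names, the LLM call is stubbed with a fixed lookup, and the reinforcement-learning routine is left as a caller-supplied callback.

```python
from typing import Callable, Dict, List

Policy = Callable[[], bool]  # assumption: a policy returns True when its step succeeds


def decompose_task(task: str) -> List[str]:
    """Hypothetical stand-in for an LLM call that splits a task into steps."""
    plans = {"make tea": ["pick up kettle", "pour water", "press switch"]}
    return plans[task]


def run_with_targeted_learning(
    task: str,
    skills: Dict[str, Policy],
    train_step: Callable[[str], Policy],
    decompose: Callable[[str], List[str]] = decompose_task,
) -> List[str]:
    """Try each step; train a new policy only for steps that are missing or fail.

    Returns the list of steps that required training.
    """
    trained: List[str] = []
    for step in decompose(task):
        policy = skills.get(step)
        if policy is None or not policy():
            # Only this step is handed to the (placeholder) RL routine.
            skills[step] = train_step(step)
            trained.append(step)
    return trained


if __name__ == "__main__":
    skills: Dict[str, Policy] = {
        "pick up kettle": lambda: True,   # existing skill that works
        "pour water": lambda: False,      # existing skill that fails
    }
    # Placeholder trainer: pretends learning always yields a working policy.
    trained = run_with_targeted_learning("make tea", skills, lambda step: (lambda: True))
    print(trained)
```

In this toy setup the step that already succeeds is left alone, so any interaction budget would be spent only on the failing or missing steps, which is the efficiency argument the abstract makes.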
