JSAI2024

Presentation information

Organized Session

Organized Session » OS-16

[4O3-OS-16e] OS-16

Fri. May 31, 2024 2:00 PM - 3:20 PM Room O (Music studio hall)

オーガナイザ:鈴木 雅大(東京大学)、岩澤 有祐(東京大学)、河野 慎(東京大学)、熊谷 亘(東京大学)、松嶋 達也(東京大学)、森 友亮(株式会社スクウェア・エニックス)、松尾 豊(東京大学)

2:00 PM - 2:20 PM

[4O3-OS-16e-01] Consistency Models based Scalable Diffusion Policy

〇Sodtavilan Odonchimed1, Yuya Ikeda1, Ryosuke Takanami1, Tatsuya Matsushima1, Yuta Oshima1, Takuya Okubo1, Kai Nabeta2, Yutaka Matsuo1, Yusuke Iwasawa1 (1. The University of Tokyo, 2. University of Tsukuba)

Keywords:Diffusion Models, Imitation Learning

We introduce Consistency Policy, an imitation learning model that can operate in real-time. In imitation learning, the input is required to be multimodal and the output is multimodal in nature, making it more difficult than general supervised learning. In this case, the acquisition of action distributions using generative models, especially Diffusion Policy, has reached a high level of accuracy. However, Diffusion Policy has a trade-off between accuracy and real- time performance. In this study, we propose a new policy model for robot learning that can operate in real-time while maintaining the accuracy of Diffusion Policy: Consistency Models, which have sample speed performance that Diffusion Models cannot achieve, and Conditinoal Consistency Models, which can perform multimodal conditioning. Consistency Policy has achieved in a 10-times increase in action generation speed compared to the Diffusion Policy.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password