2:00 PM - 2:20 PM
[4O3-OS-16e-01] Consistency Models based Scalable Diffusion Policy
Keywords:Diffusion Models, Imitation Learning
We introduce Consistency Policy, an imitation learning model that can operate in real-time. In imitation learning, the input is required to be multimodal and the output is multimodal in nature, making it more difficult than general supervised learning. In this case, the acquisition of action distributions using generative models, especially Diffusion Policy, has reached a high level of accuracy. However, Diffusion Policy has a trade-off between accuracy and real- time performance. In this study, we propose a new policy model for robot learning that can operate in real-time while maintaining the accuracy of Diffusion Policy: Consistency Models, which have sample speed performance that Diffusion Models cannot achieve, and Conditinoal Consistency Models, which can perform multimodal conditioning. Consistency Policy has achieved in a 10-times increase in action generation speed compared to the Diffusion Policy.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.