JSAI2021

Presentation information

General Session

General Session » GS-8 Robot and real worlds

[2J3-GS-8b] ロボットと実世界:要素技術

Wed. Jun 9, 2021 1:20 PM - 3:00 PM Room J (GS room 5)

座長:内部 英治(ATR)

1:20 PM - 1:40 PM

[2J3-GS-8b-01] Unsupervised Segmentation for Video Using Convolutional VAE and Gaussian Process

〇Masatoshi Nagano1, Tomoaki Nakamura1, Takayuki Nagai2,1, Daichi Mochihashi3, Ichiro Kobayashi4, Wataru Takano2 (1. The University of Electro-Communications, 2. Osaka University, 3. Institute of Statistical Mathematics, 4. Ochanomizu University)

Keywords:Segmentation, Gaussian Process, Convolutional Variational Autoencoder

Humans recognize perceived continuous high-dimensional information by dividing it into significant segments such as words and unit motions. We believe that such unsupervised segmentation is also an important ability for robots to learn topics such as language and motions. To this end, we have been proposed the Hierarchical Dirichlet Processes-Variational Autoencoder-Gaussian Process-Hidden Semi-Markov Model (HVGH) which is composed of a deep generative model and a statistical model. HVGH can extract features from high-dimensional time-series data by VAE while simultaneously dividing it into segments by Gaussian process. In this paper, we propose a method that can segment not only high-dimensional time-series data but also videos in an unsupervised manner by improving VAE of HVGH to Convolutional VAE. In an experiment, we used a first-person view video of an agent in the maze to demonstrate that our proposed model estimates more accurate segments than the baseline method.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password