JSAI2022

Presentation information

Organized Session

Organized Session » OS-19

[2M1-OS-19a] 世界モデルと知能(1/4)

Wed. Jun 15, 2022 9:00 AM - 10:40 AM Room M (Room B-2)

オーガナイザ:鈴木 雅大(東京大学)、岩澤 有祐(東京大学)[現地]、河野 慎(東京大学)、熊谷 亘(東京大学)、森 友亮(スクウェア・エニックス)、松尾 豊(東京大学)

9:20 AM - 9:40 AM

[2M1-OS-19a-02] A Study on Extraction of Motion Inflection Points Focusing on Objects in an Image

〇Eri Kuroda1, Ichiro Kobayashi1 (1. Ochanomizu University)

Keywords:Graph Structure, Physical Characteristics, World Model, Variational Temporal Abstraction

The use of machine learning to understand the real world has been one of the most important challenges in recent years. Variational Temporal Abstraction (VTA) is a model that extracts the latent structure of a changing environment from visual information. However, VTA extracts the structure of the changing points of image features, not the latent structure that represents the changing points based on the physical behavior of objects in the image. In this study, we improved VTA to extract the latent structure that represents the change point by expressing the physical relationship based on the behavior of the object in the image expressed as a graph structure. By doing so, we tried to realize world recognition based on the recognition of the physical behavior of objects, as humans do. We also verified the accuracy of judging collision, disappearance, stopping, etc. of objects represented as graphs using the proposed method.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password