JSAI2022

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O4-GS-7] Vision, speech media processing: detection / data set creation

Tue. Jun 14, 2022 2:20 PM - 4:00 PM Room O (Room 510)

座長:石原 賢太(NEC)[遠隔]

3:00 PM - 3:20 PM

[1O4-GS-7-03] Action detection in public spaces by video analysis

〇Masahiro Okano Okano1, Riku Ogata1, Junichi Okubo1, Junichiro Fujii1, Takato Yasuno1 (1. Yachiyo Engineering Co., Ltd.)

[[Online]]

Keywords:video, action detection, deep learning

Assessing the value of public spaces is an issue in urban planning. People's staying time in the target space is used as one of the indexes to quantify the value of public space. Until now, surveys have been conducted by people visually checking videos, but labor is required and work efficiency is required. Therefore, in this study, we examined a method to automatically evaluate the staying time by utilizing RTFM (Robust Temporal Feature Magnitude learning) shown by Tian et al. RTFM is a model for detecting abnormal action of surveillance cameras, and this method was applied to detect specific human behavior in public spaces. In this paper, we conducted RTFM learning on images taken with a handy camera in a public space, and compared the accuracy when the loss function and activation function were changed.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password