JSAI2020

Presentation information

General Session

General Session » J-10 Vision, speech

[1H5-GS-10] Vision, speech: Recognition and detection

Tue. Jun 9, 2020 5:20 PM - 7:00 PM Room H (jsai2020online-8)

座長:岡部浩司(NEC)

6:40 PM - 7:00 PM

[1H5-GS-10-05] A Video Dataset for Action Detection and Understanding in Nursery Schools

〇Keita Iida1, Xueting Wang1, Toshihiko Yamasaki1, Satoshi Toriumi2, Mikihisa Hayashi2, Sachiko Nozawa1, Midori Takahashi1, Kengo Hiroto1, Toshihiko Endou1, Kiyomi Akita1 (1. The University of Tokyo, 2. Future Standard Co., Ltd.)

Keywords:Action detection, Actio understanding, Dataset

Action detection, which is a task whose goal is to find person (action localization) and classify what he/she is doing (action classification), is a challenging task in computer vision. Despite its difficulty, various methods for action detection from videos have been developed recently. However, there are not many datasets for detailed action understanding because of great expense of dataset construction.
We introduce a new annotated video dataset for action understanding, based on videos taken in nursery schools. In this dataset, we give detailed tags for every person in the videos, which enable advanced analysis in actions of children. We focus on interactions with their surroundings, by annotating not only their actions but also their IDs and objectives of the actions.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password