JSAI2024

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[2C6-GS-7] Language media processing:

Wed. May 29, 2024 5:30 PM - 7:10 PM Room C (Temporary room 1)

座長:寺下直行(株式会社日立製作所)

5:30 PM - 5:50 PM

[2C6-GS-7-01] Accident risk estimation with depth information in construction site videos

〇Ryota Goka1, Keisuke Maeda1, Ren Togo1, Takahiro Ogawa1, Miki Haseyama1 (1. Hokkaido University)

Keywords:deep learning, video processing, accident risk estimation

In the construction industry, reducing accident risk and improving safety is one of the high-priority tasks. Recently, several methods have been proposed to estimate the contact accident risk with heavy machinery on construction sites for enhancing safety. Conventional studies based on deep learning estimate the risk by using relations within the image space of detected workers and machinery captured in construction site videos. However, these approaches focus on the distance between detected objects in the image, leading to the problem that accident risk is overestimated even when there is distance between objects in the real world. In this study, to consider 3D spatial information in videos, we propose a method for estimating the accident risk with visual features regarding depth information. Experimental results show that the proposed method performs better than existing methods in estimating the contact accident risk.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password