JSAI2023

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O5-GS-7] Vision, speech media processing

Tue. Jun 6, 2023 5:00 PM - 7:00 PM Room O (E1+E2)

座長:真矢 滋(東芝) [現地]

5:20 PM - 5:40 PM

[1O5-GS-7-02] Saliency Map Estimation with ViNet for Ad Movie with active scene and satatic area

〇Kazuhiro Onishi1, Taro Watanabe1 (1. Irep Ltd., Co.)

Keywords:saliency map, VI net, Ad movie

A saliency map prediction system using ViNet is proposed to improve the accuracy reduction dependent on the speed of moving objects in saliency map prediction considering the characteristics of spatiotemporally interwoven video and still image domains. By solving the problem of partial accuracy reduction in advertising videos with intense motion, and by outputting saliency maps that are closer to the human gaze, the system improves the accuracy of video advertising production and further enhances brand lift and recognition effects. Stable output is confirmed in qualitative evaluation using test-produced video advertisements, and improved accuracy is obtained in quantitative evaluation using multiple indicators. This study will further accelerate a new production flow for advertising videos that takes into account the viewer's perspective in advance.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password