15:20 〜 15:40
[1H4-OS-12b-01] Television Advertisement Analysis Using Attention-based Multimodal Network
キーワード:深層学習、広告動画、動画解析
The impression/emotion (e.g. the recognition rate, favorableness) prediction of an advertisement is important.
It is related to multimodal features including frames, sounds, as well as metadata. In this paper, we propose a
system that can utilize different models to embed different features, and apply attention mechanism to efficiently
combine those features to help predict the impressions/emotions of audience after they watch an advertisement.
Our prediction can achieve the state-of-the-art performance in real-world dataset. This system can also detailed
analyze the importance of advertisement components.
It is related to multimodal features including frames, sounds, as well as metadata. In this paper, we propose a
system that can utilize different models to embed different features, and apply attention mechanism to efficiently
combine those features to help predict the impressions/emotions of audience after they watch an advertisement.
Our prediction can achieve the state-of-the-art performance in real-world dataset. This system can also detailed
analyze the importance of advertisement components.
講演PDFパスワード認証
論文PDFの閲覧にはログインが必要です。参加登録者の方は「参加者用ログイン」画面からログインしてください。あるいは論文PDF閲覧用のパスワードを以下にご入力ください。