3:20 PM - 3:40 PM
[1H4-OS-12b-01] Television Advertisement Analysis Using Attention-based Multimodal Network
Keywords:deep neural network, television advertising, video analysis
The impression/emotion (e.g. the recognition rate, favorableness) prediction of an advertisement is important.
It is related to multimodal features including frames, sounds, as well as metadata. In this paper, we propose a
system that can utilize different models to embed different features, and apply attention mechanism to efficiently
combine those features to help predict the impressions/emotions of audience after they watch an advertisement.
Our prediction can achieve the state-of-the-art performance in real-world dataset. This system can also detailed
analyze the importance of advertisement components.
It is related to multimodal features including frames, sounds, as well as metadata. In this paper, we propose a
system that can utilize different models to embed different features, and apply attention mechanism to efficiently
combine those features to help predict the impressions/emotions of audience after they watch an advertisement.
Our prediction can achieve the state-of-the-art performance in real-world dataset. This system can also detailed
analyze the importance of advertisement components.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.