Multimodal Semantic Prediction Utilizing Semantics and Latent Uttarance Topics based on Variational Auto Encoder

Shuhei Tateishi

3:30 PM - 3:50 PM

[3P4-GS-2-01] Multimodal Semantic Prediction Utilizing Semantics and Latent Uttarance Topics based on Variational Auto Encoder

〇Shuhei Tateishi¹, Yuka Ozeki¹, Hirofumi Yashima¹, Makoto Nakatsuji¹ (1. NTT Resonant, Inc.)

[[Online]]

Keywords:AI, Multimodal, Sentiment Analysis, Natural Language Processing

In the field of multimodal machine learning, we are faced on the problem of how to combine multiple sources of input data to produce more accurate results than simply summarize the training results for each input data, anytime. Against this issue, we have developed a new model for multimodal sentiment analysis that superior to existing models for accuracy by using the following three elements: (1) applying semantics to each word, (2) extracting relationships between modalities using attention, and (3) adding topic information based on the latent space for the entire utterance that unifies the modality information.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[3P4-GS-2] Machine learning: NLP

[3P4-GS-2-01] Multimodal Semantic Prediction Utilizing Semantics and Latent Uttarance Topics based on Variational Auto Encoder

Password