JSAI2022

Presentation information

General Session

General Session » GS-2 Machine learning

[3P4-GS-2] Machine learning: NLP

Thu. Jun 16, 2022 3:30 PM - 4:50 PM Room P (Online P)

座長:小林 一郎(お茶の水女子大学)[現地]

3:30 PM - 3:50 PM

[3P4-GS-2-01] Multimodal Semantic Prediction Utilizing Semantics and Latent Uttarance Topics based on Variational Auto Encoder

〇Shuhei Tateishi1, Yuka Ozeki1, Hirofumi Yashima1, Makoto Nakatsuji1 (1. NTT Resonant, Inc.)

[[Online]]

Keywords:AI, Multimodal, Sentiment Analysis, Natural Language Processing

In the field of multimodal machine learning, we are faced on the problem of how to combine multiple sources of input data to produce more accurate results than simply summarize the training results for each input data, anytime. Against this issue, we have developed a new model for multimodal sentiment analysis that superior to existing models for accuracy by using the following three elements: (1) applying semantics to each word, (2) extracting relationships between modalities using attention, and (3) adding topic information based on the latent space for the entire utterance that unifies the modality information.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password