JSAI2024

Presentation information

Poster Session


[4Xin2] Poster session 2

Fri. May 31, 2024 12:00 PM - 1:40 PM Room X (Event hall 1)

[4Xin2-74] Analysis of Conversation Data Using Speech Emotion Recognition System

〇Shosei Nakamura1, Tatsuji Takahashi2, Takano Takeshi2,4, Nobuhito Manome3, Shuji Shinohara2 (1. Graduate School of Tokyo Denki University, 2. Tokyo Denki University, 3. Graduate School of Engineering, The University of Tokyo, 4. The University of Texas Health Science Center)

Keywords: Speech Emotion Recognition, Speech Recognition, Emotion Recognition

The spread of remote meetings since the pandemic and lockdowns has significantly increased the importance of voice-based communication. When visual information is limited, it becomes difficult to interpret the emotions of others, so recognizing and analyzing emotions from voice is extremely important for improving the quality of communication. In this study, we aimed to provide new insights into remote communication by analyzing dialogue data with a speech emotion recognition system that we developed based on the Valence-Arousal-Dominance model proposed by Mehrabian and Russell. We used the Utsunomiya University Spoken Dialogue Database for Paralinguistic Information Studies to compare human evaluations of emotions with the estimates produced by the speech emotion recognition system. To better understand the emotional interactions between interlocutors, we conducted a cross-correlation analysis and investigated the time lags in emotions between speakers. This analysis captured the dynamics of emotions between the interlocutors and revealed that the mutual influence of emotions gradually weakens over time.
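The cross-correlation analysis with time lags mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each speaker's emotion (for example, valence) has already been estimated on a common, evenly spaced time grid. The function name `lagged_cross_correlation`, the synthetic data, and the lag range are all hypothetical choices made for illustration.

```python
import numpy as np


def lagged_cross_correlation(x, y, max_lag):
    """Pearson correlation between x[t] and y[t + lag] for each lag.

    A positive lag means y follows x by `lag` samples.
    Returns a dict mapping lag -> correlation coefficient.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape:
        raise ValueError("x and y must be 1-D arrays of equal length")
    n = len(x)
    if max_lag < 0 or max_lag >= n - 1:
        raise ValueError("max_lag must satisfy 0 <= max_lag < len(x) - 1")

    result = {}
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = x[: n - lag], y[lag:]   # pair x[t] with y[t + lag]
        else:
            a, b = x[-lag:], y[: n + lag]  # pair x[t] with y[t - |lag|]
        result[lag] = float(np.corrcoef(a, b)[0, 1])
    return result


# Synthetic example (assumed data, not from the study):
# speaker B's valence echoes speaker A's valence three samples later.
rng = np.random.default_rng(0)
valence_a = rng.standard_normal(200)
valence_b = np.roll(valence_a, 3) + 0.1 * rng.standard_normal(200)

corr = lagged_cross_correlation(valence_a, valence_b, max_lag=10)
best_lag = max(corr, key=corr.get)  # lag with the strongest positive correlation
```

In this synthetic setup the strongest correlation appears at the lag where speaker B trails speaker A. With real dialogue data, the shape of the correlation across lags would indicate how quickly one speaker's emotional state is reflected in the other's.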
