Keywords:Dialogue system, Backchannel
Backchannels could allow spoken dialogue systems to make communication smoother and to elicit more conversation from users. We propose a model that uses acoustic features, linguistic features, and dialogue histories, to predict appropriate timings of backchannels. Our experimental results show that the proposed method performs better than our baseline model that uses acoustic and linguistic features only. Furthermore, we conducted a subjective experiment on predicting timings of backchannels, which results showed that the proposed method can predict the timings of the giving backchannels with a performance similar to that of a human annotator. We obtained a higher evaluation than the baseline model in our five-grade evaluation by seven human subjects, confirming the effectiveness of our proposed method.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.