JSAI2020

Presentation information

General Session

General Session » J-10 Vision, speech

[2Q1-GS-10] Vision, speech: Fundamental theory and application

Wed. Jun 10, 2020 9:00 AM - 10:40 AM Room Q (jsai2020online-17)

座長:橋本博志(NEC)

9:40 AM - 10:00 AM

[2Q1-GS-10-03] Turn-taking timing prediction based on incremental estimation of utterance expectation for spoken dialog systems

〇Shinya Fujie1,2, Hayato Katayama2, Tetsunori Kobayashi2 (1. Chiba Institute of Technology, 2. Waseda University)

Keywords:Spoken Dialog System, Turn Taking

Turn-taking timing prediction method for spoken dialogue systems is proposed and its evaluation is reported. User's utterance to a system is divided into several speech segments by various reasons. A system with simple turn-taking strategy which takes its turn at every user's speech break may cause troubles such as interrupting user's utterance. Recent studies propose predicition methods of end of user's turn to solve this problem. However, precise timing that system should take its turn is hardly discussed. Thus, we extend our former approach for incremental prediction of end of user's turn so that it can predict precise system turn-taking timing. The idea is based on first order delay response to user's unspeech likelihood and apply thresholding to it. The proposed method is evaluated with actual human dialogue corpus.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password