9:40 AM - 10:00 AM
[2N1-03] Incremental estimation of user's turn-taking state
Keywords:Spoken Dialog System, Turn Taking, Speech Recognition
Turn-taking state estimation to determine utterance timing of a spoken dialog system is discussed. We propose the recurrent neural network based method to estimate user's turn-taking state incrementally. The proposed method utilizes acoustic feature extracted using a spectrogram autoencoder as well as linguistic feature extracted from a partial speech recognition result using a neural network based language model. The article shows an example of estimation result and discuss the performance of the proposed method.