JSAI2024

Presentation information

Poster Session

Poster session » Poster session

[4Xin2] Poster session 2

Fri. May 31, 2024 12:00 PM - 1:40 PM Room X (Event hall 1)

[4Xin2-34] Standardization for Absorbing Variations in Pause Duration Distribution in Pause Duration Estimation for Reading-Style Speech Synthesis

〇Shunji Takeshita1, Takuya Matsuzaki1 (1.Tokyo University of Science)

Keywords:Reading-Style Speech Synthesis, Pause Duration Estimation, Standardization, natural language processing

In storytelling speech, the distribution of pause durations varies due to differences in the text, the reader, and whether the text is spoken lines or not. In this study, we attempted to absorb these differences by standardizing the pause durations in the training data when learning to predict the pause position and pause duration based on the text to be read aloud. We found that standardization within each audiobook was the most effective among several standardization methods.

Please log in with your participant account.
» Participant Log In