JSAI2025

Presentation information

Poster Session

[3Win5] Poster session 3

Thu. May 29, 2025 3:30 PM - 5:30 PM Room W (Event hall D-E)

[3Win5-50] Prototype and evaluation of a stereophonic sound generation system from performance videos by combining object recognition and sound source separation

〇Ryosuke Hayama¹, Takao Nakaguchi¹, Yimeng Sun¹, Miki Ueno¹, Masaharu Imai¹ (1. The Kyoto College of Graduate Studies for Informatics)

Keywords: stereophonic sound, performance video processing, sound source separation, object recognition

Stereophonic sound is widely used in movies, games, and other media, and gives viewers a more realistic and immersive experience. We built a system that generates stereophonic sound from performance videos by combining object recognition and sound source separation. The generated sound was evaluated qualitatively through questionnaires and quantitatively by comparing acoustic features. The results show that the system can generate stereophonic sound without special equipment or expert engineers. They also indicate that the system can be further improved by revising how the sound sources are positioned.
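
The abstract does not say how the separated sources are placed in the stereo field. As a rough illustration only, the sketch below assumes that each separated mono source is panned according to the horizontal position of its detected object in the video frame, using constant-power panning. The function names, the `stems` and `positions` inputs, and the panning law are all hypothetical and are not taken from the paper.

```python
import math


def pan_gains(x_center, frame_width):
    """Return (left, right) constant-power gains for a horizontal pixel position.

    Assumption: the object detector yields a bounding-box center x in pixels.
    """
    # Normalize to [0, 1]: 0 is the left edge of the frame, 1 is the right edge.
    p = min(max(x_center / frame_width, 0.0), 1.0)
    theta = p * math.pi / 2
    # cos^2 + sin^2 = 1, so total power stays constant across positions.
    return math.cos(theta), math.sin(theta)


def mix_to_stereo(stems, positions, frame_width):
    """Mix mono stems into a (left, right) pair of sample lists.

    stems: dict mapping a source name to a list of float samples
           (hypothetical output of a source separation step).
    positions: dict mapping the same names to a bounding-box center x
               (hypothetical output of an object recognition step).
    """
    length = max(len(samples) for samples in stems.values())
    left = [0.0] * length
    right = [0.0] * length
    for name, samples in stems.items():
        gain_l, gain_r = pan_gains(positions[name], frame_width)
        for i, value in enumerate(samples):
            left[i] += gain_l * value
            right[i] += gain_r * value
    return left, right
```

For example, `mix_to_stereo({"piano": piano_samples}, {"piano": 320}, 1280)` would place a hypothetical piano stem left of center. A real system would also have to handle time-varying positions and depth cues, which this sketch ignores.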
