Prototype and evaluation of a stereophonic sound generation system from performance videos by combining object recognition and sound source separation

Ryosuke Hayama; Takao Nakaguchi; Yimeng Sun; Miki Ueno; Masaharu Imai

[3Win5-50] Prototype and evaluation of a stereophonic sound generation system from performance videos by combining object recognition and sound source separation

〇Ryosuke Hayama¹, Takao Nakaguchi¹, Yimeng Sun¹, Miki Ueno¹, Masaharu Imai¹ (1.The Kyoto College of Graduate Studies for Informatics)

Keywords:stereophonic sound, Performance Video Processing, sound source separation, object recognition

Stereophonic sound is widely used in movies, games, etc., and can provide viewers with a more realistic and immersive experience. We constructed a system to generate stereophonic sound from performance videos by combining object recognition and sound source separation.The generated stereophonic sound was evaluated qualitatively by questionnaires and quantitatively by comparing acoustic features.The results show that the system can generate stereophonic sound without the need for special equipment or expert engineers.It was also found that the system can be further improved by revising the positioning of the sound sources.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[3Win5] Poster session 3

[3Win5-50] Prototype and evaluation of a stereophonic sound generation system from performance videos by combining object recognition and sound source separation

Password