JSAI2025

Presentation information

Poster Session

Poster session » Poster Session

[3Win5] Poster session 3

Thu. May 29, 2025 3:30 PM - 5:30 PM Room W (Event hall D-E)

[3Win5-28] Music Diffusion Model Using Discrete Diffusion Processes

〇Hiromu Fukumoto1, Toshiaki Omori1 (1.Kobe University)

Keywords:diffusion model, music generation

In this study, we propose a music diffusion model (MusicDiffusion) based on discrete diffusion process. To realize music generation with substantial temporal structure, we employ discrete latent space model and integrate the extracted latent space with diffusion modeling. By compressing music signals into compact latent representations, the proposed method reduces dimensionality while preserving essential musical characteristics.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password