JSAI2025

Presentation information

Organized Session

Organized Session » OS-22

[3O5-OS-22a] OS-22

Thu. May 29, 2025 3:40 PM - 5:20 PM Room O (Room 1010)

オーガナイザ:北原 鉄朗(日本大学文),中村 栄太(九州大学),浜中 雅俊(理化学研究所)

4:20 PM - 4:40 PM

[3O5-OS-22a-03] Co-generating songs through a naming game using multiple latent diffusion models

Koki Sakurai1, 〇Haruto Uenoyama2, Tadahiro Taniguchi1,2, Akira Taniguchi1 (1. Ritsumeikan University, 2. Kyoto University)

Keywords:Music composition, Latent diffusion model, Multi agent

In this study, we aim to generate music with different musical characteristics by jointly generating music by multiple AI agents. Specifically, we proposed the Metropolis-Hastings Music generation Game (MHMG), which integrates a latent diffusion model with the Metropolis-Hastings naming game, a framework that allows knowledge sharing among agents. In the experiment, two latent diffusion models trained on different genres of music (classical and jazz) were used as agents, and it was verified whether a music piece including the features of each genre could be generated. The experimental results showed that MHMG without fine-tuning retained the characteristics of each genre the best and produced high-quality music.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password