JSAI2024

Presentation information

Organized Session

Organized Session » OS-29

[1O4-OS-29a] OS-29

Tue. May 28, 2024 3:00 PM - 4:40 PM Room O (Music studio hall)

オーガナイザ:北原 鉄朗(日本大学)、中村 栄太(京都大学)、浜中 雅俊(理化学研究所)

3:20 PM - 3:40 PM

[1O4-OS-29a-02] Exploring the Potential and Challenges of Interactive Music Generation with GPT-4

〇Ryosei Kawaguchi1, Haruhiro Katayose1 (1. Kwansei Gakuin University)

Keywords:Music Generation, LLM, Directability

In recent years, media generated by generative AI has received a lot of attention. Recently, Diffusion-based text to music automatic composition systems have been attracting attention. This research focuses on the commonality that both music and language can be expressed symbolically, and explores the ability to compose music on GPT-4 by treating music as a plain text expression using ABC notation. As a result of the experiment, we confirmed that the text has a certain compositional ability by matching the latent space architecture and musical knowledge. Based on this, we developed the "Grazie Piano Tuner", a composition support system, which has the ability to change the melody by controlling the emotional parameters. Currently, we are working on implementing a means to control emotional parameters as time-series information. In the presentation, we will discuss the possibilities and challenges of a composition support system using LLM while introducing actual examples using this system.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password