JSAI2025

Presentation information

Organized Session

Organized Session » OS-39

[2F5-OS-39b] OS-39

Wed. May 28, 2025 3:40 PM - 5:20 PM Room F (Room 1001)

オーガナイザ:上野 未貴(京都情報大学院大学),大澤 博隆(慶応義塾大学),森 友亮(東大先端研/慶應SFセンター),森 直樹(大阪公立大学)

4:20 PM - 4:40 PM

[2F5-OS-39b-03] A Picture-Book Creation System Using Generative AI

〇Tomoya Murata1, Naoki Mori1 (1. Osaka Metropolitan University)

Keywords:LLM, Stable Diffusion, Generative AI, Story Generation, Creators and AI

In recent years, advances in generative AI have enabled the streamlined creation of stories and illustrations. With the advent of large language models (LLMs), human-level natural language generation has become feasible, accelerating applications in education and entertainment, including children's content. This study proposes an automated picture-book creation system using LLMs for text generation and Stable Diffusion for illustrations. We first define a scenario structure and employ multiple LLMs to produce coherent text. In addition, LoRA-based training is applied to Stable Diffusion to ensure consistent character appearances throughout the book. Preliminary results indicate the successful generation of a four-part story alongside corresponding images.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password