JSAI2022

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O1-GS-7] Vision, speech media processing: GAN

Tue. Jun 14, 2022 10:00 AM - 11:40 AM Room O (Room 510)

座長:岩澤 有祐(東京大学)[現地]

10:40 AM - 11:00 AM

[1O1-GS-7-03] Generation Based on Non-verbal Impression by Conditional FastGAN and Color Psychological Effects

〇Komei Hiruta1, Ryusuke Saito1, Taro Hatakeyama1, Atsushi Hashimoto1,2, Satoshi Kurihara1 (1. Keio University, 2. OMRON SINIC X Corp)

Keywords:Deep Generative Model, Conditonal GAN, Impressional Feature Space, Color Psychology, Manga

One of the requirements for creating engaging content such as comics and games is to design fascinating characters. Fascinating characters may come out by intuitive inspiration that cannot be expressed in words. On the other hand, excellent inspiration, which is the source of creative ideas, is not something that people can just come up with. Therefore, the purpose of this research is to help creators expand their imagination by controlling the generated images based on non-verbal impressions. First, we propose Conditional FastGAN, which can generate high-quality data even on small datasets such as artworks. In addition, in order to extract the "impressions" that people receive from images as features, we have developed an annotation system that utilizes "color" as a non-verbal impression medium. Our experiments, used cartoon face images extracted from Osamu Tezuka's works, the MUCT dataset consisting of face photographs in various orientations, and a color-based impression information dataset obtained by our system. The results showed the possibility to generate images corresponding to the conditions of live images, cartoons, and impressions.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password