JSAI2023

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O4-GS-7] Vision, speech media processing

Tue. Jun 6, 2023 3:00 PM - 4:40 PM Room O (E1+E2)

座長:渡辺 友樹(東芝) [現地]

3:20 PM - 3:40 PM

[1O4-GS-7-02] Commutative and Nonlinear Image Editing for Deep Generative Model

〇Takehiro Aoshima1, Takashi Matsubara1 (1. Osaka University)

Keywords:semantic image editing, deep generative models

Deep generative models, such as generative adversarial networks (GANs), can generate high-quality images. However, these models often do not have an inherent way to edit generated images semantically. In order to edit generated images semantically, recent studies have proposed methods to determine linear or nonlinear semantic paths on the latent space and edit images by manipulating latent codes along these paths. However, the quality of the image editing along linear paths is inferior, and the image editing along nonlinear paths is non-commutative. In this study, we propose to discover semantic curvilinear coordinates on the latent space. We experimentally show that the quality of our method's image editing is better than comparison methods, and our method provides commutative image editing.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password