JSAI2024

Presentation information

General Session

[4K1-GS-9] Human interface

Fri. May 31, 2024 9:00 AM - 10:40 AM Room K (Room 44)

Chair: Yosuke Fukuchi (Tokyo Metropolitan University)

10:00 AM - 10:20 AM

[4K1-GS-9-04] Loss function design considering the frequency domain for multidimensional time series data generation based on a diffusion model

〇Yuya Okadome1,2, Yutaka Nakamura2 (1. Tokyo University of Science, 2. Riken R-IH)

Keywords: Diffusion model, Multidimensional time series data, Frequency domain, Interaction

When generating multidimensional time series data such as human behavior, generative models are typically trained with a spatial loss function such as the L1 loss. A diffusion probabilistic model generates data through iterative denoising, so when it is applied to time series data, subtle but meaningful vibrations in the data may be removed as noise. In this study, we propose a loss function that incorporates both spatial and frequency information for training the diffusion model. In the proposed loss function, the original and generated data are projected onto the frequency domain, and the coherence between their frequency components is calculated. We apply the proposed loss function to train a diffusion model that generates human motion during dyadic conversation. The results suggest that taking the frequency properties into account improves the Fréchet Inception Distance.
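
The abstract does not give the exact form of the loss, so the following is only a minimal sketch of how a spatial term and a coherence-based frequency term might be combined. It assumes an L1 spatial term, a Welch-style magnitude-squared coherence estimate averaged over frequencies and dimensions, and a simple weighted sum; the function names, the `weight` parameter, and the segment settings are all hypothetical and not taken from the paper. NumPy is used for readability; actual training of a diffusion model would need a differentiable framework.

```python
import numpy as np


def segment_spectra(x, seg_len, hop):
    """Split x of shape (T, D) into overlapping Hann-windowed segments
    and return their real FFTs, shape (n_segments, n_freqs, D)."""
    window = np.hanning(seg_len)[:, None]
    starts = range(0, x.shape[0] - seg_len + 1, hop)
    segs = np.stack([x[s:s + seg_len] * window for s in starts])
    return np.fft.rfft(segs, axis=1)


def mean_coherence(x, y, seg_len=32, hop=16, eps=1e-12):
    """Magnitude-squared coherence between x and y, averaged over
    frequency bins and dimensions (assumed aggregation)."""
    X = segment_spectra(x, seg_len, hop)
    Y = segment_spectra(y, seg_len, hop)
    pxy = np.mean(X * np.conj(Y), axis=0)   # cross-spectral density estimate
    pxx = np.mean(np.abs(X) ** 2, axis=0)   # power spectrum of x
    pyy = np.mean(np.abs(Y) ** 2, axis=0)   # power spectrum of y
    coh = np.abs(pxy) ** 2 / (pxx * pyy + eps)
    return float(coh.mean())


def spatial_frequency_loss(x, y, weight=0.5, **coherence_kwargs):
    """Hypothetical combined loss: L1 distance plus a penalty that grows
    as the coherence between original x and generated y decreases."""
    l1 = float(np.mean(np.abs(x - y)))
    return l1 + weight * (1.0 - mean_coherence(x, y, **coherence_kwargs))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    original = rng.standard_normal((128, 3))    # stand-in for motion data
    generated = original + 0.1 * rng.standard_normal((128, 3))
    print(spatial_frequency_loss(original, generated))
```

In this sketch the frequency term is zero when the two sequences are perfectly coherent and approaches `weight` as they become unrelated, so it penalizes generated data that loses the frequency structure of the original even when the pointwise error is small.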
