JSAI2025

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[3N4-GS-7] Vision, speech media processing:

Thu. May 29, 2025 1:40 PM - 3:20 PM Room N (Room 1009)

座長:砂川 英一(東芝)

2:00 PM - 2:20 PM

[3N4-GS-7-02] Inpainting-based Unsupervised Anomaly Detection with Diffusion Models

〇Shunsuke Sakai1, Tatsuhito Hasegawa1 (1. University of Fukui)

Keywords:Unsupervised Anomaly Detection, Diffusion Model, Masked Image Modeling, Density Estimation, Image Inpainting

Anomaly detection using conventional diffusion models takes an approach where a certain intensity of noise is added to the input image, and anomalies are removed by following the reverse diffusion process learned on normal images. However, this approach has the issue that the noise intensity significantly affects detection performance. In this study, we introduce a novel anomaly detection method using a diffusion model based on image inpainting to address this issue. In anomaly detection based on image inpainting, the masked regions are restored from complete noise, enabling stable detection performance independent of noise intensity. Furthermore, by employing an iterative mask update strategy based on reconstruction error, we improved detection performance compared to a random masking strategy. The proposed method was evaluated on MVTecAD and demonstrated superior performance compared to the baseline and existing anomaly detection methods based on image inpainting.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password