JSAI2025

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[4N1-GS-7] Vision, speech media processing:

Fri. May 30, 2025 9:00 AM - 10:40 AM Room N (Room 1009)

座長:早川 大智(東芝)

9:40 AM - 10:00 AM

[4N1-GS-7-03] A Study on the Generation of Pseudo-Anomalous Data for Anomaly Detection in Machine Sounds

〇Satoshi Kawamura1, Kohei Yamamoto1, Hideaki Tamai1 (1. Oki Electric Industry Co., Ltd.)

Keywords:sound, anomaly detection, pseudo anomaly, data augmentation, deep learning

The detection of anomalous sounds is crucial for the efficient operation and maintenance of factory machinery. However, obtaining real-world anomalous data is often impractical, limiting the training of anomaly detection models. To address this issue, classification-based approaches utilizing pseudo-anomalous data have garnered significant interest. Pseudo-anomalies can be generated by treating normal sounds from non-target machines as anomalies or by applying perturbations to normal sounds using neural networks. While the former approach necessitates machine-specific models, the latter may fail to adequately capture the unique characteristics of target machines. This study investigates various sound data augmentation techniques for generating pseudo-anomalies and systematically evaluates their combinations to improve anomaly detection performance. Experimental results demonstrate that certain augmentations substantially enhance detection accuracy. Furthermore, we analyze the individual impact of each method and discuss the influence of augmented sound data on anomaly detection models.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password