Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

Kilickaya, Sertac; Ahishali, Mete; Celebioglu, Cansu; Sohrab, Fahad; Eren, Levent; Ince, Turker; Askar, Murat; Gabbouj, Moncef

doi:10.1109/CIESCompanion65073.2025.11010815

Computer Science > Sound

arXiv:2412.10792 (cs)

[Submitted on 14 Dec 2024]

Title:Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

Authors:Sertac Kilickaya, Mete Ahishali, Cansu Celebioglu, Fahad Sohrab, Levent Eren, Turker Ince, Murat Askar, Moncef Gabbouj

View PDF HTML (experimental)

Abstract:The frequent breakdowns and malfunctions of industrial equipment have driven increasing interest in utilizing cost-effective and easy-to-deploy sensors, such as microphones, for effective condition monitoring of machinery. Microphones offer a low-cost alternative to widely used condition monitoring sensors with their high bandwidth and capability to detect subtle anomalies that other sensors might have less sensitivity. In this study, we investigate malfunctioning industrial machines to evaluate and compare anomaly detection performance across different machine types and fault conditions. Log-Mel spectrograms of machinery sound are used as input, and the performance is evaluated using the area under the curve (AUC) score for two different methods: baseline dense autoencoder (AE) and one-class deep Support Vector Data Description (deep SVDD) with different subspace dimensions. Our results over the MIMII sound dataset demonstrate that the deep SVDD method with a subspace dimension of 2 provides superior anomaly detection performance, achieving average AUC scores of 0.84, 0.80, and 0.69 for 6 dB, 0 dB, and -6 dB signal-to-noise ratios (SNRs), respectively, compared to 0.82, 0.72, and 0.64 for the baseline model. Moreover, deep SVDD requires 7.4 times fewer trainable parameters than the baseline dense AE, emphasizing its advantage in both effectiveness and computational efficiency.

Comments:	To be published in 2025 IEEE Symposium Series on Computational Intelligence
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2412.10792 [cs.SD]
	(or arXiv:2412.10792v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2412.10792
Related DOI:	https://doi.org/10.1109/CIESCompanion65073.2025.11010815

Submission history

From: Sertac Kilickaya [view email]
[v1] Sat, 14 Dec 2024 11:05:06 UTC (488 KB)

Computer Science > Sound

Title:Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators