Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments

Quan, Minh K.; Wijayasundara, Mayuri; Setunge, Sujeeva; Pathirana, Pubudu N.

doi:10.1109/ICNC64010.2025.10993936

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2501.09394 (eess)

[Submitted on 16 Jan 2025]

Title:Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments

Authors:Minh K. Quan, Mayuri Wijayasundara, Sujeeva Setunge, Pubudu N. Pathirana

View PDF HTML (experimental)

Abstract:The proliferation of Internet of Things (IoT) devices equipped with acoustic sensors necessitates robust acoustic scene classification (ASC) capabilities, even in noisy and data-limited environments. Traditional machine learning methods often struggle to generalize effectively under such conditions. To address this, we introduce Q-ASC, a novel Quantum-Inspired Acoustic Scene Classifier that leverages the power of quantum-inspired transformers. By integrating quantum concepts like superposition and entanglement, Q-ASC achieves superior feature learning and enhanced noise resilience compared to classical models. Furthermore, we introduce a Quantum Variational Autoencoder (QVAE) based data augmentation technique to mitigate the challenge of limited labeled data in IoT deployments. Extensive evaluations on the Tampere University of Technology (TUT) Acoustic Scenes 2016 benchmark dataset demonstrate that Q-ASC achieves remarkable accuracy between 68.3% and 88.5% under challenging conditions, outperforming state-of-the-art methods by over 5% in the best case. This research paves the way for deploying intelligent acoustic sensing in IoT networks, with potential applications in smart homes, industrial monitoring, and environmental surveillance, even in adverse acoustic environments.

Comments:	5 pages, 4 figures
Subjects:	Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF); Sound (cs.SD)
Report number:	979-8-3315-2096-0
Cite as:	arXiv:2501.09394 [eess.AS]
	(or arXiv:2501.09394v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2501.09394
Journal reference:	2025 International Conference on Computing, Networking and Communications (ICNC)
Related DOI:	https://doi.org/10.1109/ICNC64010.2025.10993936

Submission history

From: Minh Quan [view email]
[v1] Thu, 16 Jan 2025 09:06:10 UTC (2,699 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators