A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

Pham, Lam; Tang, Hieu; Jalali, Anahid; Schindler, Alexander; King, Ross

Computer Science > Sound

arXiv:2106.06838 (cs)

[Submitted on 12 Jun 2021]

Title:A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

Authors:Lam Pham, Hieu Tang, Anahid Jalali, Alexander Schindler, Ross King

View PDF

Abstract:In this paper, we presents a low-complexity deep learning frameworks for acoustic scene classification (ASC). The proposed framework can be separated into three main steps: Front-end spectrogram extraction, back-end classification, and late fusion of predicted probabilities. First, we use Mel filter, Gammatone filter and Constant Q Transfrom (CQT) to transform raw audio signal into spectrograms, where both frequency and temporal features are presented. Three spectrograms are then fed into three individual back-end convolutional neural networks (CNNs), classifying into ten urban scenes. Finally, a late fusion of three predicted probabilities obtained from three CNNs is conducted to achieve the final classification result. To reduce the complexity of our proposed CNN network, we apply two model compression techniques: model restriction and decomposed convolution. Our extensive experiments, which are conducted on DCASE 2021 (IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events) Task 1A development dataset, achieve a low-complexity CNN based framework with 128 KB trainable parameters and the best classification accuracy of 66.7%, improving DCASE baseline by 19.0%

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2106.06838 [cs.SD]
	(or arXiv:2106.06838v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2106.06838

Submission history

From: Lam Pham [view email]
[v1] Sat, 12 Jun 2021 19:20:39 UTC (437 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2021-06

Change to browse by:

cs
eess
eess.AS

References & Citations

DBLP - CS Bibliography

listing | bibtex

Anahid N. Jalali
Alexander Schindler

export BibTeX citation

Computer Science > Sound

Title:A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators