Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection

Kothinti, Sandeep; Imoto, Keisuke; Chakrabarty, Debmalya; Sell, Gregory; Watanabe, Shinji; Elhilali, Mounya


arXiv:1811.04048 (eess)

[Submitted on 9 Nov 2018]


Abstract: Sound event detection is a challenging task, especially for scenes with multiple simultaneous events. While event classification methods tend to be fairly accurate, event localization presents additional challenges, especially when large amounts of labeled data are not available. Task 4 of the 2018 DCASE challenge presents an event detection task that requires accuracy in both segmentation and recognition of events while providing only weakly labeled training data. Supervised methods can produce accurate event labels but are limited in event segmentation when the training data lacks event timestamps. On the other hand, unsupervised methods that model the acoustic properties of the audio can produce accurate event boundaries but are not guided by the characteristics of event classes and sound categories. We present a hybrid approach that combines acoustic-driven event boundary detection with supervised label inference using a deep neural network. This framework leverages the benefits of both unsupervised and supervised methodologies and takes advantage of large amounts of unlabeled data, making it well suited to large-scale weakly labeled event detection. Compared to a baseline system, the proposed approach delivers a 15% absolute improvement in F-score, demonstrating the benefits of the hybrid bottom-up, top-down approach.
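
The abstract gives no implementation details, so the following is only an illustrative sketch of how acoustic-driven boundaries and supervised label inference might be combined. It assumes that a boundary detector has already produced frame indices where the acoustics change, and that a neural network has produced per-frame class posteriors. The function name `label_segments`, the mean-pooling rule, the frame hop, and the threshold are all hypothetical choices for illustration, not the authors' method.

```python
from typing import List, Sequence, Tuple


def label_segments(
    posteriors: Sequence[Sequence[float]],  # [n_frames][n_classes], assumed DNN output
    boundaries: Sequence[int],              # frame indices from an acoustic boundary detector
    class_names: Sequence[str],
    frame_hop_s: float = 0.02,              # assumed frame hop in seconds
    threshold: float = 0.5,                 # assumed decision threshold
) -> List[Tuple[float, float, str]]:
    """Label each acoustically derived segment with the classes whose
    mean posterior over the segment reaches the threshold."""
    n_frames = len(posteriors)
    if n_frames == 0:
        return []
    # Segment edges: clip start, valid interior boundaries, clip end.
    interior = sorted(b for b in set(boundaries) if 0 < b < n_frames)
    edges = [0] + interior + [n_frames]
    events = []
    for start, end in zip(edges[:-1], edges[1:]):
        segment = posteriors[start:end]
        for k, name in enumerate(class_names):
            # Mean-pool the posterior of class k over the segment.
            mean_p = sum(frame[k] for frame in segment) / len(segment)
            if mean_p >= threshold:
                events.append((start * frame_hop_s, end * frame_hop_s, name))
    return events


# Toy example: 5 frames, 2 classes, one boundary at frame 3.
toy_posteriors = [[0.9, 0.1]] * 3 + [[0.2, 0.8]] * 2
toy_events = label_segments(toy_posteriors, [3], ["speech", "dog"], frame_hop_s=0.5)
print(toy_events)  # [(0.0, 1.5, 'speech'), (1.5, 2.5, 'dog')]
```

In this sketch the event onsets and offsets come entirely from the boundary detector, while the classifier only decides which labels apply to each segment, which mirrors the bottom-up, top-down split described in the abstract.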

Comments:	Submitted to ICASSP 2019
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:1811.04048 [eess.AS]
	(or arXiv:1811.04048v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1811.04048

Submission history

From: Sandeep Reddy Kothinti
[v1] Fri, 9 Nov 2018 18:06:21 UTC (1,228 KB)
