Unsupervised Source Separation via Bayesian Inference in the Latent Domain

Mancusi, Michele; Postolache, Emilian; Mariani, Giorgio; Fumero, Marco; Santilli, Andrea; Cosmo, Luca; Rodolà, Emanuele

arXiv:2110.05313 (cs)

[Submitted on 11 Oct 2021 (v1), last revised 30 Mar 2022 (this version, v4)]

Abstract: State-of-the-art audio source separation models rely on supervised data-driven approaches, which can be expensive in terms of labeling resources. On the other hand, approaches for training these models without any direct supervision are typically highly demanding in terms of memory and time, and remain impractical to use at inference time. We aim to tackle these limitations by proposing a simple yet effective unsupervised separation algorithm, which operates directly on a latent representation of time-domain signals. Our algorithm relies on deep Bayesian priors in the form of pre-trained autoregressive networks to model the probability distribution of each source. We leverage the low cardinality of the discrete latent space, trained with a novel loss term imposing a precise arithmetic structure on it, to perform exact Bayesian inference without relying on an approximation strategy. We validate our approach on the Slakh dataset (arXiv:1909.08494), demonstrating results in line with state-of-the-art supervised approaches while requiring fewer resources than other unsupervised methods.
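
The abstract's claim that a low-cardinality discrete latent space permits exact Bayesian inference can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the codebook size `K`, and the randomly generated prior and likelihood tables are all assumptions made for illustration. In the paper the priors come from pre-trained autoregressive networks and the likelihood from the learned latent structure; here a single time step with made-up tables is used only to show that, with `K` codes per source, the posterior over all `K * K` code pairs can be enumerated and normalized exactly.

```python
import random


def random_distribution(k, rng):
    """Return a hypothetical categorical distribution over k codes."""
    weights = [rng.random() + 1e-3 for _ in range(k)]
    total = sum(weights)
    return [w / total for w in weights]


def exact_posterior(prior_a, prior_b, likelihood, mix_code):
    """Posterior over code pairs (i, j) given an observed mixture code.

    joint[i][j] = p(mix_code | i, j) * p_a(i) * p_b(j), then normalized.
    The full k x k table is enumerated, so no approximation is involved.
    """
    k = len(prior_a)
    joint = [
        [likelihood[i][j][mix_code] * prior_a[i] * prior_b[j] for j in range(k)]
        for i in range(k)
    ]
    total = sum(sum(row) for row in joint)
    return [[value / total for value in row] for row in joint]


# Toy setup (all values are assumptions, not taken from the paper).
rng = random.Random(0)
K = 8  # hypothetical codebook size
prior_a = random_distribution(K, rng)  # stand-in for source A's prior
prior_b = random_distribution(K, rng)  # stand-in for source B's prior
# likelihood[i][j] is a distribution over mixture codes given source codes (i, j).
likelihood = [[random_distribution(K, rng) for _ in range(K)] for _ in range(K)]

posterior = exact_posterior(prior_a, prior_b, likelihood, mix_code=3)

# Most probable pair of source codes for this mixture code.
best_pair = max(
    ((i, j) for i in range(K) for j in range(K)),
    key=lambda pair: posterior[pair[0]][pair[1]],
)
```

The cost of the enumeration grows as `K * K` per step, which is why a small codebook matters for this kind of exact computation.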

Comments:	5 pages, 2 figures, submitted to Interspeech 2022
Subjects:	Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2110.05313 [cs.LG]
	(or arXiv:2110.05313v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.05313

Submission history

From: Emilian Postolache
[v1] Mon, 11 Oct 2021 14:32:55 UTC (883 KB)
[v2] Fri, 19 Nov 2021 14:44:06 UTC (443 KB)
[v3] Mon, 28 Mar 2022 22:19:30 UTC (9,211 KB)
[v4] Wed, 30 Mar 2022 20:03:07 UTC (9,209 KB)
