Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation

Hasumi, Takuya; Nakamura, Tomohiko; Takamune, Norihiro; Saruwatari, Hiroshi; Kitamura, Daichi; Takahashi, Yu; Kondo, Kazunobu

Computer Science > Sound

arXiv:2106.03492 (cs)

[Submitted on 7 Jun 2021]

Title:Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation

Authors:Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo

View PDF

Abstract:Independent deeply learned matrix analysis (IDLMA) is one of the state-of-the-art supervised multichannel audio source separation methods. It blindly estimates the demixing filters on the basis of source independence, using the source model estimated by the deep neural network (DNN). However, since the ratios of the source to interferer signals vary widely among time-frequency (TF) slots, it is difficult to obtain reliable estimated power spectrograms of sources at all TF slots. In this paper, we propose an IDLMA extension, empirical Bayesian IDLMA (EB-IDLMA), by introducing a prior distribution of source power spectrograms and treating the source power spectrograms as latent random variables. This treatment allows us to implicitly consider the reliability of the estimated source power spectrograms for the estimation of demixing filters through the hyperparameters of the prior distribution estimated by the DNN. Experimental evaluations show the effectiveness of EB-IDLMA and the importance of introducing the reliability of the estimated source power spectrograms.

Comments:	5 pages, 4 figures, accepted for European Signal Processing Conference 2021 (EUSIPCO 2021)
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2106.03492 [cs.SD]
	(or arXiv:2106.03492v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2106.03492

Submission history

From: Takuya Hasumi [view email]
[v1] Mon, 7 Jun 2021 10:26:02 UTC (662 KB)

Computer Science > Sound

Title:Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators