Speech Separation Using Gain-Adapted Factorial Hidden Markov Models

Radfar, Martin H.; Dansereau, Richard M.; Wong, Willy

arXiv:1901.07604 (cs)

[Submitted on 22 Jan 2019]

Abstract: We present a new probabilistic graphical model that generalizes factorial hidden Markov models (FHMMs) for single-channel speech separation (SCSS), in which we wish to separate two speech signals $X(t)$ and $V(t)$ from a single recording of their mixture $Y(t)=X(t)+V(t)$ using trained models of the speakers' speech signals. Current techniques assume that the data used in the training and test phases of the separation model have the same loudness. In this paper, we introduce GFHMM, a gain-adapted FHMM, which extends SCSS to the general case in which $Y(t)=g_x X(t)+g_v V(t)$, where $g_x$ and $g_v$ are unknown gain factors. GFHMM consists of two independent-state HMMs, which model spectral patterns, and a hidden node, which models the gain difference. A novel inference method is presented that combines the Viterbi algorithm with quadratic optimization at minimal computational overhead. Experimental results on 180 mixtures with gain differences from 0 to 15 dB show that the proposed technique significantly outperforms FHMM and its memoryless counterpart, vector quantization (VQ)-based SCSS.
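The gain-scaled mixture model $Y(t)=g_x X(t)+g_v V(t)$ can be illustrated with a minimal sketch. This is not the paper's inference procedure: GFHMM infers the sources through HMM states, whereas the sketch assumes, purely for illustration, that both source signals are known and works on raw time-domain samples. Under that assumption, fitting the two gains is a quadratic (least-squares) problem with a closed-form solution, which hints at why a gain estimate can be obtained cheaply. The function names `db_to_gain`, `mix`, and `estimate_gains` are hypothetical.

```python
import random


def db_to_gain(db):
    """Convert a level difference in dB to a linear amplitude gain."""
    return 10.0 ** (db / 20.0)


def mix(x, v, g_x, g_v):
    """Form the gain-scaled mixture y[t] = g_x * x[t] + g_v * v[t]."""
    return [g_x * a + g_v * b for a, b in zip(x, v)]


def estimate_gains(y, x, v):
    """Least-squares estimate of (g_x, g_v), ASSUMING x and v are known.

    Minimizes sum_t (y[t] - g_x*x[t] - g_v*v[t])**2, which is quadratic in
    the gains, by solving the 2x2 normal equations in closed form.
    """
    sxx = sum(a * a for a in x)
    svv = sum(b * b for b in v)
    sxv = sum(a * b for a, b in zip(x, v))
    sxy = sum(a * c for a, c in zip(x, y))
    svy = sum(b * c for b, c in zip(v, y))
    det = sxx * svv - sxv * sxv
    if abs(det) < 1e-12:
        raise ValueError("sources are nearly collinear; gains not identifiable")
    g_x = (sxy * svv - svy * sxv) / det
    g_v = (svy * sxx - sxy * sxv) / det
    return g_x, g_v


# Synthetic stand-ins for the two speech signals (white noise, not speech).
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(1000)]
v = [rng.gauss(0.0, 1.0) for _ in range(1000)]

# Second source attenuated by 15 dB relative to the first.
true_g_x, true_g_v = 1.0, db_to_gain(-15.0)
y = mix(x, v, true_g_x, true_g_v)

est_g_x, est_g_v = estimate_gains(y, x, v)
print(est_g_x, est_g_v)
```

Because the synthetic mixture is noise-free, the estimated gains match the true ones up to floating-point error. In the setting the abstract describes, the sources are unknown, so gain estimation has to be combined with state decoding.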

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1901.07604 [cs.SD]
	(or arXiv:1901.07604v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1901.07604

Submission history

From: Martin Radfar
[v1] Tue, 22 Jan 2019 20:17:07 UTC (1,160 KB)
