Distributed Microphone Speech Enhancement based on Deep Learning

Wang, Syu-Siang; Liang, Yu-You; Hung, Jeih-weih; Tsao, Yu; Wang, Hsin-Min; Fang, Shih-Hau

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:1911.08153 (eess)

[Submitted on 19 Nov 2019 (v1), last revised 24 May 2020 (this version, v3)]

Title:Distributed Microphone Speech Enhancement based on Deep Learning

Authors:Syu-Siang Wang, Yu-You Liang, Jeih-weih Hung, Yu Tsao, Hsin-Min Wang, Shih-Hau Fang

View PDF

Abstract:Speech-related applications deliver inferior performance in complex noise environments. Therefore, this study primarily addresses this problem by introducing speech-enhancement (SE) systems based on deep neural networks (DNNs) applied to a distributed microphone architecture, and then investigates the effectiveness of three different DNN-model structures. The first system constructs a DNN model for each microphone to enhance the recorded noisy speech signal, and the second system combines all the noisy recordings into a large feature structure that is then enhanced through a DNN model. As for the third system, a channel-dependent DNN is first used to enhance the corresponding noisy input, and all the channel-wise enhanced outputs are fed into a DNN fusion model to construct a nearly clean signal. All the three DNN SE systems are operated in the acoustic frequency domain of speech signals in a diffuse-noise field environment. Evaluation experiments were conducted on the Taiwan Mandarin Hearing in Noise Test (TMHINT) database, and the results indicate that all the three DNN-based SE systems provide the original noise-corrupted signals with improved speech quality and intelligibility, whereas the third system delivers the highest signal-to-noise ratio (SNR) improvement and optimal speech intelligibility.

Comments:	deep neural network, multi-channel speech enhancement, distributed microphone architecture, diffuse noise environment
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
Cite as:	arXiv:1911.08153 [eess.AS]
	(or arXiv:1911.08153v3 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1911.08153

Submission history

From: SyuSiang Wang [view email]
[v1] Tue, 19 Nov 2019 08:23:17 UTC (444 KB)
[v2] Fri, 22 Nov 2019 04:22:48 UTC (444 KB)
[v3] Sun, 24 May 2020 10:07:23 UTC (613 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Distributed Microphone Speech Enhancement based on Deep Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Distributed Microphone Speech Enhancement based on Deep Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators