Semantic Communications for Speech Recognition

Weng, Zhenzi; Qin, Zhijin; Li, Geoffrey Ye

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2107.11190 (eess)

[Submitted on 22 Jul 2021 (v1), last revised 27 Apr 2024 (this version, v3)]

Title:Semantic Communications for Speech Recognition

Authors:Zhenzi Weng, Zhijin Qin, Geoffrey Ye Li

View PDF HTML (experimental)

Abstract:The traditional communications transmit all the source data represented by bits, regardless of the content of source and the semantic information required by the receiver. However, in some applications, the receiver only needs part of the source data that represents critical semantic information, which prompts to transmit the application-related information, especially when bandwidth resources are limited. In this paper, we consider a semantic communication system for speech recognition by designing the transceiver as an end-to-end (E2E) system. Particularly, a deep learning (DL)-enabled semantic communication system, named DeepSC-SR, is developed to learn and extract text-related semantic features at the transmitter, which motivates the system to transmit much less than the source speech data without performance degradation. Moreover, in order to facilitate the proposed DeepSC-SR for dynamic channel environments, we investigate a robust model to cope with various channel environments without requiring retraining. The simulation results demonstrate that our proposed DeepSC-SR outperforms the traditional communication systems in terms of the speech recognition metrics, such as character-error-rate and word-error-rate, and is more robust to channel variations, especially in the low signal-to-noise (SNR) regime.

Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
Cite as:	arXiv:2107.11190 [eess.AS]
	(or arXiv:2107.11190v3 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2107.11190

Submission history

From: Zhenzi Weng [view email]
[v1] Thu, 22 Jul 2021 11:08:08 UTC (164 KB)
[v2] Mon, 6 Sep 2021 15:54:31 UTC (165 KB)
[v3] Sat, 27 Apr 2024 18:04:57 UTC (165 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Semantic Communications for Speech Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Semantic Communications for Speech Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators