Large Generative Model-assisted Talking-face Semantic Communication System

Jiang, Feibo; Tu, Siwei; Dong, Li; Pan, Cunhua; Wang, Jiangzhou; You, Xiaohu

arXiv:2411.03876 (cs)

[Submitted on 6 Nov 2024]

Abstract: The rapid development of generative Artificial Intelligence (AI) continues to reveal the potential of Semantic Communication (SemCom). However, current talking-face SemCom systems still suffer from low bandwidth utilization, semantic ambiguity, and diminished Quality of Experience (QoE). This study introduces a Large Generative Model-assisted Talking-face Semantic Communication (LGM-TSC) system tailored for talking-face video communication. First, we introduce a Generative Semantic Extractor (GSE) at the transmitter, based on the FunASR model, to convert semantically sparse talking-face videos into text with high information density. Second, we establish a private Knowledge Base (KB) based on a Large Language Model (LLM) for semantic disambiguation and correction, complemented by a joint knowledge base-semantic-channel coding scheme. Finally, at the receiver, we propose a Generative Semantic Reconstructor (GSR) that uses the BERT-VITS2 and SadTalker models to transform the text back into a high-QoE talking-face video matching the user's timbre. Simulation results demonstrate the feasibility and effectiveness of the proposed LGM-TSC system.
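
The data flow described in the abstract (video to text at the transmitter, KB-based correction, text back to video at the receiver) can be pictured with a minimal Python sketch. Everything in it is a hypothetical placeholder: the function names, the dictionary-based "knowledge base", and the toy channel are assumptions made for illustration and are not the APIs of FunASR, BERT-VITS2, or SadTalker, nor the authors' implementation. Applying the correction on both sides of the channel is likewise an assumption; the abstract does not say where it runs.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class TalkingFaceClip:
    """Stand-in for a talking-face video; only the spoken words are modelled."""
    spoken_text: str
    speaker_id: str


def extract_semantics(clip: TalkingFaceClip) -> str:
    """Placeholder for the GSE (the paper bases this step on FunASR)."""
    return clip.spoken_text


def kb_correct(text: str, knowledge_base: Dict[str, str]) -> str:
    """Placeholder for LLM-based KB correction: a plain word-level lookup."""
    return " ".join(knowledge_base.get(word, word) for word in text.split())


def noisy_channel(text: str) -> str:
    """Toy channel that deterministically corrupts one word (assumption)."""
    return text.replace("channel", "chanel")


def reconstruct(text: str, speaker_id: str) -> Dict[str, str]:
    """Placeholder for the GSR (BERT-VITS2 and SadTalker in the paper).

    Returns a description of what would be synthesized, not a real video.
    """
    return {"speaker_id": speaker_id, "script": text}


def run_pipeline(clip: TalkingFaceClip, kb: Dict[str, str]) -> Dict[str, str]:
    text = extract_semantics(clip)        # transmitter: video -> text
    text = kb_correct(text, kb)           # transmitter-side correction (assumed)
    received = noisy_channel(text)        # errors enter during transmission
    received = kb_correct(received, kb)   # receiver-side correction (assumed)
    return reconstruct(received, clip.speaker_id)  # receiver: text -> video


if __name__ == "__main__":
    kb = {"chanel": "channel"}  # hypothetical private KB entry
    clip = TalkingFaceClip("the channel is noisy", speaker_id="user-1")
    print(run_pipeline(clip, kb))
```

The sketch only shows where each component sits in the chain. In the proposed system the text, rather than the video frames, is what crosses the channel, which is the source of the bandwidth saving the abstract refers to.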

Subjects:	Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2411.03876 [cs.IT]
	(or arXiv:2411.03876v1 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2411.03876

Submission history

From: Feibo Jiang
[v1] Wed, 6 Nov 2024 12:45:46 UTC (13,491 KB)
