Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Wu, Haibin; Li, Xu; Liu, Andy T.; Wu, Zhiyong; Meng, Helen; Lee, Hung-yi

Computer Science > Sound

arXiv:2106.00273 (cs)

[Submitted on 1 Jun 2021 (v1), last revised 5 Jun 2024 (this version, v4)]

Title:Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Authors:Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee

View PDF HTML (experimental)

Abstract:Previous works have shown that automatic speaker verification (ASV) is seriously vulnerable to malicious spoofing attacks, such as replay, synthetic speech, and recently emerged adversarial attacks. Great efforts have been dedicated to defending ASV against replay and synthetic speech; however, only a few approaches have been explored to deal with adversarial attacks. All the existing approaches to tackle adversarial attacks for ASV require the knowledge for adversarial samples generation, but it is impractical for defenders to know the exact attack algorithms that are applied by the in-the-wild attackers. This work is among the first to perform adversarial defense for ASV without knowing the specific attack algorithms. Inspired by self-supervised learning models (SSLMs) that possess the merits of alleviating the superficial noise in the inputs and reconstructing clean samples from the interrupted ones, this work regards adversarial perturbations as one kind of noise and conducts adversarial defense for ASV by SSLMs. Specifically, we propose to perform adversarial defense from two perspectives: 1) adversarial perturbation purification and 2) adversarial perturbation detection. Experimental results show that our detection module effectively shields the ASV by detecting adversarial samples with an accuracy of around 80%. Moreover, since there is no common metric for evaluating the adversarial defense performance for ASV, this work also formalizes evaluation metrics for adversarial defense considering both purification and detection based approaches into account. We sincerely encourage future works to benchmark their approaches based on the proposed evaluation framework.

Comments:	Accepted by TASLP
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2106.00273 [cs.SD]
	(or arXiv:2106.00273v4 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2106.00273

Submission history

From: Haibin Wu [view email]
[v1] Tue, 1 Jun 2021 07:10:54 UTC (4,980 KB)
[v2] Mon, 14 Jun 2021 07:26:40 UTC (4,980 KB)
[v3] Mon, 6 Dec 2021 10:32:44 UTC (6,177 KB)
[v4] Wed, 5 Jun 2024 02:36:34 UTC (6,177 KB)

Computer Science > Sound

Title:Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators