Adversarial Music: Real World Audio Adversary Against Wake-word Detection System

Li, Juncheng B.; Qu, Shuhui; Li, Xinjian; Szurley, Joseph; Kolter, J. Zico; Metze, Florian

Computer Science > Cryptography and Security

arXiv:1911.00126 (cs)

[Submitted on 31 Oct 2019 (v1), last revised 6 Dec 2019 (this version, v3)]

Title:Adversarial Music: Real World Audio Adversary Against Wake-word Detection System

Authors:Juncheng B. Li, Shuhui Qu, Xinjian Li, Joseph Szurley, J. Zico Kolter, Florian Metze

View PDF

Abstract:Voice Assistants (VAs) such as Amazon Alexa or Google Assistant rely on wake-word detection to respond to people's commands, which could potentially be vulnerable to audio adversarial examples. In this work, we target our attack on the wake-word detection system, jamming the model with some inconspicuous background music to deactivate the VAs while our audio adversary is present. We implemented an emulated wake-word detection system of Amazon Alexa based on recent publications. We validated our models against the real Alexa in terms of wake-word detection accuracy. Then we computed our audio adversaries with consideration of expectation over transform and we implemented our audio adversary with a differentiable synthesizer. Next, we verified our audio adversaries digitally on hundreds of samples of utterances collected from the real world. Our experiments show that we can effectively reduce the recognition F1 score of our emulated model from 93.4% to 11.0%. Finally, we tested our audio adversary over the air, and verified it works effectively against Alexa, reducing its F1 score from 92.5% to 11.0%.; We also verified that non-adversarial music does not disable Alexa as effectively as our music at the same sound level. To the best of our knowledge, this is the first real-world adversarial attack against a commercial-grade VA wake-word detection system. Our code and demo videos can be accessed at \url{this https URL}

Comments:	9 pages, In Proceedings of NeurIPS 2019 Conference
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)
Cite as:	arXiv:1911.00126 [cs.CR]
	(or arXiv:1911.00126v3 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.1911.00126
Journal reference:	NIPS2019_9362, pages = {11908--11918}, year = {2019}, publisher = {Curran Associates, Inc.}, url = {http://papers.nips.cc/paper/9362-adversarial-music-real-world-audio-adversary-against-wake-word-detection-system.pdf} }

Submission history

From: Juncheng Li [view email]
[v1] Thu, 31 Oct 2019 21:58:50 UTC (359 KB)
[v2] Thu, 5 Dec 2019 17:03:17 UTC (359 KB)
[v3] Fri, 6 Dec 2019 02:12:45 UTC (359 KB)

Computer Science > Cryptography and Security

Title:Adversarial Music: Real World Audio Adversary Against Wake-word Detection System

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Adversarial Music: Real World Audio Adversary Against Wake-word Detection System

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators