Hidden Echoes Survive Training in Audio To Audio Generative Instrument Models

Tralie, Christopher J.; Amery, Matt; Douglas, Benjamin; Utz, Ian

Computer Science > Sound

arXiv:2412.10649 (cs)

[Submitted on 14 Dec 2024]

Title:Hidden Echoes Survive Training in Audio To Audio Generative Instrument Models

Authors:Christopher J. Tralie, Matt Amery, Benjamin Douglas, Ian Utz

View PDF HTML (experimental)

Abstract:As generative techniques pervade the audio domain, there has been increasing interest in tracing back through these complicated models to understand how they draw on their training data to synthesize new examples, both to ensure that they use properly licensed data and also to elucidate their black box behavior. In this paper, we show that if imperceptible echoes are hidden in the training data, a wide variety of audio to audio architectures (differentiable digital signal processing (DDSP), Realtime Audio Variational autoEncoder (RAVE), and ``Dance Diffusion'') will reproduce these echoes in their outputs. Hiding a single echo is particularly robust across all architectures, but we also show promising results hiding longer time spread echo patterns for an increased information capacity. We conclude by showing that echoes make their way into fine tuned models, that they survive mixing/demixing, and that they survive pitch shift augmentation during training. Hence, this simple, classical idea in watermarking shows significant promise for tagging generative audio models.

Comments:	8 pages, 11 Figures, Proceedings of 2025 AAAI Workshop on AI for Music
Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
ACM classes:	I.2; I.5.4; J.5
Cite as:	arXiv:2412.10649 [cs.SD]
	(or arXiv:2412.10649v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2412.10649

Submission history

From: Christopher Tralie [view email]
[v1] Sat, 14 Dec 2024 02:36:45 UTC (1,668 KB)

Computer Science > Sound

Title:Hidden Echoes Survive Training in Audio To Audio Generative Instrument Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Hidden Echoes Survive Training in Audio To Audio Generative Instrument Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators