Skip to main content

Showing 1–11 of 11 results for author: Kawa, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.14862  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Replay Attacks Against Audio Deepfake Detection

    Authors: Nicolas Müller, Piotr Kawa, Wei-Herng Choong, Adriana Stan, Aditya Tirumala Bukkapatnam, Karla Pizzi, Alexander Wagner, Philip Sperl

    Abstract: We show how replay attacks undermine audio deepfake detection: By playing and re-recording deepfake audio through various speakers and microphones, we make spoofed samples appear authentic to the detection model. To study this phenomenon in more detail, we introduce ReplayDF, a dataset of recordings derived from M-AILABS and MLAAD, featuring 109 speaker-microphone combinations across six languages… ▽ More

    Submitted 1 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Journal ref: Interspeech 2025

  2. arXiv:2503.02585  [pdf, ps, other

    cs.SD cs.CV eess.AS

    A Hypernetwork-Based Approach to KAN Representation of Audio Signals

    Authors: Patryk Marszałek, Maciej Rut, Piotr Kawa, Przemysław Spurek, Piotr Syga

    Abstract: Implicit neural representations (INR) have gained prominence for efficiently encoding multimedia data, yet their applications in audio signals remain limited. This study introduces the Kolmogorov-Arnold Network (KAN), a novel architecture using learnable activation functions, as an effective INR model for audio representation. KAN demonstrates superior perceptual performance over previous INRs, ac… ▽ More

    Submitted 6 June, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  3. arXiv:2502.20427  [pdf, other

    cs.CR cs.AI cs.SD eess.AS

    DeePen: Penetration Testing for Audio Deepfake Detection

    Authors: Nicolas Müller, Piotr Kawa, Adriana Stan, Thien-Phuc Doan, Souhwan Jung, Wei Herng Choong, Philip Sperl, Konstantin Böttinger

    Abstract: Deepfakes - manipulated or forged audio and video media - pose significant security risks to individuals, organizations, and society at large. To address these challenges, machine learning-based classifiers are commonly employed to detect deepfake content. In this paper, we assess the robustness of such classifiers through a systematic penetration testing methodology, which we introduce as DeePen.… ▽ More

    Submitted 5 March, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  4. arXiv:2412.17924  [pdf, other

    cs.SD eess.AS

    Are audio DeepFake detection models polyglots?

    Authors: Bartłomiej Marek, Piotr Kawa, Piotr Syga

    Abstract: Since the majority of audio DeepFake (DF) detection methods are trained on English-centric datasets, their applicability to non-English languages remains largely unexplored. In this work, we present a benchmark for the multilingual audio DF detection challenge by evaluating various adaptation strategies. Our experiments focus on analyzing models trained on English benchmark datasets, as well as in… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: Keywords: Audio DeepFakes, DeepFake detection, multilingual audio DeepFakes

  5. arXiv:2402.06304  [pdf, ps, other

    cs.SD cs.AI eess.AS

    A New Approach to Voice Authenticity

    Authors: Nicolas M. Müller, Piotr Kawa, Shen Hu, Matthias Neu, Jennifer Williams, Philip Sperl, Konstantin Böttinger

    Abstract: Voice faking, driven primarily by recent advances in text-to-speech (TTS) synthesis technology, poses significant societal challenges. Currently, the prevailing assumption is that unaltered human speech can be considered genuine, while fake speech comes from TTS synthesis. We argue that this binary distinction is oversimplified. For instance, altered playback speeds can be used for malicious purpo… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  6. arXiv:2401.09512  [pdf, other

    cs.SD eess.AS

    MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

    Authors: Nicolas M. Müller, Piotr Kawa, Wei Herng Choong, Edresson Casanova, Eren Gölge, Thorsten Müller, Piotr Syga, Philip Sperl, Konstantin Böttinger

    Abstract: Text-to-Speech (TTS) technology offers notable benefits, such as providing a voice for individuals with speech impairments, but it also facilitates the creation of audio deepfakes and spoofing attacks. AI-based detection methods can help mitigate these risks; however, the performance of such models is inherently dependent on the quality and diversity of their training data. Presently, the availabl… ▽ More

    Submitted 26 April, 2025; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: IJCNN 2024

  7. arXiv:2306.01428  [pdf, other

    cs.SD cs.LG eess.AS

    Improved DeepFake Detection Using Whisper Features

    Authors: Piotr Kawa, Marcin Plata, Michał Czuba, Piotr Szymański, Piotr Syga

    Abstract: With a recent influx of voice generation methods, the threat introduced by audio DeepFake (DF) is ever-increasing. Several different detection methods have been presented as a countermeasure. Many methods are based on so-called front-ends, which, by transforming the raw audio, emphasize features crucial for assessing the genuineness of the audio sample. Our contribution contains investigating the… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to INTERSPEECH 2023

  8. arXiv:2212.14597  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Defense Against Adversarial Attacks on Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes (DF) are artificially generated utterances created using deep learning, with the primary aim of fooling the listeners in a highly convincing manner. Their quality is sufficient to pose a severe threat in terms of security and privacy, including the reliability of news or defamation. Multiple neural network-based methods to detect generated speech have been proposed to prevent the t… ▽ More

    Submitted 10 June, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

    Comments: Accepted to INTERSPEECH 2023

  9. arXiv:2210.06105  [pdf, other

    cs.SD cs.LG eess.AS

    SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes are utterances generated with the use of deep neural networks. They are highly misleading and pose a threat due to use in fake news, impersonation, or extortion. In this work, we focus on increasing accessibility to the audio DeepFake detection methods by providing SpecRNet, a neural network architecture characterized by a quick inference time and low computational requirements. Ou… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted by TrustCom 2022: The 21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications

  10. Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes allow the creation of high-quality, convincing utterances and therefore pose a threat due to its potential applications such as impersonation or fake news. Methods for detecting these manipulations should be characterized by good generalization and stability leading to robustness against attacks conducted with techniques that are not explicitly included in the training. In this wor… ▽ More

    Submitted 21 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Proceedings of INTERSPEECH 2022 (Updated version: corrected ASVspoof dataset description)

  11. arXiv:2006.05183  [pdf, other

    cs.CV cs.LG

    A Note on Deepfake Detection with Low-Resources

    Authors: Piotr Kawa, Piotr Syga

    Abstract: Deepfakes are videos that include changes, quite often substituting face of a portrayed individual with a different face using neural networks. Even though the technology gained its popularity as a carrier of jokes and parodies it raises a serious threat to ones security - via biometric impersonation or besmearing. In this paper we present two methods that allow detecting Deepfakes for a user with… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.