Skip to main content

Showing 1–12 of 12 results for author: Syga, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.02585  [pdf, ps, other

    cs.SD cs.CV eess.AS

    A Hypernetwork-Based Approach to KAN Representation of Audio Signals

    Authors: Patryk Marszałek, Maciej Rut, Piotr Kawa, Przemysław Spurek, Piotr Syga

    Abstract: Implicit neural representations (INR) have gained prominence for efficiently encoding multimedia data, yet their applications in audio signals remain limited. This study introduces the Kolmogorov-Arnold Network (KAN), a novel architecture using learnable activation functions, as an effective INR model for audio representation. KAN demonstrates superior perceptual performance over previous INRs, ac… ▽ More

    Submitted 6 June, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  2. arXiv:2501.11171  [pdf, other

    cs.CV cs.AI cs.IR cs.LG cs.MM

    Counteracting temporal attacks in Video Copy Detection

    Authors: Katarzyna Fojcik, Piotr Syga

    Abstract: Video Copy Detection (VCD) plays a crucial role in copyright protection and content verification by identifying duplicates and near-duplicates in large-scale video databases. The META AI Challenge on video copy detection provided a benchmark for evaluating state-of-the-art methods, with the Dual-level detection approach emerging as a winning solution. This method integrates Video Editing Detection… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: 14 pages, 5 figures, 4 tables

  3. arXiv:2412.17924  [pdf, other

    cs.SD eess.AS

    Are audio DeepFake detection models polyglots?

    Authors: Bartłomiej Marek, Piotr Kawa, Piotr Syga

    Abstract: Since the majority of audio DeepFake (DF) detection methods are trained on English-centric datasets, their applicability to non-English languages remains largely unexplored. In this work, we present a benchmark for the multilingual audio DF detection challenge by evaluating various adaptation strategies. Our experiments focus on analyzing models trained on English benchmark datasets, as well as in… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: Keywords: Audio DeepFakes, DeepFake detection, multilingual audio DeepFakes

  4. arXiv:2401.09512  [pdf, other

    cs.SD eess.AS

    MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

    Authors: Nicolas M. Müller, Piotr Kawa, Wei Herng Choong, Edresson Casanova, Eren Gölge, Thorsten Müller, Piotr Syga, Philip Sperl, Konstantin Böttinger

    Abstract: Text-to-Speech (TTS) technology offers notable benefits, such as providing a voice for individuals with speech impairments, but it also facilitates the creation of audio deepfakes and spoofing attacks. AI-based detection methods can help mitigate these risks; however, the performance of such models is inherently dependent on the quality and diversity of their training data. Presently, the availabl… ▽ More

    Submitted 26 April, 2025; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: IJCNN 2024

  5. arXiv:2306.01428  [pdf, other

    cs.SD cs.LG eess.AS

    Improved DeepFake Detection Using Whisper Features

    Authors: Piotr Kawa, Marcin Plata, Michał Czuba, Piotr Szymański, Piotr Syga

    Abstract: With a recent influx of voice generation methods, the threat introduced by audio DeepFake (DF) is ever-increasing. Several different detection methods have been presented as a countermeasure. Many methods are based on so-called front-ends, which, by transforming the raw audio, emphasize features crucial for assessing the genuineness of the audio sample. Our contribution contains investigating the… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to INTERSPEECH 2023

  6. arXiv:2212.14597  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Defense Against Adversarial Attacks on Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes (DF) are artificially generated utterances created using deep learning, with the primary aim of fooling the listeners in a highly convincing manner. Their quality is sufficient to pose a severe threat in terms of security and privacy, including the reliability of news or defamation. Multiple neural network-based methods to detect generated speech have been proposed to prevent the t… ▽ More

    Submitted 10 June, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

    Comments: Accepted to INTERSPEECH 2023

  7. arXiv:2210.06105  [pdf, other

    cs.SD cs.LG eess.AS

    SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes are utterances generated with the use of deep neural networks. They are highly misleading and pose a threat due to use in fake news, impersonation, or extortion. In this work, we focus on increasing accessibility to the audio DeepFake detection methods by providing SpecRNet, a neural network architecture characterized by a quick inference time and low computational requirements. Ou… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted by TrustCom 2022: The 21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications

  8. Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection

    Authors: Piotr Kawa, Marcin Plata, Piotr Syga

    Abstract: Audio DeepFakes allow the creation of high-quality, convincing utterances and therefore pose a threat due to its potential applications such as impersonation or fake news. Methods for detecting these manipulations should be characterized by good generalization and stability leading to robustness against attacks conducted with techniques that are not explicitly included in the training. In this wor… ▽ More

    Submitted 21 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Proceedings of INTERSPEECH 2022 (Updated version: corrected ASVspoof dataset description)

  9. arXiv:2006.05183  [pdf, other

    cs.CV cs.LG

    A Note on Deepfake Detection with Low-Resources

    Authors: Piotr Kawa, Piotr Syga

    Abstract: Deepfakes are videos that include changes, quite often substituting face of a portrayed individual with a different face using neural networks. Even though the technology gained its popularity as a carrier of jokes and parodies it raises a serious threat to ones security - via biometric impersonation or besmearing. In this paper we present two methods that allow detecting Deepfakes for a user with… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  10. arXiv:2006.03921  [pdf, other

    cs.MM cs.CR cs.CV

    Robust watermarking with double detector-discriminator approach

    Authors: Marcin Plata, Piotr Syga

    Abstract: In this paper we present a novel deep framework for a watermarking - a technique of embedding a transparent message into an image in a way that allows retrieving the message from a (perturbed) copy, so that copyright infringement can be tracked. For this technique, it is essential to extract the information from the image even after imposing some digital processing operations on it. Our framework… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

  11. Robust Spatial-spread Deep Neural Image Watermarking

    Authors: Marcin Plata, Piotr Syga

    Abstract: Watermarking is an operation of embedding an information into an image in a way that allows to identify ownership of the image despite applying some distortions on it. In this paper, we presented a novel end-to-end solution for embedding and recovering the watermark in the digital image using convolutional neural networks. The method is based on spreading the message over the spatial domain of the… ▽ More

    Submitted 4 November, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: The article was accepted on TrustCom 2020: The 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications

  12. arXiv:1602.04138  [pdf, other

    cs.CR

    Practical Fault-Tolerant Data Aggregation

    Authors: Krzysztof Grining, Marek Klonowski, Piotr Syga

    Abstract: During Financial Cryptography 2012 Chan et al. presented a novel privacy-protection fault-tolerant data aggregation protocol. Comparing to previous work, their scheme guaranteed provable privacy of individuals and could work even if some number of users refused to participate. In our paper we demonstrate that despite its merits, their method provides unacceptably low accuracy of aggregated data fo… ▽ More

    Submitted 31 May, 2016; v1 submitted 12 February, 2016; originally announced February 2016.

    Comments: Submitted to ACNS 2016;30 pages