Showing 1–4 of 4 results for author: Dowerah, S

  1. arXiv:2509.02859

    cs.SD cs.CL eess.AS

    Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models

    Authors: Sandipana Dowerah, Atharva Kulkarni, Ajinkya Kulkarni, Hoan My Tran, Joonas Kalda, Artem Fedorchenko, Benoit Fauve, Damien Lolive, Tanel Alumäe, Matthew Magimai Doss

    Abstract: Parallel to the development of advanced deepfake audio generation, audio deepfake detection has also seen significant progress. However, a standardized and comprehensive benchmark is still missing. To address this, we introduce Speech DeepFake (DF) Arena, the first comprehensive benchmark for audio deepfake detection. Speech DF Arena provides a toolkit to uniformly evaluate detection systems, curr…

    Submitted 2 September, 2025; originally announced September 2025.

  2. arXiv:2506.02085

    cs.SD cs.AI cs.CL eess.AS

    Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion

    Authors: Ajinkya Kulkarni, Sandipana Dowerah, Tanel Alumae, Mathew Magimai.-Doss

    Abstract: Audio deepfakes are acquiring an unprecedented level of realism with advanced AI. While current research focuses on discerning real speech from spoofed speech, tracing the source system is equally crucial. This work proposes a novel audio source tracing system combining deep metric multi-class N-pair loss with Real Emphasis and Fake Dispersion framework, a Conformer classification network, and ens…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted at Interspeech 2025, Netherlands

  3. arXiv:2307.02244

    cs.SD eess.AS

    Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions

    Authors: Sandipana Dowerah, Ajinkya Kulkarni, Romain Serizel, Denis Jouvet

    Abstract: The paper introduces Diff-Filter, a multichannel speech enhancement approach based on the diffusion probabilistic model, for improving speaker verification performance under noisy and reverberant conditions. It also presents a new two-step training procedure that takes advantage of self-supervised learning. In the first stage, the Diff-Filter is trained by conducting time-domain speech filtering…

    Submitted 5 July, 2023; originally announced July 2023.

  4. arXiv:2210.08834

    cs.SD cs.HC eess.AS

    How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

    Authors: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf

    Abstract: Speaker verification (SV) suffers from unsatisfactory performance in far-field scenarios due to environmental noise and the adverse impact of room reverberation. This work presents a benchmark of multichannel speech enhancement for far-field speaker verification. One approach is deep neural network-based, and the other is a combination of deep neural network and signal processing. We integrated a D…

    Submitted 17 October, 2022; originally announced October 2022.

    Journal ref: 4th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI' 2022), Oct 2022, Corfu, Greece