Showing 1–6 of 6 results for author: Korzh, D

Searching in archive cs.

  1. arXiv:2506.06751  [pdf, ps, other]

    cs.CL

    Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

    Authors: Mikhail Salnikov, Dmitrii Korzh, Ivan Lazichny, Elvir Karimov, Artyom Iudin, Ivan Oseledets, Oleg Y. Rogov, Natalia Loukachevitch, Alexander Panchenko, Elena Tutubalina

    Abstract: This paper evaluates geopolitical biases in LLMs with respect to various countries through an analysis of their interpretation of historical events with conflicting national perspectives (USA, UK, USSR, and China). We introduce a novel dataset with neutral event descriptions and contrasting viewpoints from different countries. Our findings show significant geopolitical biases, with models favoring…

    Submitted 20 June, 2025; v1 submitted 7 June, 2025; originally announced June 2025.

  2. arXiv:2505.19951  [pdf, ps, other]

    cs.SD cs.AI cs.CR eess.AS

    Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

    Authors: Elvir Karimov, Alexander Varlamov, Danil Ivanov, Dmitrii Korzh, Oleg Y. Rogov

    Abstract: Deep learning voice models are commonly used nowadays, but the safety processing of personal data, such as human identity and speech content, remains suspicious. To prevent malicious user identification, speaker anonymization methods were proposed. Current methods, particularly based on universal adversarial patch (UAP) applications, have drawbacks such as significant degradation of audio quality,…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 5 pages, 3 figures, 1 table; Submitted to Interspeech 2025

  3. arXiv:2410.18057  [pdf, ps, other]

    cs.CV cs.CL

    CLEAR: Character Unlearning in Textual and Visual Modalities

    Authors: Alexey Dontsov, Dmitrii Korzh, Alexey Zhavoronkin, Boris Mikheev, Denis Bobkov, Aibek Alanov, Oleg Y. Rogov, Ivan Oseledets, Elena Tutubalina

    Abstract: Machine Unlearning (MU) is critical for removing private or hazardous information from deep learning models. While MU has advanced significantly in unimodal (text or vision) settings, multimodal unlearning (MMU) remains underexplored due to the lack of open benchmarks for evaluating cross-modal data removal. To address this gap, we introduce CLEAR, the first open-source benchmark designed specific…

    Submitted 31 May, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  4. arXiv:2408.17352  [pdf, other]

    cs.SD cs.AI eess.AS

    AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge

    Authors: Kirill Borodin, Vasiliy Kudryavtsev, Dmitrii Korzh, Alexey Efimenko, Grach Mkrtchian, Mikhail Gorodnichev, Oleg Y. Rogov

    Abstract: Automatic Speaker Verification (ASV) systems, which identify speakers based on their voice characteristics, have numerous applications, such as user authentication in financial transactions, exclusive access control in smart devices, and forensic fraud detection. However, the advancement of deep learning algorithms has enabled the generation of synthetic audio through Text-to-Speech (TTS) and Voic…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 8 pages, 2 figures, 2 tables. Accepted paper at the ASVspoof 2024 (the 25th Interspeech Conference)

  5. arXiv:2404.18791  [pdf, other]

    cs.SD cs.AI eess.AS

    Certification of Speaker Recognition Models to Additive Perturbations

    Authors: Dmitrii Korzh, Elvir Karimov, Mikhail Pautov, Oleg Y. Rogov, Ivan Oseledets

    Abstract: Speaker recognition technology is applied to various tasks, from personal virtual assistants to secure access systems. However, the robustness of these systems against adversarial attacks, particularly to additive perturbations, remains a significant challenge. In this paper, we pioneer applying robustness certification techniques to speaker recognition, initially developed for the image domain. O…

    Submitted 18 December, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures; AAAI-2025 accepted paper

  6. arXiv:2309.16710  [pdf, other]

    cs.CV cs.AI

    General Lipschitz: Certified Robustness Against Resolvable Semantic Transformations via Transformation-Dependent Randomized Smoothing

    Authors: Dmitrii Korzh, Mikhail Pautov, Olga Tsymboi, Ivan Oseledets

    Abstract: Randomized smoothing is the state-of-the-art approach to construct image classifiers that are provably robust against additive adversarial perturbations of bounded magnitude. However, it is more complicated to construct reasonable certificates against semantic transformation (e.g., image blurring, translation, gamma correction) and their compositions. In this work, we propose General Lipschi…

    Submitted 9 August, 2024; v1 submitted 17 August, 2023; originally announced September 2023.