Showing 1–3 of 3 results for author: Ciobanu, I

Searching in archive cs.
  1. arXiv:2506.00462  [pdf, ps, other]

    cs.SD cs.AI cs.CL cs.LG eess.AS

    XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark

    Authors: Ioan-Paul Ciobanu, Andrei-Iulian Hiji, Nicolae-Catalin Ristea, Paul Irofti, Cristian Rusu, Radu Tudor Ionescu

    Abstract: Recent advances in audio generation led to an increasing number of deepfakes, making the general public more vulnerable to financial scams, identity theft, and misinformation. Audio deepfake detectors promise to alleviate this issue, with many recent studies reporting accuracy rates close to 99%. However, these methods are typically tested in an in-domain setup, where the deepfake samples from the…

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2502.20747  [pdf, other]

    cs.SE

    Measuring Determinism in Large Language Models for Software Code Review

    Authors: Eugene Klishevich, Yegor Denisov-Blanch, Simon Obstbaum, Igor Ciobanu, Michal Kosinski

    Abstract: Large Language Models (LLMs) promise to streamline software code reviews, but their ability to produce consistent assessments remains an open question. In this study, we tested four leading LLMs -- GPT-4o mini, GPT-4o, Claude 3.5 Sonnet, and LLaMA 3.2 90B Vision -- on 70 Java commits from both private and public repositories. By setting each model's temperature to zero, clearing context, and repea…

    Submitted 28 February, 2025; originally announced February 2025.

  3. arXiv:2409.15152  [pdf, ps, other]

    cs.SE

    Predicting Expert Evaluations in Software Code Reviews

    Authors: Yegor Denisov-Blanch, Igor Ciobanu, Simon Obstbaum, Michal Kosinski

    Abstract: Manual code reviews are an essential but time-consuming part of software development, often leading reviewers to prioritize technical issues while skipping valuable assessments. This paper presents an algorithmic model that automates aspects of code review typically avoided due to their complexity or subjectivity, such as assessing coding time, implementation time, and code complexity. Instead of…

    Submitted 23 September, 2024; originally announced September 2024.