Skip to main content

Showing 1–8 of 8 results for author: Hondru, V

.
  1. arXiv:2505.11109  [pdf, ps, other

    cs.CV cs.AI cs.LG cs.MM

    MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Marius Popescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: We present the first large-scale open-set benchmark for multilingual audio-video deepfake detection. Our dataset comprises over 250 hours of real and fake videos across eight languages, with 60% of data being generated. For each language, the fake videos are generated with seven distinct deepfake generation models, selected based on the quality of the generated content. We organize the training, v… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 15 pages

  2. arXiv:2503.14421  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    ExDDV: A New Dataset for Explainable Deepfake Detection in Video

    Authors: Vlad Hondru, Eduard Hogea, Darian Onchis, Radu Tudor Ionescu

    Abstract: The ever growing realism and quality of generated videos makes it increasingly harder for humans to spot deepfake content, who need to rely more and more on automatic deepfake detectors. However, deepfake detectors are also prone to errors, and their decisions are not explainable, leaving humans vulnerable to deepfake-based fraud and misinformation. To this end, we introduce ExDDV, the first datas… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  3. arXiv:2411.19537  [pdf, other

    cs.CV cs.AI cs.LG cs.MM cs.SD eess.AS

    Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook

    Authors: Florinel-Alin Croitoru, Andrei-Iulian Hiji, Vlad Hondru, Nicolae Catalin Ristea, Paul Irofti, Marius Popescu, Cristian Rusu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: With the recent advancements in generative modeling, the realism of deepfake content has been increasing at a steady pace, even reaching the point where people often fail to detect manipulated media content online, thus being deceived into various kinds of scams. In this paper, we survey deepfake generation and detection techniques, including the most recent developments in the field, such as diff… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

  4. arXiv:2408.06687  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Image Modeling: A Survey

    Authors: Vlad Hondru, Florinel Alin Croitoru, Shervin Minaee, Radu Tudor Ionescu, Nicu Sebe

    Abstract: In this work, we survey recent studies on masked image modeling (MIM), an approach that emerged as a powerful self-supervised learning technique in computer vision. The MIM task involves masking some information, e.g.~pixels, patches, or even latent representations, and training a model, usually an autoencoder, to predicting the missing information by using the context available in the visible par… ▽ More

    Submitted 9 January, 2025; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: Revised version

  5. arXiv:2405.13637  [pdf, other

    cs.CV cs.AI cs.LG

    Curriculum Direct Preference Optimization for Diffusion and Consistency Models

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe, Mubarak Shah

    Abstract: Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). In this paper, we propose a novel and enhanced version of DPO based on curriculum learning for text-to-image generation. Our method is divided into two training stages. First, a ranking of the examples generated for each prompt is obtained by employ… ▽ More

    Submitted 9 May, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2025

  6. arXiv:2310.00096  [pdf, other

    cs.CV cs.LG

    Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation

    Authors: Vlad Hondru, Radu Tudor Ionescu

    Abstract: Diffusion models showcase strong capabilities in image synthesis, being used in many computer vision tasks with great success. To this end, we propose to explore a new use case, namely to copy black-box classification models without having access to the original training data, the architecture, and the weights of the model, i.e. the model is only exposed through an inference API. More specifically… ▽ More

    Submitted 14 March, 2025; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted in Artificial Intelligence Review

  7. arXiv:2308.01472  [pdf, other

    cs.CV cs.CL cs.LG

    Reverse Stable Diffusion: What prompt was used to generate this image?

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

    Abstract: Text-to-image diffusion models have recently attracted the interest of many researchers, and inverting the diffusion process can play an important role in better understanding the generative process and how to engineer prompts in order to obtain the desired images. To this end, we study the task of predicting the prompt embedding given an image generated by a generative diffusion model. We conside… ▽ More

    Submitted 19 October, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in Computer Vision and Image Understanding

  8. Diffusion Models in Vision: A Survey

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

    Abstract: Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In t… ▽ More

    Submitted 16 January, 2025; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence. 25 pages, 3 figures