Showing 1–18 of 18 results for author: Ghorbel, E

Searching in archive cs.
  1. arXiv:2505.14333  [pdf, ps, other]

    cs.CV

    Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach

    Authors: Inder Pal Singh, Enjie Ghorbel, Anis Kacem, Djamila Aouada

    Abstract: This paper introduces a discriminator-free adversarial-based approach termed DDA-MLIC for Unsupervised Domain Adaptation (UDA) in the context of Multi-Label Image Classification (MLIC). While recent efforts have explored adversarial-based UDA methods for MLIC, they typically include an additional discriminator subnet. Nevertheless, decoupling the classification and the discrimination tasks may har…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: The paper is under consideration at Computer Vision and Image Understanding. arXiv admin note: text overlap with arXiv:2301.10611

  2. arXiv:2503.13053  [pdf, other]

    cs.CV

    Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation

    Authors: Nassim Ali Ousalah, Anis Kacem, Enjie Ghorbel, Emmanuel Koumandakis, Djamila Aouada

    Abstract: Compact and efficient 6DoF object pose estimation is crucial in applications such as robotics, augmented reality, and space autonomous navigation systems, where lightweight models are critical for real-time accurate performance. This paper introduces a novel uncertainty-aware end-to-end Knowledge Distillation (KD) framework focused on keypoint-based 6DoF pose estimation. Keypoints predicted by a l…

    Submitted 17 March, 2025; originally announced March 2025.

  3. arXiv:2502.21022  [pdf, other]

    cs.LG cs.CV

    When Unsupervised Domain Adaptation meets One-class Anomaly Detection: Addressing the Two-fold Unsupervised Curse by Leveraging Anomaly Scarcity

    Authors: Nesryne Mejri, Enjie Ghorbel, Anis Kacem, Pavel Chernakov, Niki Foteinopoulou, Djamila Aouada

    Abstract: This paper introduces the first fully unsupervised domain adaptation (UDA) framework for unsupervised anomaly detection (UAD). The performance of UAD techniques degrades significantly in the presence of a domain shift, difficult to avoid in a real-world setting. While UDA has contributed to solving this issue in binary and multi-class classification, such a strategy is ill-posed in UAD. This might…

    Submitted 9 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  4. arXiv:2501.08137  [pdf, other]

    cs.CV cs.CR cs.MM cs.SD eess.AS

    Audio-Visual Deepfake Detection With Local Temporal Inconsistencies

    Authors: Marcella Astrid, Enjie Ghorbel, Djamila Aouada

    Abstract: This paper proposes an audio-visual deepfake detection approach that aims to capture fine-grained temporal inconsistencies between audio and visual modalities. To achieve this, both architectural and data synthesis strategies are introduced. From an architectural perspective, a temporal distance map, coupled with an attention mechanism, is designed to capture these inconsistencies while minimizing…

    Submitted 13 March, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Accepted in ICASSP 2025

  5. arXiv:2501.01184  [pdf, other]

    cs.CV

    Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection

    Authors: Dat Nguyen, Marcella Astrid, Anis Kacem, Enjie Ghorbel, Djamila Aouada

    Abstract: Detecting deepfake videos is highly challenging due to the complex intertwined spatial and temporal artifacts in forged sequences. Most recent approaches rely on binary classifiers trained on both real and fake data. However, such methods may struggle to focus on important artifacts, which can hinder their generalization capability. Additionally, these models often lack interpretability, making it…

    Submitted 16 January, 2025; v1 submitted 2 January, 2025; originally announced January 2025.

  6. arXiv:2410.21964  [pdf, other]

    cs.CV

    FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection

    Authors: Dat Nguyen, Marcella Astrid, Enjie Ghorbel, Djamila Aouada

    Abstract: Recently, Vision Transformers (ViTs) have achieved unprecedented effectiveness in the general domain of image classification. Nonetheless, these models remain underexplored in the field of deepfake detection, given their lower performance as compared to Convolution Neural Networks (CNNs) in that specific context. In this paper, we start by investigating why plain ViT architectures exhibit a subopt…

    Submitted 25 November, 2024; v1 submitted 29 October, 2024; originally announced October 2024.

  7. arXiv:2410.00485  [pdf, other]

    cs.CV

    A Hitchhikers Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning

    Authors: Niki Maria Foteinopoulou, Enjie Ghorbel, Djamila Aouada

    Abstract: Explainability in artificial intelligence is crucial for restoring trust, particularly in areas like face forgery detection, where viewers often struggle to distinguish between real and fabricated content. Vision and Large Language Models (VLLM) bridge computer vision and natural language, offering numerous applications driven by strong common-sense reasoning. Despite their success in various task…

    Submitted 30 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted at NeurIPS'2024 (D&B)

  8. arXiv:2408.06753  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Detecting Audio-Visual Deepfakes with Fine-Grained Inconsistencies

    Authors: Marcella Astrid, Enjie Ghorbel, Djamila Aouada

    Abstract: Existing methods on audio-visual deepfake detection mainly focus on high-level features for modeling inconsistencies between audio and visual data. As a result, these approaches usually overlook finer audio-visual artifacts, which are inherent to deepfakes. Herein, we propose the introduction of fine-grained mechanisms for detecting subtle artifacts in both spatial and temporal domains. First, we…

    Submitted 14 October, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: Accepted in BMVC 2024

  9. arXiv:2407.11650  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Statistics-aware Audio-visual Deepfake Detector

    Authors: Marcella Astrid, Enjie Ghorbel, Djamila Aouada

    Abstract: In this paper, we propose an enhanced audio-visual deep detection method. Recent methods in audio-visual deepfake detection mostly assess the synchronization between audio and visual features. Although they have shown promising results, they are based on the maximization/minimization of isolated feature distances without considering feature statistics. Moreover, they rely on cumbersome deep learni…

    Submitted 17 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted in ICIP 2024

  10. arXiv:2407.07598  [pdf, other]

    cs.SD cs.LG eess.AS

    Targeted Augmented Data for Audio Deepfake Detection

    Authors: Marcella Astrid, Enjie Ghorbel, Djamila Aouada

    Abstract: The availability of highly convincing audio deepfake generators highlights the need for designing robust audio deepfake detectors. Existing works often rely solely on real and fake data available in the training set, which may lead to overfitting, thereby reducing the robustness to unseen manipulations. To enhance the generalization capabilities of audio deepfake detectors, we propose a novel augm…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted in EUSIPCO 2024

  11. arXiv:2401.13856  [pdf, ps, other]

    cs.CV

    LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection

    Authors: Dat Nguyen, Nesryne Mejri, Inder Pal Singh, Polina Kuleshova, Marcella Astrid, Anis Kacem, Enjie Ghorbel, Djamila Aouada

    Abstract: This paper introduces a novel approach for high-quality deepfake detection called Localized Artifact Attention Network (LAA-Net). Existing methods for high-quality deepfake detection are mainly based on a supervised binary classifier coupled with an implicit attention mechanism. As a result, they do not generalize well to unseen manipulations. To handle this issue, two main contributions are made.…

    Submitted 24 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR2024

  12. arXiv:2305.12621  [pdf, other]

    eess.IV cs.CV cs.LG

    DermSynth3D: Synthesis of in-the-wild Annotated Dermatology Images

    Authors: Ashish Sinha, Jeremy Kawahara, Arezou Pakzad, Kumar Abhishek, Matthieu Ruthven, Enjie Ghorbel, Anis Kacem, Djamila Aouada, Ghassan Hamarneh

    Abstract: In recent years, deep learning (DL) has shown great potential in the field of dermatological image analysis. However, existing datasets in this domain have significant limitations, including a small number of image samples, limited disease conditions, insufficient annotations, and non-standardized image acquisitions. To address these shortcomings, we propose a novel framework called DermSynth3D. D…

    Submitted 21 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted to Medical Image Analysis (MedIA) 2024

  13. arXiv:2301.10611  [pdf, other]

    cs.CV

    Discriminator-free Unsupervised Domain Adaptation for Multi-label Image Classification

    Authors: Inder Pal Singh, Enjie Ghorbel, Anis Kacem, Arunkumar Rathinam, Djamila Aouada

    Abstract: In this paper, a discriminator-free adversarial-based Unsupervised Domain Adaptation (UDA) for Multi-Label Image Classification (MLIC) referred to as DDA-MLIC is proposed. Recently, some attempts have been made for introducing adversarial-based UDA methods in the context of MLIC. However, these methods which rely on an additional discriminator subnet present one major shortcoming. The learning of…

    Submitted 8 November, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  14. Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple Domains

    Authors: Inder Pal Singh, Enjie Ghorbel, Oyebade Oyedotun, Djamila Aouada

    Abstract: This paper proposes an adaptive graph-based approach for multi-label image classification. Graph-based methods have been largely exploited in the field of multi-label classification, given their ability to model label correlations. Specifically, their effectiveness has been proven not only when considering a single domain but also when taking into account multiple domains. However, the topology of…

    Submitted 22 July, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

  15. Unsupervised Anomaly Detection in Time-series: An Extensive Evaluation and Analysis of State-of-the-art Methods

    Authors: Nesryne Mejri, Laura Lopez-Fuentes, Kankana Roy, Pavel Chernakov, Enjie Ghorbel, Djamila Aouada

    Abstract: Unsupervised anomaly detection in time-series has been extensively investigated in the literature. Notwithstanding the relevance of this topic in numerous application fields, a comprehensive and extensive evaluation of recent state-of-the-art techniques taking into account real-world constraints is still needed. Some efforts have been made to compare existing unsupervised time-series anomaly detec…

    Submitted 12 August, 2024; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted at Expert Systems with Applications journal

  16. arXiv:2104.09248  [pdf, other]

    cs.CV cs.LG

    LSPnet: A 2D Localization-oriented Spacecraft Pose Estimation Neural Network

    Authors: Albert Garcia, Mohamed Adel Musallam, Vincent Gaudilliere, Enjie Ghorbel, Kassem Al Ismaeil, Marcos Perez, Djamila Aouada

    Abstract: Being capable of estimating the pose of uncooperative objects in space has been proposed as a key asset for enabling safe close-proximity operations such as space rendezvous, in-orbit servicing and active debris removal. Usual approaches for pose estimation involve classical computer vision-based solutions or the application of Deep Learning (DL) techniques. This work explores a novel DL-based met…

    Submitted 23 August, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: 9 pages, 5 figures, published at AI4Space 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2021) p. 2048-2056

  17. arXiv:1912.09745  [pdf, other]

    cs.CV

    Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition

    Authors: Konstantinos Papadopoulos, Enjie Ghorbel, Djamila Aouada, Björn Ottersten

    Abstract: This paper extends the Spatial-Temporal Graph Convolutional Network (ST-GCN) for skeleton-based action recognition by introducing two novel modules, namely, the Graph Vertex Feature Encoder (GVFE) and the Dilated Hierarchical Temporal Convolutional Network (DH-TCN). On the one hand, the GVFE module learns appropriate vertex features for action recognition by encoding raw skeleton data into a new f…

    Submitted 20 December, 2019; originally announced December 2019.

  18. arXiv:1904.05244  [pdf, other]

    cs.CV

    Localized Trajectories for 2D and 3D Action Recognition

    Authors: Konstantinos Papadopoulos, Girum Demisse, Enjie Ghorbel, Michel Antunes, Djamila Aouada, Björn Ottersten

    Abstract: The Dense Trajectories concept is one of the most successful approaches in action recognition, suitable for scenarios involving a significant amount of motion. However, due to noise and background motion, many generated trajectories are irrelevant to the actual human activity and can potentially lead to performance degradation. In this paper, we propose Localized Trajectories as an improved versio…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: 36 pages, 2 figures