Skip to main content

Showing 1–12 of 12 results for author: Kerkouri, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.19696  [pdf, other

    cs.CV cs.MM eess.IV

    Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Nour Aburaed, Alessandro Bruno

    Abstract: This position paper argues that Mean Opinion Score (MOS), while historically foundational, is no longer sufficient as the sole supervisory signal for multimedia quality assessment models. MOS reduces rich, context-sensitive human judgments to a single scalar, obscuring semantic failures, user intent, and the rationale behind quality decisions. We contend that modern quality assessment models must… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Under review

  2. arXiv:2504.15007  [pdf, other

    cs.CV cs.HC

    Shifts in Doctors' Eye Movements Between Real and AI-Generated Medical Images

    Authors: David C Wong, Bin Wang, Gorkem Durak, Marouane Tliba, Mohamed Amine Kerkouri, Aladine Chetouani, Ahmet Enis Cetin, Cagdas Topel, Nicolo Gennaro, Camila Vendrami, Tugce Agirlar Trabzonlu, Amir Ali Rahsepar, Laetitia Perronne, Matthew Antalek, Onural Ozturk, Gokcan Okur, Andrew C. Gordon, Ayis Pyrros, Frank H Miller, Amir A Borhani, Hatice Savas, Eric M. Hart, Elizabeth A Krupinski, Ulas Bagci

    Abstract: Eye-tracking analysis plays a vital role in medical imaging, providing key insights into how radiologists visually interpret and diagnose clinical cases. In this work, we first analyze radiologists' attention and agreement by measuring the distribution of various eye-movement patterns, including saccades direction, amplitude, and their joint distribution. These metrics help uncover patterns in att… ▽ More

    Submitted 24 April, 2025; v1 submitted 21 April, 2025; originally announced April 2025.

    Comments: This paper was accepted at ETRA 2025 Japan

  3. arXiv:2403.09947  [pdf, other

    cs.CV

    Shifting Focus: From Global Semantics to Local Prominent Features in Swin-Transformer for Knee Osteoarthritis Severity Assessment

    Authors: Aymen Sekhri, Marouane Tliba, Mohamed Amine Kerkouri, Yassine Nasser, Aladine Chetouani, Alessandro Bruno, Rachid Jennane

    Abstract: Conventional imaging diagnostics frequently encounter bottlenecks due to manual inspection, which can lead to delays and inconsistencies. Although deep learning offers a pathway to automation and enhanced accuracy, foundational models in computer vision often emphasize global context at the expense of local details, which are vital for medical imaging diagnostics. To address this, we harness the S… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  4. arXiv:2403.09939  [pdf, other

    cs.CV

    Quantization Effects on Neural Networks Perception: How would quantization change the perceptual field of vision models?

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Alessandro Bruno

    Abstract: Neural network quantization is a critical technique for deploying models on resource-limited devices. Despite its widespread use, the impact of quantization on model perceptual fields, particularly in relation to class activation maps (CAMs), remains underexplored. This study investigates how quantization influences the spatial recognition abilities of vision models by examining the alignment betw… ▽ More

    Submitted 18 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted & presented at IPTA 2024

  5. arXiv:2311.08117  [pdf, other

    cs.CL

    Insights into Classifying and Mitigating LLMs' Hallucinations

    Authors: Alessandro Bruno, Pier Luigi Mazzeo, Aladine Chetouani, Marouane Tliba, Mohamed Amine Kerkouri

    Abstract: The widespread adoption of large language models (LLMs) across diverse AI applications is proof of the outstanding achievements obtained in several tasks, such as text mining, text generation, and question answering. However, LLMs are not exempt from drawbacks. One of the most concerning aspects regards the emerging problematic phenomena known as "Hallucinations". They manifest in text generation… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted at AIxIA 2023

  6. arXiv:2307.04442  [pdf, other

    cs.CV

    Automatic diagnosis of knee osteoarthritis severity using Swin transformer

    Authors: Aymen Sekhri, Marouane Tliba, Mohamed Amine Kerkouri, Yassine Nasser, Aladine Chetouani, Alessandro Bruno, Rachid Jennane

    Abstract: Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint. Early detection and diagnosis are crucial for successful clinical intervention and management to prevent severe complications, such as loss of mobility. In this paper, we propose an automated approach that employs the Swin Transformer to predict the severity of KOA. Our model uses publi… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CBMI 2023

  7. arXiv:2211.07336  [pdf, other

    cs.CV

    An Inter-observer consistent deep adversarial training for visual scanpath prediction

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Alessandro Bruno

    Abstract: The visual scanpath is a sequence of points through which the human gaze moves while exploring a scene. It represents the fundamental concepts upon which visual attention research is based. As a result, the ability to predict them has emerged as an important task in recent years. In this paper, we propose an inter-observer consistent adversarial training approach for scanpath prediction through a… ▽ More

    Submitted 11 July, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: ICIP2023

  8. arXiv:2210.10533  [pdf, other

    eess.IV cs.CV cs.LG

    Deep-based quality assessment of medical images through domain adaptation

    Authors: Marouane Tliba, Aymen Sekhri, Mohamed Amine Kerkouri, Aladine Chetouani

    Abstract: Predicting the quality of multimedia content is often needed in different fields. In some applications, quality metrics are crucial with a high impact, and can affect decision making such as diagnosis from medical multimedia. In this paper, we focus on such applications by proposing an efficient and shallow model for predicting the quality of medical images without reference from a small amount of… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: ICIP 2022

  9. A domain adaptive deep learning solution for scanpath prediction of paintings

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Alessandro Bruno

    Abstract: Cultural heritage understanding and preservation is an important issue for society as it represents a fundamental aspect of its identity. Paintings represent a significant part of cultural heritage, and are the subject of study continuously. However, the way viewers perceive paintings is strictly related to the so-called HVS (Human Vision System) behaviour. This paper focuses on the eye-movement a… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted at CBMI2022 graz, austria

  10. arXiv:2201.00096  [pdf, other

    cs.CV

    SalyPath360: Saliency and Scanpath Prediction Framework for Omnidirectional Images

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Mohamed Sayeh

    Abstract: This paper introduces a new framework to predict visual attention of omnidirectional images. The key setup of our architecture is the simultaneous prediction of the saliency map and a corresponding scanpath for a given stimulus. The framework implements a fully encoder-decoder convolutional neural network augmented by an attention module to generate representative saliency maps. In addition, an au… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

    Comments: Accepted at Electornic Imaging Sympotium 2022

  11. arXiv:2112.04610  [pdf, other

    cs.CV

    A Simple and efficient deep Scanpath Prediction

    Authors: Mohamed Amine Kerkouri, Aladine Chetouani

    Abstract: Visual scanpath is the sequence of fixation points that the human gaze travels while observing an image, and its prediction helps in modeling the visual attention of an image. To this end several models were proposed in the literature using complex deep learning architectures and frameworks. Here, we explore the efficiency of using common deep learning architectures, in a simple fully convolutiona… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: Electronic Imaging Symposium 2022 (EI 2022)

  12. arXiv:2107.00559  [pdf, other

    cs.CV

    SALYPATH: A Deep-Based Architecture for visual attention prediction

    Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Rachid Harba

    Abstract: Human vision is naturally more attracted by some regions within their field of view than others. This intrinsic selectivity mechanism, so-called visual attention, is influenced by both high- and low-level factors; such as the global environment (illumination, background texture, etc.), stimulus characteristics (color, intensity, orientation, etc.), and some prior visual information. Visual attenti… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

    Comments: Accepted at ICIP, 5 pages, 2 figures and 3 tables