Showing 1–4 of 4 results for author: Seddati, O

  1. arXiv:2305.18988  [pdf, other]

    cs.CV cs.IR

    A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation

    Authors: Omar Seddati, Nathan Hubens, Stéphane Dupont, Thierry Dutoit

    Abstract: Sketch-Based Image Retrieval (SBIR) is a crucial task in multimedia retrieval, where the goal is to retrieve a set of images that match a given sketch query. Researchers have already proposed several well-performing solutions for this task, but most focus on enhancing embedding through different approaches such as triplet loss, quadruplet loss, adding data augmentation, and using edge extraction.…

    Submitted 30 May, 2023; originally announced May 2023.

  2. arXiv:2209.06629  [pdf, other]

    cs.CV cs.AI cs.IR

    Transformers and CNNs both Beat Humans on SBIR

    Authors: Omar Seddati, Stéphane Dupont, Saïd Mahmoudi, Thierry Dutoit

    Abstract: Sketch-based image retrieval (SBIR) is the task of retrieving natural images (photos) that match the semantics and the spatial configuration of hand-drawn sketch queries. The universality of sketches extends the scope of possible applications and increases the demand for efficient SBIR solutions. In this paper, we study classic triplet-based SBIR solutions and show that a persistent invariance to…

    Submitted 14 September, 2022; originally announced September 2022.

    ACM Class: I.2.10

  3. arXiv:1801.06349  [pdf]

    cs.HC cs.AI cs.CV

    Proceedings of eNTERFACE 2015 Workshop on Intelligent Interfaces

    Authors: Matei Mancas, Christian Frisson, Joëlle Tilmanne, Nicolas d'Alessandro, Petr Barborka, Furkan Bayansar, Francisco Bernard, Rebecca Fiebrink, Alexis Heloir, Edgar Hemery, Sohaib Laraba, Alexis Moinet, Fabrizio Nunnari, Thierry Ravet, Loïc Reboursière, Alvaro Sarasua, Mickaël Tits, Noé Tits, François Zajéga, Paolo Alborno, Ksenia Kolykhalova, Emma Frid, Damiano Malafronte, Lisanne Huis in't Veld, Hüseyin Cakmak, et al. (49 additional authors not shown)

    Abstract: The 11th Summer Workshop on Multimodal Interfaces eNTERFACE 2015 was hosted by the Numediart Institute of Creative Technologies of the University of Mons from August 10th to September 2015. During the four weeks, students and researchers from all over the world came together in the Numediart Institute of the University of Mons to work on eight selected projects structured around intelligent interf…

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: 159 pages

  4. Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation

    Authors: Jean-Benoit Delbrouck, Stéphane Dupont, Omar Seddati

    Abstract: In Multimodal Neural Machine Translation (MNMT), a neural model generates a translated sentence that describes an image, given the image itself and one source description in English. This is considered the multimodal image caption translation task. The images are processed with a Convolutional Neural Network (CNN) to extract visual features exploitable by the translation model. So far, the CNNs…

    Submitted 16 December, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: Accepted to GLU 2017. arXiv admin note: text overlap with arXiv:1707.00995

    Journal ref: Proc. GLU 2017 International Workshop on Grounding Language Understanding