Showing 1–4 of 4 results for author: Hajimiri, S

  1. arXiv:2404.08181  [pdf, other]

    cs.CV

    Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation

    Authors: Sina Hajimiri, Ismail Ben Ayed, Jose Dolz

    Abstract: Despite the significant progress in deep learning for dense visual recognition problems, such as semantic segmentation, traditional methods are constrained by fixed class sets. Meanwhile, vision-language foundation models, such as CLIP, have showcased remarkable effectiveness in numerous zero-shot image-level tasks, owing to their robust generalizability. Recently, a body of work has investigated…

    Submitted 16 September, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted to WACV 2025

  2. arXiv:2312.12730  [pdf, other]

    cs.CV

    A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models

    Authors: Julio Silva-Rodríguez, Sina Hajimiri, Ismail Ben Ayed, Jose Dolz

    Abstract: Efficient transfer learning (ETL) is receiving increasing attention to adapt large pre-trained language-vision models on downstream tasks with a few labeled samples. While significant progress has been made, we reveal that state-of-the-art ETL approaches exhibit strong performance only in narrowly-defined experimental setups, and with a careful adjustment of hyperparameters based on a large corpus…

    Submitted 25 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Code: https://github.com/jusiro/CLAP

  3. arXiv:2211.14126  [pdf, other]

    cs.CV

    A Strong Baseline for Generalized Few-Shot Semantic Segmentation

    Authors: Sina Hajimiri, Malik Boudiaf, Ismail Ben Ayed, Jose Dolz

    Abstract: This paper introduces a generalized few-shot segmentation framework with a straightforward training process and an easy-to-optimize inference phase. In particular, we propose a simple yet effective model based on the well-known InfoMax principle, where the Mutual Information (MI) between the learned feature representations and their corresponding predictions is maximized. In addition, the terms de…

    Submitted 3 April, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023

  4. arXiv:2102.00892  [pdf, other]

    cs.LG stat.ML

    Semi-Supervised Disentanglement of Class-Related and Class-Independent Factors in VAE

    Authors: Sina Hajimiri, Aryo Lotfi, Mahdieh Soleymani Baghshah

    Abstract: In recent years, extending variational autoencoder's framework to learn disentangled representations has received much attention. We address this problem by proposing a framework capable of disentangling class-related and class-independent factors of variation in data. Our framework employs an attention mechanism in its latent space in order to improve the process of extracting class-related facto…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: 16 pages, 10 figures