Skip to main content

Showing 1–3 of 3 results for author: Heyden, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.00511  [pdf, other

    cs.CV

    RevCD -- Reversed Conditional Diffusion for Generalized Zero-Shot Learning

    Authors: William Heyden, Habib Ullah, M. Salman Siddiqui, Fadi Al Machot

    Abstract: In Generalized Zero-Shot Learning (GZSL), we aim to recognize both seen and unseen categories using a model trained only on seen categories. In computer vision, this translates into a classification problem, where knowledge from seen categories is transferred to unseen categories by exploiting the relationships between visual features and available semantic information, such as text corpora or man… ▽ More

    Submitted 19 May, 2025; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: Accepted as Full Paper of DeLTA 2025. The Conference Proceedings will be published by Springer

  2. SEER-ZSL: Semantic Encoder-Enhanced Representations for Generalized Zero-Shot Learning

    Authors: William Heyden, Habib Ullah, M. Salman Siddiqui, Fadi Al Machot

    Abstract: Zero-Shot Learning (ZSL) presents the challenge of identifying categories not seen during training. This task is crucial in domains where it is costly, prohibited, or simply not feasible to collect training data. ZSL depends on a mapping between the visual space and available semantic information. Prior works learn a mapping between spaces that can be exploited during inference. We contend, howeve… ▽ More

    Submitted 6 January, 2025; v1 submitted 20 December, 2023; originally announced December 2023.

  3. An Integral Projection-based Semantic Autoencoder for Zero-Shot Learning

    Authors: William Heyden, Habib Ullah, M. Salman Siddiqui, Fadi Al Machot

    Abstract: Zero-shot Learning (ZSL) classification categorizes or predicts classes (labels) that are not included in the training set (unseen classes). Recent works proposed different semantic autoencoder (SAE) models where the encoder embeds a visual feature vector space into the semantic space and the decoder reconstructs the original visual feature space. The objective is to learn the embedding by leverag… ▽ More

    Submitted 11 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.