ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

Delmas, Ginger; de Rezende, Rafael Sampaio; Csurka, Gabriela; Larlus, Diane

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.08101 (cs)

[Submitted on 15 Mar 2022 (v1), last revised 16 May 2022 (this version, v2)]

Title:ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

Authors:Ginger Delmas, Rafael Sampaio de Rezende, Gabriela Csurka, Diane Larlus

View PDF

Abstract:An intuitive way to search for images is to use queries composed of an example image and a complementary text. While the first provides rich and implicit context for the search, the latter explicitly calls for new traits, or specifies how some elements of the example image should be changed to retrieve the desired target image. Current approaches typically combine the features of each of the two elements of the query into a single representation, which can then be compared to the ones of the potential target images. Our work aims at shedding new light on the task by looking at it through the prism of two familiar and related frameworks: text-to-image and image-to-image retrieval. Taking inspiration from them, we exploit the specific relation of each query element with the targeted image and derive light-weight attention mechanisms which enable to mediate between the two complementary modalities. We validate our approach on several retrieval benchmarks, querying with images and their associated free-form text modifiers. Our method obtains state-of-the-art results without resorting to side information, multi-level features, heavy pre-training nor large architectures as in previous works.

Comments:	Published in ICLR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
Cite as:	arXiv:2203.08101 [cs.CV]
	(or arXiv:2203.08101v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.08101

Submission history

From: Rafael Sampaio de Rezende [view email]
[v1] Tue, 15 Mar 2022 17:29:20 UTC (8,649 KB)
[v2] Mon, 16 May 2022 15:20:04 UTC (8,594 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators