Evaluating Text-to-Image Matching using Binary Image Selection (BISON)

Hu, Hexiang; Misra, Ishan; van der Maaten, Laurens

Computer Science > Computer Vision and Pattern Recognition

arXiv:1901.06595 (cs)

[Submitted on 19 Jan 2019 (v1), last revised 5 Apr 2019 (this version, v2)]

Title:Evaluating Text-to-Image Matching using Binary Image Selection (BISON)

Authors:Hexiang Hu, Ishan Misra, Laurens van der Maaten

View PDF

Abstract:Providing systems the ability to relate linguistic and visual content is one of the hallmarks of computer vision. Tasks such as text-based image retrieval and image captioning were designed to test this ability but come with evaluation measures that have a high variance or are difficult to interpret. We study an alternative task for systems that match text and images: given a text query, the system is asked to select the image that best matches the query from a pair of semantically similar images. The system's accuracy on this Binary Image SelectiON (BISON) task is interpretable, eliminates the reliability problems of retrieval evaluations, and focuses on the system's ability to understand fine-grained visual structure. We gather a BISON dataset that complements the COCO dataset and use it to evaluate modern text-based image retrieval and image captioning systems. Our results provide novel insights into the performance of these systems. The COCO-BISON dataset and corresponding evaluation code are publicly available from \url{this http URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:1901.06595 [cs.CV]
	(or arXiv:1901.06595v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1901.06595

Submission history

From: Hexiang Hu [view email]
[v1] Sat, 19 Jan 2019 22:12:01 UTC (8,239 KB)
[v2] Fri, 5 Apr 2019 16:34:48 UTC (8,152 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
cs.AI
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hexiang Hu
Ishan Misra
Laurens van der Maaten

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Evaluating Text-to-Image Matching using Binary Image Selection (BISON)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Evaluating Text-to-Image Matching using Binary Image Selection (BISON)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators