FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

Olóndriz, David Amat; Puigdevall, Ponç Palau; Palau, Adrià Salvador

Computer Science > Computer Vision and Pattern Recognition

arXiv:2110.02035 (cs)

[Submitted on 5 Oct 2021 (v1), last revised 26 Aug 2022 (this version, v2)]

Title:FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

Authors:David Amat Olóndriz, Ponç Palau Puigdevall, Adrià Salvador Palau

View PDF

Abstract:In this paper we introduce the FooDI-ML dataset. This dataset contains over 1.5M unique images and over 9.5M store names, product names descriptions, and collection sections gathered from the Glovo application. The data made available corresponds to food, drinks and groceries products from 37 countries in Europe, the Middle East, Africa and Latin America. The dataset comprehends 33 languages, including 870K samples of languages of countries from Eastern Europe and Western Asia such as Ukrainian and Kazakh, which have been so far underrepresented in publicly available visio-linguistic datasets. The dataset also includes widely spoken languages such as Spanish and English. To assist further research, we include benchmarks over two tasks: text-image retrieval and conditional image generation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2110.02035 [cs.CV]
	(or arXiv:2110.02035v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2110.02035

Submission history

From: Adrià Salvador Palau [view email]
[v1] Tue, 5 Oct 2021 13:33:08 UTC (14,554 KB)
[v2] Fri, 26 Aug 2022 11:23:29 UTC (24,587 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-10

Change to browse by:

cs
cs.CL
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators