On Learning Semantic Representations for Million-Scale Free-Hand Sketches

Xu, Peng; Huang, Yongye; Yuan, Tongtong; Xiang, Tao; Hospedales, Timothy M.; Song, Yi-Zhe; Wang, Liang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.04101 (cs)

[Submitted on 7 Jul 2020]

Title:On Learning Semantic Representations for Million-Scale Free-Hand Sketches

Authors:Peng Xu, Yongye Huang, Tongtong Yuan, Tao Xiang, Timothy M. Hospedales, Yi-Zhe Song, Liang Wang

View PDF

Abstract:In this paper, we study learning semantic representations for million-scale free-hand sketches. This is highly challenging due to the domain-unique traits of sketches, e.g., diverse, sparse, abstract, noisy. We propose a dual-branch CNNRNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we further explore learning the sketch-oriented semantic representations in two challenging yet practical settings, i.e., hashing retrieval and zero-shot recognition on million-scale sketches. Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to accommodate both the abstract and messy traits of sketches. (ii) We propose a deep embedding model for sketch zero-shot recognition, via collecting a large-scale edge-map dataset and proposing to extract a set of semantic vectors from edge-maps as the semantic knowledge for sketch zero-shot domain alignment. Both deep models are evaluated by comprehensive experiments on million-scale sketches and outperform the state-of-the-art competitors.

Comments:	arXiv admin note: substantial text overlap with arXiv:1804.01401
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.04101 [cs.CV]
	(or arXiv:2007.04101v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.04101

Submission history

From: Peng Xu [view email]
[v1] Tue, 7 Jul 2020 15:23:22 UTC (1,635 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:On Learning Semantic Representations for Million-Scale Free-Hand Sketches

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:On Learning Semantic Representations for Million-Scale Free-Hand Sketches

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators