Transductive Zero-Shot Action Recognition by Word-Vector Embedding

Xu, Xun; Hospedales, Timothy; Gong, Shaogang

Computer Science > Computer Vision and Pattern Recognition

arXiv:1511.04458 (cs)

[Submitted on 13 Nov 2015 (v1), last revised 2 Dec 2016 (this version, v2)]

Title:Transductive Zero-Shot Action Recognition by Word-Vector Embedding

Authors:Xun Xu, Timothy Hospedales, Shaogang Gong

View PDF

Abstract:The number of categories for action recognition is growing rapidly and it has become increasingly hard to label sufficient training data for learning conventional models for all categories. Instead of collecting ever more data and labelling them exhaustively for all categories, an attractive alternative approach is zero-shot learning" (ZSL). To that end, in this study we construct a mapping between visual features and a semantic descriptor of each action category, allowing new categories to be recognised in the absence of any visual training data. Existing ZSL studies focus primarily on still images, and attribute-based semantic representations. In this work, we explore word-vectors as the shared semantic space to embed videos and category labels for ZSL action recognition. This is a more challenging problem than existing ZSL of still images and/or attributes, because the mapping between video spacetime features of actions and the semantic space is more complex and harder to learn for the purpose of generalising over any cross-category domain shift. To solve this generalisation problem in ZSL action recognition, we investigate a series of synergistic strategies to improve upon the standard ZSL pipeline. Most of these strategies are transductive in nature which means access to testing data in the training phase.

Comments:	Accepted by IJCV
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1511.04458 [cs.CV]
	(or arXiv:1511.04458v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1511.04458

Submission history

From: Xun Xu [view email]
[v1] Fri, 13 Nov 2015 21:05:20 UTC (4,866 KB)
[v2] Fri, 2 Dec 2016 07:17:03 UTC (5,675 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Transductive Zero-Shot Action Recognition by Word-Vector Embedding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transductive Zero-Shot Action Recognition by Word-Vector Embedding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators