Joint Learning of Distributed Representations for Images and Texts

He, Xiaodong; Srivastava, Rupesh; Gao, Jianfeng; Deng, Li

Computer Science > Computer Vision and Pattern Recognition

arXiv:1504.03083 (cs)

This paper has been withdrawn by Xiaodong He

[Submitted on 13 Apr 2015 (v1), last revised 28 Apr 2015 (this version, v2)]

Title:Joint Learning of Distributed Representations for Images and Texts

Authors:Xiaodong He, Rupesh Srivastava, Jianfeng Gao, Li Deng

No PDF available, click to view other formats

Abstract:This technical report provides extra details of the deep multimodal similarity model (DMSM) which was proposed in (Fang et al. 2015, arXiv:1411.4952). The model is trained via maximizing global semantic similarity between images and their captions in natural language using the public Microsoft COCO database, which consists of a large set of images and their corresponding captions. The learned representations attempt to capture the combination of various visual concepts and cues.

Comments:	This is a previous tech report of a part of the work of arXiv:1411.4952. In order to avoid confusion, we'd like to withdraw this report from arXiv
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1504.03083 [cs.CV]
	(or arXiv:1504.03083v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1504.03083

Submission history

From: Xiaodong He [view email]
[v1] Mon, 13 Apr 2015 07:36:08 UTC (688 KB)
[v2] Tue, 28 Apr 2015 17:24:00 UTC (1 KB) (withdrawn)

Full-text links:

Access Paper:

Withdrawn

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xiaodong He
Rupesh Kumar Srivastava
Rupesh Srivastava
Rupesh K. Srivastava
Jianfeng Gao

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Learning of Distributed Representations for Images and Texts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Learning of Distributed Representations for Images and Texts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators