Training Deep Neural Networks via Direct Loss Minimization

Song, Yang; Schwing, Alexander G.; Zemel, Richard S.; Urtasun, Raquel

Computer Science > Machine Learning

arXiv:1511.06411 (cs)

[Submitted on 19 Nov 2015 (v1), last revised 2 Jun 2016 (this version, v2)]

Title:Training Deep Neural Networks via Direct Loss Minimization

Authors:Yang Song, Alexander G. Schwing, Richard S. Zemel, Raquel Urtasun

View PDF

Abstract:Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization approach to train deep neural networks, which provably minimizes the application-specific loss function. This is often non-trivial, since these functions are neither smooth nor decomposable and thus are not amenable to optimization with standard gradient-based methods. We demonstrate the effectiveness of our approach in the context of maximizing average precision for ranking problems. Towards this goal, we develop a novel dynamic programming algorithm that can efficiently compute the weight updates. Our approach proves superior to a variety of baselines in the context of action classification and object detection, especially in the presence of label noise.

Comments:	ICML2016
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1511.06411 [cs.LG]
	(or arXiv:1511.06411v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1511.06411

Submission history

From: Yang Song [view email]
[v1] Thu, 19 Nov 2015 22:02:26 UTC (504 KB)
[v2] Thu, 2 Jun 2016 00:56:59 UTC (764 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2015-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yang Song
Alexander G. Schwing
Richard S. Zemel
Raquel Urtasun

export BibTeX citation

Computer Science > Machine Learning

Title:Training Deep Neural Networks via Direct Loss Minimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Training Deep Neural Networks via Direct Loss Minimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators