Deep Learning on Small Datasets without Pre-Training using Cosine Loss

Barz, Björn; Denzler, Joachim

Computer Science > Machine Learning

arXiv:1901.09054 (cs)

[Submitted on 25 Jan 2019 (v1), last revised 11 Dec 2019 (this version, v2)]

Title:Deep Learning on Small Datasets without Pre-Training using Cosine Loss

Authors:Björn Barz, Joachim Denzler

View PDF

Abstract:Two things seem to be indisputable in the contemporary deep learning discourse: 1. The categorical cross-entropy loss after softmax activation is the method of choice for classification. 2. Training a CNN classifier from scratch on small datasets does not work well. In contrast to this, we show that the cosine loss function provides significantly better performance than cross-entropy on datasets with only a handful of samples per class. For example, the accuracy achieved on the CUB-200-2011 dataset without pre-training is by 30% higher than with the cross-entropy loss. Further experiments on other popular datasets confirm our findings. Moreover, we demonstrate that integrating prior knowledge in the form of class hierarchies is straightforward with the cosine loss and improves classification performance further.

Comments:	Presented at WACV 2020
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1901.09054 [cs.LG]
	(or arXiv:1901.09054v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.09054
Journal reference:	2020 IEEE Winter Conference on Applications of Computer Vision (WACV), Snowmass Village, CO, USA, 2020

Submission history

From: Björn Barz [view email]
[v1] Fri, 25 Jan 2019 19:13:03 UTC (235 KB)
[v2] Wed, 11 Dec 2019 15:15:46 UTC (197 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
cs.CV
stat
stat.ML

References & Citations

1 blog link

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Björn Barz
Joachim Denzler

export BibTeX citation

Computer Science > Machine Learning

Title:Deep Learning on Small Datasets without Pre-Training using Cosine Loss

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Learning on Small Datasets without Pre-Training using Cosine Loss

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators