Learning robust visual representations using data augmentation invariance

Hernández-García, Alex; König, Peter; Kietzmann, Tim C.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1906.04547 (cs)

[Submitted on 11 Jun 2019]

Title:Learning robust visual representations using data augmentation invariance

Authors:Alex Hernández-García, Peter König, Tim C. Kietzmann

View PDF

Abstract:Deep convolutional neural networks trained for image object categorization have shown remarkable similarities with representations found across the primate ventral visual stream. Yet, artificial and biological networks still exhibit important differences. Here we investigate one such property: increasing invariance to identity-preserving image transformations found along the ventral stream. Despite theoretical evidence that invariance should emerge naturally from the optimization process, we present empirical evidence that the activations of convolutional neural networks trained for object categorization are not robust to identity-preserving image transformations commonly used in data augmentation. As a solution, we propose data augmentation invariance, an unsupervised learning objective which improves the robustness of the learned representations by promoting the similarity between the activations of augmented image samples. Our results show that this approach is a simple, yet effective and efficient (10 % increase in training time) way of increasing the invariance of the models while obtaining similar categorization performance.

Comments:	6 pages, 2 figures, work in progress
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1906.04547 [cs.CV]
	(or arXiv:1906.04547v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1906.04547

Submission history

From: Alex Hernández García [view email]
[v1] Tue, 11 Jun 2019 13:03:19 UTC (160 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-06

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Alex Hernández-García
Peter König
Tim C. Kietzmann

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning robust visual representations using data augmentation invariance

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning robust visual representations using data augmentation invariance

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators