Transformational Sparse Coding

Gklezakos, Dimitrios C.; Rao, Rajesh P. N.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1712.03257 (cs)

[Submitted on 8 Dec 2017]

Title:Transformational Sparse Coding

Authors:Dimitrios C. Gklezakos, Rajesh P. N. Rao

View PDF

Abstract:A fundamental problem faced by object recognition systems is that objects and their features can appear in different locations, scales and orientations. Current deep learning methods attempt to achieve invariance to local translations via pooling, discarding the locations of features in the process. Other approaches explicitly learn transformed versions of the same feature, leading to representations that quickly explode in size. Instead of discarding the rich and useful information about feature transformations to achieve invariance, we argue that models should learn object features conjointly with their transformations to achieve equivariance. We propose a new model of unsupervised learning based on sparse coding that can learn object features jointly with their affine transformations directly from images. Results based on learning from natural images indicate that our approach matches the reconstruction quality of traditional sparse coding but with significantly fewer degrees of freedom while simultaneously learning transformations from data. These results open the door to scaling up unsupervised learning to allow deep feature+transformation learning in a manner consistent with the ventral+dorsal stream architecture of the primate visual cortex.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1712.03257 [cs.CV]
	(or arXiv:1712.03257v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1712.03257

Submission history

From: Dimitrios Christoforos Gklezakos [view email]
[v1] Fri, 8 Dec 2017 19:21:15 UTC (6,445 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dimitrios C. Gklezakos
Rajesh P. N. Rao

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Transformational Sparse Coding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transformational Sparse Coding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators