Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

Yevick, David

Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.13388 (cs)

[Submitted on 18 Jun 2022]

Title:Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

Authors:David Yevick

View PDF

Abstract:This paper demonstrates that a simple modification of the variational autoencoder (VAE) formalism enables the method to identify and classify rotated and distorted digits. In particular, the conventional objective (cost) function employed during the training process of a VAE both quantifies the agreement between the input and output data records and ensures that the latent space representation of the input data record is statistically generated with an appropriate mean and standard deviation. After training, simulated data realizations are generated by decoding appropriate latent space points. Since, however, standard VAE:s trained on randomly rotated MNIST digits cannot reliably distinguish between different digit classes since the rotated input data is effectively compared to a similarly rotated output data record. In contrast, an alternative implementation in which the objective function compares the output associated with each rotated digit to a corresponding fixed unreferenced reference digit is shown here to discriminate accurately among the rotated digits in latent space even when the dimension of the latent space is 2 or 3.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2206.13388 [cs.CV]
	(or arXiv:2206.13388v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.13388

Submission history

From: David Yevick [view email]
[v1] Sat, 18 Jun 2022 00:21:49 UTC (2,399 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators