Generative Adversarial Network Architectures For Image Synthesis Using Capsule Networks

Upadhyay, Yash; Schrater, Paul

Computer Science > Computer Vision and Pattern Recognition

arXiv:1806.03796v2 (cs)

[Submitted on 11 Jun 2018 (v1), revised 25 Jul 2018 (this version, v2), latest version 20 Nov 2018 (v4)]

Title:Generative Adversarial Network Architectures For Image Synthesis Using Capsule Networks

Authors:Yash Upadhyay, Paul Schrater

View PDF

Abstract:In this paper, we propose Generative Adversarial Network (GAN) architectures using Capsule Networks for conditional and random image-synthesis. Capsule Networks encode meta-properties and spatial relationships between the features of the image, which helps it become a more powerful critic in comparison to the Convolutional Neural Networks (CNNs) used in current architectures for image synthesis. Our architectures use losses analogous to Wasserstein loss and Capsule Networks, which prove to be a more effective critic in comparison to CNNs. Thus, our proposed GAN architectures learn the data manifold much faster and therefore, show significant reduction in the number of training samples required to train when compared to the current work horses for image synthesis, DCGANs and its variants which utilize CNNs as discriminators. Also, our architecture generalizes over the datasets' manifold much better because of dynamic routing between capsules which is a more robust algorithm for feature globalization in comparison to max-pooling used by CNNs. This helps synthesize more diverse, yet visually accurate images. We have demonstrated the performance of our architectures over MNIST, Fashion-MNIST and their variants and compared them with the images synthesised using Improved Wasserstein GANs that use CNNs.

Comments:	NIPS 2018 submission pre-print
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1806.03796 [cs.CV]
	(or arXiv:1806.03796v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1806.03796

Submission history

From: Yash Upadhyay [view email]
[v1] Mon, 11 Jun 2018 03:54:24 UTC (1,275 KB)
[v2] Wed, 25 Jul 2018 07:55:20 UTC (1,368 KB)
[v3] Mon, 19 Nov 2018 15:45:28 UTC (3,458 KB)
[v4] Tue, 20 Nov 2018 18:33:11 UTC (3,462 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Adversarial Network Architectures For Image Synthesis Using Capsule Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Generative Adversarial Network Architectures For Image Synthesis Using Capsule Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators