Deep Perceptual Compression

Patel, Yash; Appalaraju, Srikar; Manmatha, R.

arXiv:1907.08310 (eess)

[Submitted on 18 Jul 2019 (v1), last revised 31 Jul 2019 (this version, v2)]

Abstract: Several deep learned lossy compression techniques have been proposed in the recent literature. Most of these are optimized using either MS-SSIM (multi-scale structural similarity) or MSE (mean squared error) as the loss function. Unfortunately, neither of these correlates well with human perception, and this is clearly visible in the resulting compressed images. In several cases, the MS-SSIM of deep learned techniques is higher than that of a conventional, non-deep-learned codec such as JPEG-2000 or BPG. However, the images produced by these deep learned techniques are in many cases clearly worse to human eyes than those produced by JPEG-2000 or BPG.
We propose the use of an alternative, deep perceptual metric, which has been shown to align better with human perceptual similarity. We then propose Deep Perceptual Compression (DPC), which uses an encoder-decoder-based image compression model to jointly optimize the deep perceptual metric and MS-SSIM. Via extensive human evaluations, we show that the proposed method generates visually better results than previous learning-based compression methods and JPEG-2000, and is comparable to BPG. Furthermore, we demonstrate that for tasks like object detection, images compressed with DPC give better accuracy.
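
The abstract does not state how the deep perceptual metric and MS-SSIM are combined during training. As a minimal sketch, assuming a simple weighted sum of two distortion terms, the idea of a joint objective might look like the following. The names `joint_distortion`, `toy_perceptual_distance`, `toy_structural_distance`, and the weight `alpha` are hypothetical and do not come from the paper; the two toy functions are simple stand-ins (mean absolute error and mean squared error), not the learned perceptual metric or MS-SSIM.

```python
from typing import Callable, Sequence

Image = Sequence[float]  # flattened pixel intensities in [0, 1]
Metric = Callable[[Image, Image], float]


def _check_same_length(x: Image, y: Image) -> None:
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("images must be non-empty and of equal length")


def toy_perceptual_distance(x: Image, y: Image) -> float:
    """Stand-in for a deep perceptual distance (here: mean absolute error)."""
    _check_same_length(x, y)
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


def toy_structural_distance(x: Image, y: Image) -> float:
    """Stand-in for a structural term such as 1 - MS-SSIM (here: MSE)."""
    _check_same_length(x, y)
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


def joint_distortion(
    x: Image,
    y: Image,
    perceptual: Metric = toy_perceptual_distance,
    structural: Metric = toy_structural_distance,
    alpha: float = 0.5,
) -> float:
    """Assumed form: alpha * perceptual + (1 - alpha) * structural.

    Both terms are distances, so lower values mean a closer reconstruction.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * perceptual(x, y) + (1.0 - alpha) * structural(x, y)


if __name__ == "__main__":
    original = [0.0, 0.5, 1.0]
    reconstruction = [0.0, 0.5, 0.5]
    print(joint_distortion(original, reconstruction, alpha=0.5))
```

In an actual training setup the two callables would be replaced by a differentiable perceptual network distance and a differentiable MS-SSIM implementation, and a lossy codec would typically also carry a bit-rate term; none of those details are specified in the abstract.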

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1907.08310 [eess.IV]
	(or arXiv:1907.08310v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.1907.08310

Submission history

From: Yash Patel
[v1] Thu, 18 Jul 2019 22:17:52 UTC (14,498 KB)
[v2] Wed, 31 Jul 2019 21:17:27 UTC (14,505 KB)
