D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

Susladkar, Onkar; Deshmukh, Gayatri; Mittal, Sparsh; Shastri, Parth

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.03558 (cs)

[Submitted on 7 Aug 2024]

Title:D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

Authors:Onkar Susladkar, Gayatri Deshmukh, Sparsh Mittal, Parth Shastri

View PDF HTML (experimental)

Abstract:In image processing, one of the most challenging tasks is to render an image's semantic meaning using a variety of artistic approaches. Existing techniques for arbitrary style transfer (AST) frequently experience mode-collapse, over-stylization, or under-stylization due to a disparity between the style and content images. We propose a novel framework called D$^2$Styler (Discrete Diffusion Styler) that leverages the discrete representational capability of VQ-GANs and the advantages of discrete diffusion, including stable training and avoidance of mode collapse. Our method uses Adaptive Instance Normalization (AdaIN) features as a context guide for the reverse diffusion process. This makes it easy to move features from the style image to the content image without bias. The proposed method substantially enhances the visual quality of style-transferred images, allowing the combination of content and style in a visually appealing manner. We take style images from the WikiArt dataset and content images from the COCO dataset. Experimental results demonstrate that D$^2$Styler produces high-quality style-transferred images and outperforms twelve existing methods on nearly all the metrics. The qualitative results and ablation studies provide further insights into the efficacy of our technique. The code is available at this https URL.

Comments:	Paper accepted at 27th International Conference on Pattern Recognition (ICPR), 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.03558 [cs.CV]
	(or arXiv:2408.03558v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.03558

Submission history

From: Sparsh Mittal [view email]
[v1] Wed, 7 Aug 2024 05:47:06 UTC (17,810 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators