DiffArtist: Towards Structure and Appearance Controllable Image Stylization

Jiang, Ruixiang; Chen, Changwen

doi:10.1145/3746027.3755010

arXiv:2407.15842 (cs)

[Submitted on 22 Jul 2024 (v1), last revised 27 Aug 2025 (this version, v4)]

Abstract: Artistic styles are defined by both their structural and appearance elements. Existing neural stylization techniques primarily focus on transferring appearance-level features such as color and texture, often neglecting the equally crucial aspect of structural stylization. To address this gap, we introduce DiffArtist, the first 2D stylization method to offer fine-grained, simultaneous control over both structure and appearance style strength. This dual controllability is achieved by representing structure and appearance generation as separate diffusion processes, necessitating no further tuning or additional adapters. To properly evaluate this new capability of dual stylization, we further propose a Multimodal LLM-based stylization evaluator that aligns significantly better with human preferences than existing metrics. Extensive analysis shows that DiffArtist achieves superior style fidelity and dual-controllability compared to state-of-the-art methods. Its text-driven, training-free design and unprecedented dual controllability make it a powerful and interactive tool for various creative applications. Project homepage: this https URL.

Comments:	Accepted to ACM MM 2025, Homepage: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2407.15842 [cs.CV]
	(or arXiv:2407.15842v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.15842
Related DOI:	https://doi.org/10.1145/3746027.3755010

Submission history

From: Ruixiang Jiang
[v1] Mon, 22 Jul 2024 17:58:05 UTC (13,236 KB)
[v2] Sun, 22 Dec 2024 10:03:12 UTC (44,638 KB)
[v3] Wed, 23 Apr 2025 17:46:08 UTC (41,480 KB)
[v4] Wed, 27 Aug 2025 10:30:27 UTC (38,851 KB)
