When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability

Xuan, Wenjie; Xu, Yufei; Zhao, Shanshan; Wang, Chaoyue; Liu, Juhua; Du, Bo; Tao, Dacheng

doi:10.1145/3664647.3680692

arXiv:2403.00467 (cs)

[Submitted on 1 Mar 2024 (v1), last revised 15 Oct 2024 (this version, v3)]

Abstract: ControlNet excels at creating content that closely matches precise contours in user-provided masks. However, when these masks contain noise, as often happens with non-expert users, the output includes unwanted artifacts. Through in-depth analysis, this paper first highlights how crucial it is to control the impact of such inexplicit masks across diverse deterioration levels. Then, to enhance controllability with inexplicit masks, it devises an advanced Shape-aware ControlNet consisting of a deterioration estimator and a shape-prior modulation block. The deterioration estimator assesses the deterioration factor of the provided mask. The modulation block then uses this factor to adaptively modulate the model's contour-following ability, which helps it disregard the noisy parts of inexplicit masks. Extensive experiments demonstrate its effectiveness in encouraging ControlNet to interpret inaccurate spatial conditions robustly rather than blindly following the given contours, and show that it suits diverse kinds of conditions. We showcase application scenarios such as modifying shape priors and composable shape-controllable generation. Code is available on GitHub.
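The abstract does not say how a mask deteriorates or how the deterioration factor is defined. As a rough illustration only, one way to picture an inexplicit mask is a precise binary mask whose contour has been coarsened by dilation, with the dilation radius standing in for a deterioration level. The sketch below makes that assumption explicit; `dilate_mask`, `area`, and the square window are hypothetical choices for illustration, not the authors' estimator or modulation block.

```python
# Illustrative sketch only: models an "inexplicit" mask as a dilated copy of a
# precise binary mask. The dilation radius is an ASSUMED stand-in for a
# deterioration level; the abstract does not define it this way.

def dilate_mask(mask, radius):
    """Dilate a binary mask (list of rows of 0/1) using a square window."""
    height, width = len(mask), len(mask[0])
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if mask[y][x]:
                # Mark every pixel within `radius` of a foreground pixel,
                # clipping the window at the image border.
                for ny in range(max(0, y - radius), min(height, y + radius + 1)):
                    for nx in range(max(0, x - radius), min(width, x + radius + 1)):
                        out[ny][nx] = 1
    return out


def area(mask):
    """Count foreground pixels in a binary mask."""
    return sum(sum(row) for row in mask)


# A 7x7 canvas with a single-pixel "object" at the centre.
precise = [[0] * 7 for _ in range(7)]
precise[3][3] = 1

for radius in (0, 1, 2):
    coarse = dilate_mask(precise, radius)
    print(radius, area(coarse))  # prints "0 1", then "1 9", then "2 25"
```

The growing area shows how a larger radius discards more contour detail. A model that follows such a mask strictly would reproduce the inflated shape, which is the kind of behaviour the abstract describes as blindly following the given contours.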

Comments:	Accepted by ACM-MM 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.00467 [cs.CV]
	(or arXiv:2403.00467v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.00467
Related DOI:	https://doi.org/10.1145/3664647.3680692

Submission history

From: Wenjie Xuan
[v1] Fri, 1 Mar 2024 11:45:29 UTC (35,326 KB)
[v2] Wed, 28 Aug 2024 09:11:40 UTC (35,142 KB)
[v3] Tue, 15 Oct 2024 01:42:21 UTC (27,598 KB)
