Zero-Shot Video Editing through Adaptive Sliding Score Distillation

Zhu, Lianghan; Bao, Yanqi; Huo, Jing; Wu, Jing; Lai, Yu-Kun; Li, Wenbin; Gao, Yang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.04888 (cs)

[Submitted on 7 Jun 2024 (v1), last revised 6 Sep 2024 (this version, v2)]

Title:Zero-Shot Video Editing through Adaptive Sliding Score Distillation

Authors:Lianghan Zhu, Yanqi Bao, Jing Huo, Jing Wu, Yu-Kun Lai, Wenbin Li, Yang Gao

View PDF HTML (experimental)

Abstract:The rapidly evolving field of Text-to-Video generation (T2V) has catalyzed renewed interest in controllable video editing research. While the application of editing prompts to guide diffusion model denoising has gained prominence, mirroring advancements in image editing, this noise-based inference process inherently compromises the original video's integrity, resulting in unintended over-editing and temporal discontinuities. To address these challenges, this study proposes a novel paradigm of video-based score distillation, facilitating direct manipulation of original video content. Specifically, distinguishing it from image-based score distillation, we propose an Adaptive Sliding Score Distillation strategy, which incorporates both global and local video guidance to reduce the impact of editing errors. Combined with our proposed Image-based Joint Guidance mechanism, it has the ability to mitigate the inherent instability of the T2V model and single-step sampling. Additionally, we design a Weighted Attention Fusion module to further preserve the key features of the original video and avoid over-editing. Extensive experiments demonstrate that these strategies effectively address existing challenges, achieving superior performance compared to current state-of-the-art methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.04888 [cs.CV]
	(or arXiv:2406.04888v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.04888

Submission history

From: Lianghan Zhu [view email]
[v1] Fri, 7 Jun 2024 12:33:59 UTC (11,073 KB)
[v2] Fri, 6 Sep 2024 14:55:48 UTC (11,537 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-Shot Video Editing through Adaptive Sliding Score Distillation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-Shot Video Editing through Adaptive Sliding Score Distillation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators