VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

Jain, Ajay; Xie, Amber; Abbeel, Pieter

Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.11319 (cs)

[Submitted on 21 Nov 2022]

Title:VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

Authors:Ajay Jain, Amber Xie, Pieter Abbeel

View PDF

Abstract:Diffusion models have shown impressive results in text-to-image synthesis. Using massive datasets of captioned images, diffusion models learn to generate raster images of highly diverse objects and scenes. However, designers frequently use vector representations of images like Scalable Vector Graphics (SVGs) for digital icons or art. Vector graphics can be scaled to any size, and are compact. We show that a text-conditioned diffusion model trained on pixel representations of images can be used to generate SVG-exportable vector graphics. We do so without access to large datasets of captioned SVGs. By optimizing a differentiable vector graphics rasterizer, our method, VectorFusion, distills abstract semantic knowledge out of a pretrained diffusion model. Inspired by recent text-to-3D work, we learn an SVG consistent with a caption using Score Distillation Sampling. To accelerate generation and improve fidelity, VectorFusion also initializes from an image sample. Experiments show greater quality than prior work, and demonstrate a range of styles including pixel art and sketches. See our project webpage at this https URL .

Comments:	Project webpage: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2211.11319 [cs.CV]
	(or arXiv:2211.11319v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.11319

Submission history

From: Ajay Jain [view email]
[v1] Mon, 21 Nov 2022 10:04:27 UTC (20,097 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators