Skip to main content

Showing 1–1 of 1 results for author: Huang, V S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.21254  [pdf, other

    cs.CV cs.AI cs.MM cs.SD eess.AS

    Vision-to-Music Generation: A Survey

    Authors: Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao

    Abstract: Vision-to-music Generation, including video-to-music and image-to-music tasks, is a significant branch of multimodal artificial intelligence demonstrating vast application prospects in fields such as film scoring, short video creation, and dance music synthesis. However, compared to the rapid development of modalities like text and images, research in vision-to-music is still in its preliminary st… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.