Showing 1–1 of 1 results for author: Al-Sinan, A

  1. arXiv:2505.08175

    cs.SD cs.AI cs.LG cs.MM eess.AS

    Fast Text-to-Audio Generation with Adversarial Post-Training

    Authors: Zachary Novack, Zach Evans, Zack Zukowski, Josiah Taylor, CJ Carr, Julian Parker, Adnan Al-Sinan, Gian Marco Iodice, Julian McAuley, Taylor Berg-Kirkpatrick, Jordi Pons

    Abstract: Text-to-audio systems, while increasingly performant, are slow at inference time, making their latency impractical for many creative applications. We present Adversarial Relativistic-Contrastive (ARC) post-training, the first adversarial acceleration algorithm for diffusion/flow models not based on distillation. While past adversarial post-training methods have struggled to compare against th…

    Submitted 14 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.