Showing 1–1 of 1 results for author: Foti, P

  1. arXiv:2504.08470

    cs.SD cs.AI cs.MM eess.AS

    On the Design of Diffusion-based Neural Speech Codecs

    Authors: Pietro Foti, Andreas Brendel

    Abstract: Recently, neural speech codecs (NSCs) trained as generative models have shown superior performance compared to conventional codecs at low bitrates. Although most state-of-the-art NSCs are trained as Generative Adversarial Networks (GANs), Diffusion Models (DMs), a recent class of generative models, represent a promising alternative due to their superior performance in image generation relative to…

    Submitted 11 April, 2025; originally announced April 2025.