Skip to main content

Showing 1–1 of 1 results for author: Vanderbyl, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.23320  [pdf, other

    eess.AS cs.AI cs.SD

    Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

    Authors: Théodor Lemerle, Harrison Vanderbyl, Vaibhav Srivastav, Nicolas Obin, Axel Roebel

    Abstract: Neural codec language models have achieved state-of-the-art performance in text-to-speech (TTS) synthesis, leveraging scalable architectures like autoregressive transformers and large-scale speech datasets. By framing voice cloning as a prompt continuation task, these models excel at cloning voices from short audio samples. However, this approach is limited in its ability to handle numerous or len… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: Preprint