Skip to main content

Showing 1–1 of 1 results for author: Kritsis, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.13437  [pdf, other

    cs.SD cs.LG eess.AS

    Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss

    Authors: Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos

    Abstract: Recent deep learning Text-to-Speech (TTS) systems have achieved impressive performance by generating speech close to human parity. However, they suffer from training stability issues as well as incorrect alignment of the intermediate acoustic representation with the input text sequence. In this work, we introduce Regotron, a regularized version of Tacotron2 which aims to alleviate the training iss… ▽ More

    Submitted 14 July, 2022; v1 submitted 28 April, 2022; originally announced April 2022.