Showing 1–1 of 1 results for author: Tyndel, M S

  1. arXiv:2106.04283

    cs.SD cs.AI cs.CV cs.LG eess.AS

    NWT: Towards natural audio-to-video generation with representation learning

    Authors: Rayhane Mama, Marc S. Tyndel, Hashiam Kadhim, Cole Clifford, Ragavan Thurairatnam

    Abstract: In this work we introduce NWT, an expressive speech-to-video model. Unlike approaches that use domain-specific intermediate representations such as pose keypoints, NWT learns its own latent representations, with minimal assumptions about the audio and video content. To this end, we propose a novel discrete variational autoencoder with adversarial loss, dVAE-Adv, which learns a new discrete latent…

    Submitted 8 June, 2021; originally announced June 2021.