Skip to main content

Showing 1–3 of 3 results for author: Nortje, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:1912.05193  [pdf, other

    eess.IV cs.CV

    Deep motion estimation for parallel inter-frame prediction in video compression

    Authors: André Nortje, Herman A. Engelbrecht, Herman Kamper

    Abstract: Standard video codecs rely on optical flow to guide inter-frame prediction: pixels from reference frames are moved via motion vectors to predict target video frames. We propose to learn binary motion codes that are encoded based on an input video sequence. These codes are not limited to 2D translations, but can capture complex motion (warping, rotation and occlusion). Our motion codes are learned… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: 25 pages, 11 figures, 5 tables

  2. BINet: a binary inpainting network for deep patch-based image compression

    Authors: André Nortje, Willie Brink, Herman A. Engelbrecht, Herman Kamper

    Abstract: Recent deep learning models outperform standard lossy image compression codecs. However, applying these models on a patch-by-patch basis requires that each image patch be encoded and decoded independently. The influence from adjacent patches is therefore lost, leading to block artefacts at low bitrates. We propose the Binary Inpainting Network (BINet), an autoencoder framework which incorporates b… ▽ More

    Submitted 13 January, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: Signal Processing: Image Communication

    Journal ref: Signal Processing: Image Communication 92C (2021) 116119

  3. arXiv:1904.07556  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks

    Authors: Ryan Eloff, André Nortje, Benjamin van Niekerk, Avashna Govender, Leanne Nortje, Arnu Pretorius, Elan van Biljon, Ewald van der Westhuizen, Lisa van Staden, Herman Kamper

    Abstract: For our submission to the ZeroSpeech 2019 challenge, we apply discrete latent-variable neural networks to unlabelled speech and use the discovered units for speech synthesis. Unsupervised discrete subword modelling could be useful for studies of phonetic category learning in infants or in low-resource speech technology requiring symbolic input. We use an autoencoder (AE) architecture with intermed… ▽ More

    Submitted 28 June, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: Interspeech 2019