Showing 1–7 of 7 results for author: Kamper, H

  1. arXiv:2206.11706

    eess.AS cs.CL cs.LG stat.ML

    A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery

    Authors: Werner van der Merwe, Herman Kamper, Johan du Preez

    Abstract: Latent Dirichlet allocation (LDA) is widely used for unsupervised topic modelling on sets of documents. No temporal information is used in the model. However, there is often a relationship between the corresponding topics of consecutive tokens. In this paper, we present an extension to LDA that uses a Markov chain to model temporal information. We use this new model for acoustic unit discovery fro…

    Submitted 29 June, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  2. arXiv:1910.05725

    stat.ML cs.LG

    If dropout limits trainable depth, does critical initialisation still matter? A large-scale statistical analysis on ReLU networks

    Authors: Arnu Pretorius, Elan van Biljon, Benjamin van Niekerk, Ryan Eloff, Matthew Reynard, Steve James, Benjamin Rosman, Herman Kamper, Steve Kroon

    Abstract: Recent work in signal propagation theory has shown that dropout limits the depth to which information can propagate through a neural network. In this paper, we investigate the effect of initialisation on training speed and generalisation for ReLU networks within this depth limit. We ask the following research question: given that critical initialisation is crucial for training at large depth, if d…

    Submitted 20 February, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, under consideration at Pattern Recognition Letters

  3. On the expected behaviour of noise regularised deep neural networks as Gaussian processes

    Authors: Arnu Pretorius, Herman Kamper, Steve Kroon

    Abstract: Recent work has established the equivalence between deep neural networks and Gaussian processes (GPs), resulting in so-called neural network Gaussian processes (NNGPs). The behaviour of these models depends on the initialisation of the corresponding network. In this work, we consider the impact of noise regularisation (e.g. dropout) on NNGPs, and relate their behaviour to signal propagation theory…

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, preliminary work

    Journal ref: Pattern Recognition Letters 138 (2020) 75-81

  4. arXiv:1811.08284

    eess.AS cs.LG cs.SD stat.ML

    Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders

    Authors: Raghav Menon, Herman Kamper, Ewald van der Westhuizen, John Quinn, Thomas Niesler

    Abstract: We compare features for dynamic time warping (DTW) when used to bootstrap keyword spotting (KWS) in an almost zero-resource setting. Such quickly-deployable systems aim to support United Nations (UN) humanitarian relief efforts in parts of Africa with severely under-resourced languages. Our objective is to identify acoustic features that provide acceptable KWS performance in such environments. As…

    Submitted 12 July, 2019; v1 submitted 14 November, 2018; originally announced November 2018.

    Comments: 5 pages, 2 figures, 2 tables, 38 references, accepted at Interspeech 2019

  5. arXiv:1811.00293

    stat.ML cs.LG

    Critical initialisation for deep signal propagation in noisy rectifier neural networks

    Authors: Arnu Pretorius, Elan Van Biljon, Steve Kroon, Herman Kamper

    Abstract: Stochastic regularisation is an important weapon in the arsenal of a deep learning practitioner. However, despite recent theoretical advances, our understanding of how noise influences signal propagation in deep neural networks remains limited. By extending recent work based on mean field theory, we develop a new framework for signal propagation in stochastic regularised neural networks. Our noisy…

    Submitted 30 November, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 20 pages, 11 figures, accepted at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018)

  6. arXiv:1807.08666

    cs.CL stat.ML

    ASR-free CNN-DTW keyword spotting using multilingual bottleneck features for almost zero-resource languages

    Authors: Raghav Menon, Herman Kamper, Emre Yilmaz, John Quinn, Thomas Niesler

    Abstract: We consider multilingual bottleneck features (BNFs) for nearly zero-resource keyword spotting. This forms part of a United Nations effort using keyword spotting to support humanitarian relief programmes in parts of Africa where languages are severely under-resourced. We use 1920 isolated keywords (40 types, 34 minutes) as exemplars for dynamic time warping (DTW) template matching, which is perform…

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: 5 pages, 3 figures, 3 tables, 1 equation, accepted at SLTU 2018

  7. arXiv:1806.05413

    stat.ML cs.LG

    Learning Dynamics of Linear Denoising Autoencoders

    Authors: Arnu Pretorius, Steve Kroon, Herman Kamper

    Abstract: Denoising autoencoders (DAEs) have proven useful for unsupervised representation learning, but a thorough theoretical understanding is still lacking of how the input noise influences learning. Here we develop theory for how noise influences learning in DAEs. By focusing on linear DAEs, we are able to derive analytic expressions that exactly describe their learning dynamics. We verify our theoretic…

    Submitted 29 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: 14 pages, 7 figures, accepted at the 35th International Conference on Machine Learning (ICML) 2018