Skip to main content

Showing 1–9 of 9 results for author: Pretorius, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2111.06721  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Multi-Agent Reinforcement Learning: Review and Open Problems

    Authors: St John Grimbly, Jonathan Shock, Arnu Pretorius

    Abstract: This paper serves to introduce the reader to the field of multi-agent reinforcement learning (MARL) and its intersection with methods from the study of causality. We highlight key challenges in MARL and discuss these in the context of how causal methods may assist in tackling them. We promote moving toward a 'causality first' perspective on MARL. Specifically, we argue that causality can offer imp… ▽ More

    Submitted 1 December, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted at Cooperative AI Workshop, NeurIPS 2021

  2. arXiv:2111.03904  [pdf, other

    cs.LG stat.AP

    On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

    Authors: Ibrahim Salihu Yusuf, Kale-ab Tessera, Thomas Tumiel, Zohra Slim, Amine Kerkeni, Sella Nevo, Arnu Pretorius

    Abstract: Desert locust outbreaks threaten the food security of a large part of Africa and have affected the livelihoods of millions of people over the years. Machine learning (ML) has been demonstrated as an effective approach to locust distribution modelling which could assist in early warning. ML requires a significant amount of labelled data to train. Most publicly available labelled data on locusts are… ▽ More

    Submitted 20 May, 2022; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: AI for Humanitarian Assistance and Disaster Response (AI+HADR) workshop, NeurIPS 2021

  3. arXiv:2110.05167  [pdf, other

    stat.ML cs.LG

    Robust and Scalable SDE Learning: A Functional Perspective

    Authors: Scott Cameron, Tyron Cameron, Arnu Pretorius, Stephen Roberts

    Abstract: Stochastic differential equations provide a rich class of flexible generative models, capable of describing a wide range of spatio-temporal processes. A host of recent work looks to learn data-representing SDEs, using neural networks and other flexible function approximators. Despite these advances, learning remains computationally expensive due to the sequential nature of SDE integrators. In this… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  4. arXiv:2001.06178  [pdf, other

    cs.LG stat.ML

    DNNs as Layers of Cooperating Classifiers

    Authors: Marelie H. Davel, Marthinus W. Theunissen, Arnold M. Pretorius, Etienne Barnard

    Abstract: A robust theoretical framework that can describe and predict the generalization ability of deep neural networks (DNNs) in general circumstances remains elusive. Classical attempts have produced complexity metrics that rely heavily on global measures of compactness and capacity with little investigation into the effects of sub-component collaboration. We demonstrate intriguing regularities in the a… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Accepted at AAAI-2020. The preprint contains additional figures and an appendix not included in the conference version. Main text remains unchanged

  5. arXiv:1910.10386  [pdf, other

    cs.LG stat.ML

    Stabilising priors for robust Bayesian deep learning

    Authors: Felix McGregor, Arnu Pretorius, Johan du Preez, Steve Kroon

    Abstract: Bayesian neural networks (BNNs) have developed into useful tools for probabilistic modelling due to recent advances in variational inference enabling large scale BNNs. However, BNNs remain brittle and hard to train, especially: (1) when using deep architectures consisting of many hidden layers and (2) in situations with large weight variances. We use signal propagation theory to quantify these cha… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 3 pages, accepted at Bayesian Deep learning workshop NeurIPS 2019

  6. arXiv:1910.05725  [pdf, other

    stat.ML cs.LG

    If dropout limits trainable depth, does critical initialisation still matter? A large-scale statistical analysis on ReLU networks

    Authors: Arnu Pretorius, Elan van Biljon, Benjamin van Niekerk, Ryan Eloff, Matthew Reynard, Steve James, Benjamin Rosman, Herman Kamper, Steve Kroon

    Abstract: Recent work in signal propagation theory has shown that dropout limits the depth to which information can propagate through a neural network. In this paper, we investigate the effect of initialisation on training speed and generalisation for ReLU networks within this depth limit. We ask the following research question: given that critical initialisation is crucial for training at large depth, if d… ▽ More

    Submitted 20 February, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, under consideration at Pattern Recognition Letters

  7. On the expected behaviour of noise regularised deep neural networks as Gaussian processes

    Authors: Arnu Pretorius, Herman Kamper, Steve Kroon

    Abstract: Recent work has established the equivalence between deep neural networks and Gaussian processes (GPs), resulting in so-called neural network Gaussian processes (NNGPs). The behaviour of these models depends on the initialisation of the corresponding network. In this work, we consider the impact of noise regularisation (e.g. dropout) on NNGPs, and relate their behaviour to signal propagation theory… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: 8 pages, 6 figures, preliminary work

    Journal ref: Pattern Recognition Letters 138 (2020) 75-81

  8. arXiv:1811.00293  [pdf, other

    stat.ML cs.LG

    Critical initialisation for deep signal propagation in noisy rectifier neural networks

    Authors: Arnu Pretorius, Elan Van Biljon, Steve Kroon, Herman Kamper

    Abstract: Stochastic regularisation is an important weapon in the arsenal of a deep learning practitioner. However, despite recent theoretical advances, our understanding of how noise influences signal propagation in deep neural networks remains limited. By extending recent work based on mean field theory, we develop a new framework for signal propagation in stochastic regularised neural networks. Our noisy… ▽ More

    Submitted 30 November, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 20 pages, 11 figures, accepted at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018)

  9. arXiv:1806.05413  [pdf, other

    stat.ML cs.LG

    Learning Dynamics of Linear Denoising Autoencoders

    Authors: Arnu Pretorius, Steve Kroon, Herman Kamper

    Abstract: Denoising autoencoders (DAEs) have proven useful for unsupervised representation learning, but a thorough theoretical understanding is still lacking of how the input noise influences learning. Here we develop theory for how noise influences learning in DAEs. By focusing on linear DAEs, we are able to derive analytic expressions that exactly describe their learning dynamics. We verify our theoretic… ▽ More

    Submitted 29 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: 14 pages, 7 figures, accepted at the 35th International Conference on Machine Learning (ICML) 2018