Showing 1–4 of 4 results for author: Berglund, M

  1. arXiv:1903.11981  [pdf, other]

    cs.LG cs.RO stat.ML

    Regularizing Trajectory Optimization with Denoising Autoencoders

    Authors: Rinu Boney, Norman Di Palo, Mathias Berglund, Alexander Ilin, Juho Kannala, Antti Rasmus, Harri Valpola

    Abstract: Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize trajectory optimization by means of a denoising autoencoder that is trained on the same trajectories as the model of the environment. We show that the proposed reg…

    Submitted 25 December, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: NeurIPS 2019

  2. arXiv:1507.02672  [pdf, other]

    cs.NE cs.LG stat.ML

    Semi-Supervised Learning with Ladder Networks

    Authors: Antti Rasmus, Harri Valpola, Mikko Honkala, Mathias Berglund, Tapani Raiko

    Abstract: We combine supervised learning with unsupervised learning in deep neural networks. The proposed model is trained to simultaneously minimize the sum of supervised and unsupervised cost functions by backpropagation, avoiding the need for layer-wise pre-training. Our work builds on the Ladder network proposed by Valpola (2015), which we extend by combining the model with supervision. We show that the…

    Submitted 24 November, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: Revised denoising function, updated results, fixed typos

  3. arXiv:1406.2989  [pdf, other]

    stat.ML cs.LG cs.NE

    Techniques for Learning Binary Stochastic Feedforward Neural Networks

    Authors: Tapani Raiko, Mathias Berglund, Guillaume Alain, Laurent Dinh

    Abstract: Stochastic binary hidden units in a multi-layer perceptron (MLP) network give at least three potential benefits when compared to deterministic MLP networks. (1) They allow to learn one-to-many type of mappings. (2) They can be used in structured prediction problems, where modeling the internal structure of the output is important. (3) Stochasticity has been shown to be an excellent regularizer, wh…

    Submitted 9 April, 2015; v1 submitted 11 June, 2014; originally announced June 2014.

  4. arXiv:1312.6002  [pdf, other]

    cs.NE cs.LG stat.ML

    Stochastic Gradient Estimate Variance in Contrastive Divergence and Persistent Contrastive Divergence

    Authors: Mathias Berglund, Tapani Raiko

    Abstract: Contrastive Divergence (CD) and Persistent Contrastive Divergence (PCD) are popular methods for training the weights of Restricted Boltzmann Machines. However, both methods use an approximate method for sampling from the model distribution. As a side effect, these approximations yield significantly different biases and variances for stochastic gradient estimates of individual data points. It is we…

    Submitted 14 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR2014 Workshop Track submission. Rephrased parts of text. Results unchanged

    MSC Class: 62M45; ACM Class: I.2.6