Skip to main content

Showing 1–7 of 7 results for author: Klushyn, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.12102  [pdf, other

    cs.LG stat.ML

    BALI: Learning Neural Networks via Bayesian Layerwise Inference

    Authors: Richard Kurle, Alexej Klushyn, Ralf Herbrich

    Abstract: We introduce a new method for learning Bayesian neural networks, treating them as a stack of multivariate Bayesian linear regression models. The main idea is to infer the layerwise posterior exactly if we know the target outputs of each layer. We define these pseudo-targets as the layer outputs from the forward pass, updated by the backpropagated gradients of the objective function. The resulting… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  2. arXiv:2002.04881  [pdf, other

    stat.ML cs.LG

    Learning Flat Latent Manifolds with VAEs

    Authors: Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

    Abstract: Measuring the similarity between data points often requires domain knowledge, which can in parts be compensated by relying on unsupervised methods such as latent-variable models, where similarity/distance is estimated in a more compact latent space. Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captur… ▽ More

    Submitted 12 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Thirty-seventh International Conference on Machine Learning (ICML) 2020

    Journal ref: International Conference on Machine Learning 2020

  3. arXiv:1908.08750  [pdf, other

    stat.ML cs.LG

    Increasing the Generalisation Capacity of Conditional VAEs

    Authors: Alexej Klushyn, Nutan Chen, Botond Cseke, Justin Bayer, Patrick van der Smagt

    Abstract: We address the problem of one-to-many mappings in supervised learning, where a single instance has many different solutions of possibly equal cost. The framework of conditional variational autoencoders describes a class of methods to tackle such structured-prediction tasks by means of latent variables. We propose to incentivise informative latent representations for increasing the generalisation c… ▽ More

    Submitted 10 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  4. arXiv:1905.04982  [pdf, other

    stat.ML cs.LG

    Learning Hierarchical Priors in VAEs

    Authors: Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt

    Abstract: We propose to learn a hierarchical prior in the context of variational autoencoders to avoid the over-regularisation resulting from a standard normal prior distribution. To incentivise an informative latent representation of the data, we formulate the learning problem as a constrained optimisation problem by extending the Taming VAEs framework to two-level hierarchical models. We introduce a graph… ▽ More

    Submitted 5 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019 (spotlight)

  5. arXiv:1812.08284  [pdf, other

    stat.ML cs.LG

    Fast Approximate Geodesics for Deep Generative Models

    Authors: Nutan Chen, Francesco Ferroni, Alexej Klushyn, Alexandros Paraschos, Justin Bayer, Patrick van der Smagt

    Abstract: The length of the geodesic between two data points along a Riemannian manifold, induced by a deep generative model, yields a principled measure of similarity. Current approaches are limited to low-dimensional latent spaces, due to the computational complexity of solving a non-convex optimisation problem. We propose finding shortest paths in a finite graph of samples from the aggregate approximate… ▽ More

    Submitted 23 May, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 28th International Conference on Artificial Neural Networks, 2019

    Journal ref: 28th International Conference on Artificial Neural Networks, 2019

  6. arXiv:1808.02026  [pdf, other

    stat.ML cs.LG

    Active Learning based on Data Uncertainty and Model Sensitivity

    Authors: Nutan Chen, Alexej Klushyn, Alexandros Paraschos, Djalel Benbouzid, Patrick van der Smagt

    Abstract: Robots can rapidly acquire new skills from demonstrations. However, during generalisation of skills or transitioning across fundamentally different skills, it is unclear whether the robot has the necessary knowledge to perform the task. Failing to detect missing information often leads to abrupt movements or to collisions with the environment. Active learning can quantify the uncertainty of perfor… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

    Comments: Published on 2018 IEEE/RSJ International Conference on Intelligent Robots and System

  7. arXiv:1711.01204  [pdf, other

    stat.ML cs.LG

    Metrics for Deep Generative Models

    Authors: Nutan Chen, Alexej Klushyn, Richard Kurle, Xueyan Jiang, Justin Bayer, Patrick van der Smagt

    Abstract: Neural samplers such as variational autoencoders (VAEs) or generative adversarial networks (GANs) approximate distributions by transforming samples from a simple random source---the latent space---to samples from a more complex distribution represented by a dataset. While the manifold hypothesis implies that the density induced by a dataset contains large regions of low density, the training crite… ▽ More

    Submitted 8 February, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published on the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018

    Journal ref: The 21st International Conference on Artificial Intelligence and Statistics, 2018