Skip to main content

Showing 1–18 of 18 results for author: Paquet, U

.
  1. arXiv:2411.03097  [pdf, other

    stat.ML cs.LG

    Correlating Variational Autoencoders Natively For Multi-View Imputation

    Authors: Ella S. C. Orme, Marina Evangelou, Ulrich Paquet

    Abstract: Multi-view data from the same source often exhibit correlation. This is mirrored in correlation between the latent spaces of separate variational autoencoders (VAEs) trained on each data-view. A multi-view VAE approach is proposed that incorporates a joint prior with a non-zero correlation structure between the latent spaces of the VAEs. By enforcing such correlation structure, more strongly corre… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: Accepted at 'UniReps: 2nd Edition of the Workshop on Unifying Representations in Neural Models', a workshop at NeurIPS 2024

  2. arXiv:2310.16410  [pdf, other

    cs.AI cs.HC cs.LG stat.ML

    Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero

    Authors: Lisa Schut, Nenad Tomasev, Tom McGrath, Demis Hassabis, Ulrich Paquet, Been Kim

    Abstract: Artificial Intelligence (AI) systems have made remarkable progress, attaining super-human performance across various domains. This presents us with an opportunity to further human knowledge and improve human expert performance by leveraging the hidden knowledge encoded within these highly performant AI systems. Yet, this knowledge is often hard to extract, and may be hard to understand or learn fr… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 61 pages, 29 figures

  3. arXiv:2112.06751  [pdf, other

    cs.AI cs.HC

    Role of Human-AI Interaction in Selective Prediction

    Authors: Elizabeth Bondi, Raphael Koster, Hannah Sheahan, Martin Chadwick, Yoram Bachrach, Taylan Cemgil, Ulrich Paquet, Krishnamurthy Dvijotham

    Abstract: Recent work has shown the potential benefit of selective prediction systems that can learn to defer to a human when the predictions of the AI are unreliable, particularly to improve the reliability of AI systems in high-stakes applications like healthcare or conservation. However, most prior work assumes that human behavior remains unchanged when they solve a prediction task as part of a human-AI… ▽ More

    Submitted 16 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Published in AAAI 2022; added link to data, small formatting corrections for camera-ready, including small changes to Fig 6-7 that do not change conclusions

  4. Acquisition of Chess Knowledge in AlphaZero

    Authors: Thomas McGrath, Andrei Kapishnikov, Nenad Tomašev, Adam Pearce, Demis Hassabis, Been Kim, Ulrich Paquet, Vladimir Kramnik

    Abstract: What is learned by sophisticated neural network agents such as AlphaZero? This question is of both scientific and practical interest. If the representations of strong neural networks bear no resemblance to human concepts, our ability to understand faithful explanations of their decisions will be restricted, ultimately limiting what we can achieve with neural network interpretability. In this work… ▽ More

    Submitted 18 August, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 69 pages, 44 figures

  5. arXiv:2009.04374  [pdf, other

    cs.AI stat.ML

    Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess

    Authors: Nenad Tomašev, Ulrich Paquet, Demis Hassabis, Vladimir Kramnik

    Abstract: It is non-trivial to design engaging and balanced sets of game rules. Modern chess has evolved over centuries, but without a similar recourse to history, the consequences of rule changes to game dynamics are difficult to predict. AlphaZero provides an alternative in silico means of game balance assessment. It is a system that can learn near-optimal strategies for any rule set from scratch, without… ▽ More

    Submitted 15 September, 2020; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: 98 pages. Game AZ-8 on page 39 (Stalemate=win variant) replaced from version 1

  6. arXiv:1907.12906  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Separation of Dynamics from Pixels

    Authors: Silvia Chiappa, Ulrich Paquet

    Abstract: We present an approach to learn the dynamics of multiple objects from image sequences in an unsupervised way. We introduce a probabilistic model that first generate noisy positions for each object through a separate linear state-space model, and then renders the positions of all objects in the same image through a highly non-linear process. Such a linear representation of the dynamics enables us t… ▽ More

    Submitted 20 July, 2019; originally announced July 2019.

    Journal ref: METRON, Springer, 2019

  7. arXiv:1812.07480  [pdf, other

    stat.ML cs.AI cs.LG

    A Factorial Mixture Prior for Compositional Deep Generative Models

    Authors: Ulrich Paquet, Sumedh K. Ghaisas, Olivier Tieleman

    Abstract: We assume that a high-dimensional datum, like an image, is a compositional expression of a set of properties, with a complicated non-linear relationship between the datum and its properties. This paper proposes a factorial mixture prior for capturing latent properties, thereby adding structured compositionality to deep generative models. The prior treats a latent vector as belonging to Cartesian p… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 16 pagers, 10 figures

  8. arXiv:1810.11893  [pdf, other

    stat.ML cs.LG

    An Efficient Implementation of Riemannian Manifold Hamiltonian Monte Carlo for Gaussian Process Models

    Authors: Ulrich Paquet, Marco Fraccaro

    Abstract: This technical report presents pseudo-code for a Riemannian manifold Hamiltonian Monte Carlo (RMHMC) method to efficiently simulate samples from $N$-dimensional posterior distributions $p(x|y)$, where $x \in R^N$ is drawn from a Gaussian Process (GP) prior, and observations $y_n$ are independent given $x_n$. Sufficient technical and algorithmic details are provided for the implementation of RMHMC… ▽ More

    Submitted 28 October, 2018; originally announced October 2018.

    Comments: Technical report accompanying arXiv:1604.01972, "An Adaptive Resample-Move Algorithm for Estimating Normalizing Constants" (2016)

  9. arXiv:1711.08028  [pdf, other

    cs.AI

    Recurrent Relational Networks

    Authors: Rasmus Berg Palm, Ulrich Paquet, Ole Winther

    Abstract: This paper is concerned with learning to solve tasks that require a chain of interdependent steps of relational inference, like answering complex questions about the relationships between objects, or solving puzzles where the smaller elements of a solution mutually constrain each other. We introduce the recurrent relational network, a general purpose module that operates on a graph representation… ▽ More

    Submitted 29 November, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: Accepted at NIPS 2018

  10. arXiv:1710.05741  [pdf, other

    stat.ML cs.LG

    A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning

    Authors: Marco Fraccaro, Simon Kamronn, Ulrich Paquet, Ole Winther

    Abstract: This paper takes a step towards temporal reasoning in a dynamically changing video, not in the pixel space that constitutes its frames, but in a latent space that describes the non-linear dynamics of the objects in its world. We introduce the Kalman variational auto-encoder, a framework for unsupervised learning of sequential data that disentangles two latent representations: an object's represent… ▽ More

    Submitted 30 October, 2017; v1 submitted 16 October, 2017; originally announced October 2017.

    Comments: NIPS 2017

  11. arXiv:1608.04245  [pdf, other

    stat.ML cs.LG

    The Bayesian Low-Rank Determinantal Point Process Mixture Model

    Authors: Mike Gartrell, Ulrich Paquet, Noam Koenigstein

    Abstract: Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-definite kernel matrix. Recent work has shown that using a low-rank factorization of this kernel provi… ▽ More

    Submitted 16 August, 2016; v1 submitted 15 August, 2016; originally announced August 2016.

    Comments: 9 pages, 6 figures. This article draws heavily from arXiv:1602.05436

  12. arXiv:1605.07571  [pdf, other

    stat.ML cs.LG

    Sequential Neural Models with Stochastic Layers

    Authors: Marco Fraccaro, Søren Kaae Sønderby, Ulrich Paquet, Ole Winther

    Abstract: How can we efficiently propagate uncertainty in a latent state representation with recurrent neural networks? This paper introduces stochastic recurrent neural networks which glue a deterministic recurrent neural network and a state space model together to form a stochastic and sequential neural generative model. The clear separation of deterministic and stochastic layers allows a structured varia… ▽ More

    Submitted 13 November, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: NIPS 2016

  13. arXiv:1604.01972  [pdf, other

    stat.ML

    An Adaptive Resample-Move Algorithm for Estimating Normalizing Constants

    Authors: Marco Fraccaro, Ulrich Paquet, Ole Winther

    Abstract: The estimation of normalizing constants is a fundamental step in probabilistic model comparison. Sequential Monte Carlo methods may be used for this task and have the advantage of being inherently parallelizable. However, the standard choice of using a fixed number of particles at each iteration is suboptimal because some steps will contribute disproportionately to the variance of the estimate. We… ▽ More

    Submitted 15 August, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: 11 pages, 5 figures

  14. arXiv:1602.05436  [pdf, other

    stat.ML cs.LG

    Low-Rank Factorization of Determinantal Point Processes for Recommendation

    Authors: Mike Gartrell, Ulrich Paquet, Noam Koenigstein

    Abstract: Determinantal point processes (DPPs) have garnered attention as an elegant probabilistic model of set diversity. They are useful for a number of subset selection tasks, including product recommendation. DPPs are parametrized by a positive semi-definite kernel matrix. In this work we present a new method for learning the DPP kernel from observed data using a low-rank factorization of this kernel. W… ▽ More

    Submitted 17 February, 2016; originally announced February 2016.

    Comments: 10 pages, 4 figures. Submitted to KDD 2016

  15. arXiv:1507.04505  [pdf, other

    stat.ML

    On the Convergence of Stochastic Variational Inference in Bayesian Networks

    Authors: Ulrich Paquet

    Abstract: We highlight a pitfall when applying stochastic variational inference to general Bayesian networks. For global random variables approximated by an exponential family distribution, natural gradient steps, commonly starting from a unit length step size, are averaged to convergence. This useful insight into the scaling of initial step sizes is lost when the approximation factorizes across a general B… ▽ More

    Submitted 16 July, 2015; originally announced July 2015.

    Comments: NIPS 2014 Workshop on Advances in Variational Inference. Montreal, Canada

  16. arXiv:1409.2824  [pdf, other

    stat.ML

    Scalable Bayesian Modelling of Paired Symbols

    Authors: Ulrich Paquet, Noam Koenigstein, Ole Winther

    Abstract: We present a novel, scalable and Bayesian approach to modelling the occurrence of pairs of symbols (i,j) drawn from a large vocabulary. Observed pairs are assumed to be generated by a simple popularity based selection process followed by censoring using a preference function. By basing inference on the well-founded principle of variational bounding, and using new site-independent bounds, we show h… ▽ More

    Submitted 10 September, 2014; v1 submitted 9 September, 2014; originally announced September 2014.

    Comments: 15 pages, 6 figures

  17. arXiv:1309.6786  [pdf, other

    stat.ML cs.LG

    One-class Collaborative Filtering with Random Graphs: Annotated Version

    Authors: Ulrich Paquet, Noam Koenigstein

    Abstract: The bane of one-class collaborative filtering is interpreting and modelling the latent signal from the missing class. In this paper we present a novel Bayesian generative model for implicit collaborative filtering. It forms a core component of the Xbox Live architecture, and unlike previous approaches, delineates the odds of a user disliking an item from simply not considering it. The latent signa… ▽ More

    Submitted 24 September, 2014; v1 submitted 26 September, 2013; originally announced September 2013.

    Comments: 11 pages, 7 figures. Detailed, annotated and expanded version of conference paper "One-class Collaborative Filtering with Random Graphs" (WWW 2013)

    ACM Class: G.3

  18. arXiv:1301.2724  [pdf, ps, other

    stat.ML

    Perturbative Corrections for Approximate Inference in Gaussian Latent Variable Models

    Authors: Manfred Opper, Ulrich Paquet, Ole Winther

    Abstract: Expectation Propagation (EP) provides a framework for approximate inference. When the model under consideration is over a latent Gaussian field, with the approximation being Gaussian, we show how these approximations can systematically be corrected. A perturbative expansion is made of the exact but intractable correction, and can be applied to the model's partition function and other moments of in… ▽ More

    Submitted 25 October, 2013; v1 submitted 12 January, 2013; originally announced January 2013.

    Comments: 45 pages, 10 figures