Showing 51–100 of 123 results for author: Hennig, P

  1. arXiv:2106.08717  [pdf, other]

    cs.LG cs.AI

    Probabilistic DAG Search

    Authors: Julia Grosse, Cheng Zhang, Philipp Hennig

    Abstract: Exciting contemporary machine learning problems have recently been phrased in the classic formalism of tree search -- most famously, the game of Go. Interestingly, the state-space underlying these sequential decision-making problems often possesses a more general latent structure than can be captured by a tree. In this work, we develop a probabilistic framework to exploit a search space's latent stru…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 10 pages, 8 figures, to be published at the Conference on Uncertainty in Artificial Intelligence (UAI) 2021

  2. arXiv:2106.07761  [pdf, other]

    stat.ML cs.LG math.NA

    Linear-Time Probabilistic Solutions of Boundary Value Problems

    Authors: Nicholas Krämer, Philipp Hennig

    Abstract: We propose a fast algorithm for the probabilistic solution of boundary value problems (BVPs), which are ordinary differential equations subject to boundary conditions. In contrast to previous work, we introduce a Gauss--Markov prior and tailor it specifically to BVPs, which allows computing a posterior distribution over the solution in linear time, at a quality and cost comparable to that of well-…

    Submitted 14 June, 2021; originally announced June 2021.

  3. arXiv:2106.02624  [pdf, other]

    cs.LG stat.ML

    ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

    Authors: Felix Dangel, Lukas Tatzel, Philipp Hennig

    Abstract: Curvature in the form of the Hessian or its generalized Gauss-Newton (GGN) approximation is valuable for algorithms that rely on a local model for the loss to train, compress, or explain deep networks. Existing methods based on implicit multiplication via automatic differentiation or Kronecker-factored block diagonal approximations do not consider noise in the mini-batch. We present ViViT, a curvature…

    Submitted 10 February, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Main text: 10 pages, 6 figures; Supplements: 26 pages, 27 figures, 5 tables

  4. arXiv:2105.06331  [pdf, other]

    cs.LG

    Informed Equation Learning

    Authors: Matthias Werner, Andrej Junginger, Philipp Hennig, Georg Martius

    Abstract: Distilling data into compact and interpretable analytic equations is one of the goals of science. Instead, contemporary supervised machine learning methods mostly produce unstructured and dense maps from input to output. Particularly in deep learning, this property is owed to the generic nature of simple standard link functions. To learn equations rather than maps, standard non-linearities can be…

    Submitted 13 May, 2021; originally announced May 2021.

  5. arXiv:2105.03109  [pdf, other]

    cs.LG stat.ML

    Laplace Matching for fast Approximate Inference in Latent Gaussian Models

    Authors: Marius Hobbhahn, Philipp Hennig

    Abstract: Bayesian inference on non-Gaussian data is often non-analytic and requires computationally expensive approximations such as sampling or variational inference. We propose an approximate inference framework primarily designed to be computationally cheap while still achieving high approximation quality. The concept, which we call Laplace Matching, involves closed-form, approximate, bi-directional tra…

    Submitted 11 October, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: Added experiments and clarifications; Currently under review at JMLR

    MSC Class: 68T37 ACM Class: G.3; I.2.0

  6. arXiv:2103.10153  [pdf, other]

    stat.ML cs.LG stat.ME

    A Probabilistic State Space Model for Joint Inference from Differential Equations and Data

    Authors: Jonathan Schmidt, Nicholas Krämer, Philipp Hennig

    Abstract: Mechanistic models with differential equations are a key component of scientific applications of machine learning. Inference in such models is usually computationally demanding, because it involves repeatedly solving the differential equation. The main problem here is that the numerical solver is hard to combine with standard inference techniques. Recent work in probabilistic numerics has develope…

    Submitted 5 July, 2022; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: 12 pages (+ 5 pages appendix), 7 figures. In: Advances in Neural Information Processing Systems (NeurIPS 2021)

  7. arXiv:2102.10880  [pdf, other]

    cs.LG

    A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

    Authors: Filip de Roos, Carl Jidling, Adrian Wills, Thomas Schön, Philipp Hennig

    Abstract: Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to…

    Submitted 22 February, 2021; originally announced February 2021.

  8. arXiv:2102.07542  [pdf, other]

    cs.LG

    High-Dimensional Gaussian Process Inference with Derivatives

    Authors: Filip de Roos, Alexandra Gessner, Philipp Hennig

    Abstract: Although it is widely known that Gaussian processes can be conditioned on observations of the gradient, this functionality is of limited use due to the prohibitive computational cost of $\mathcal{O}(N^3 D^3)$ in data points $N$ and dimension $D$. The dilemma of gradient observations is that a single one of them comes at the same cost as $D$ independent function evaluations, so the latter are often…

    Submitted 15 February, 2021; originally announced February 2021.

  9. arXiv:2102.06645  [pdf, other]

    cs.LG stat.ML

    Bayesian Quadrature on Riemannian Data Manifolds

    Authors: Christian Fröhlich, Alexandra Gessner, Philipp Hennig, Bernhard Schölkopf, Georgios Arvanitidis

    Abstract: Riemannian manifolds provide a principled way to model nonlinear geometric structure inherent in data. A Riemannian metric on said manifolds determines geometry-aware shortest paths and provides the means to define statistical models accordingly. However, these operations are typically computationally demanding. To ease this computational burden, we advocate probabilistic numerical methods for Rie…

    Submitted 10 June, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  10. arXiv:2102.06604  [pdf, other]

    cs.LG stat.ML

    Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks

    Authors: Frank Schneider, Felix Dangel, Philipp Hennig

    Abstract: When engineers train deep learning models, they are very much 'flying blind'. Commonly used methods for real-time training diagnostics, such as monitoring the train/test loss, are limited. Assessing a network's training process solely through these performance indicators is akin to debugging software without access to internal states through a debugger. To address this, we present Cockpit, a colle…

    Submitted 26 October, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: (NeurIPS 2021) Main text: 13 pages, 6 figures, 1 table; Supplements: 23 pages, 13 figures, 1 table, 1 listing

  11. arXiv:2012.10106  [pdf, other]

    stat.ML cs.LG math.NA

    Stable Implementation of Probabilistic ODE Solvers

    Authors: Nicholas Krämer, Philipp Hennig

    Abstract: Probabilistic solvers for ordinary differential equations (ODEs) provide efficient quantification of numerical uncertainty associated with simulation of dynamical systems. Their convergence rates have been established by a growing body of theoretical analysis. However, these algorithms suffer from numerical instability when run at high order or with small step-sizes -- that is, exactly in the regi…

    Submitted 18 December, 2020; originally announced December 2020.

  12. arXiv:2012.08202  [pdf, other]

    math.NA cs.LG stat.ML

    Calibrated Adaptive Probabilistic ODE Solvers

    Authors: Nathanael Bosch, Philipp Hennig, Filip Tronarp

    Abstract: Probabilistic solvers for ordinary differential equations assign a posterior measure to the solution of an initial value problem. The joint covariance of this distribution provides an estimate of the (global) approximation error. The contraction rate of this error estimate as a function of the solver's step size identifies it as a well-calibrated worst-case error, but its explicit numerical value…

    Submitted 22 February, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 17 pages, 10 figures

  13. arXiv:2011.04803  [pdf, other]

    cs.LG

    Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering

    Authors: Ricky T. Q. Chen, Dami Choi, Lukas Balles, David Duvenaud, Philipp Hennig

    Abstract: Standard first-order stochastic optimization algorithms base their updates solely on the average mini-batch gradient, and it has been shown that tracking additional quantities such as the curvature can help de-sensitize common hyperparameters. Based on this intuition, we explore the use of exact per-sample Hessian-vector products and gradients to construct optimizers that are self-tuning and hyper…

    Submitted 9 November, 2020; originally announced November 2020.

  14. arXiv:2010.09691  [pdf, other]

    cs.LG math.NA

    Probabilistic Linear Solvers for Machine Learning

    Authors: Jonathan Wenger, Philipp Hennig

    Abstract: Linear systems are the bedrock of virtually all numerical computation. Machine learning poses specific challenges for the solution of such systems due to their scale, characteristic structure, stochasticity and the central role of uncertainty in the field. Unifying earlier work we propose a class of probabilistic linear solvers which jointly infer the matrix, its inverse and the solution from matr…

    Submitted 22 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2020)

  15. Robot Learning with Crash Constraints

    Authors: Alonso Marco, Dominik Baumann, Majid Khadiv, Philipp Hennig, Ludovic Righetti, Sebastian Trimpe

    Abstract: In the past decade, numerous machine learning algorithms have been shown to successfully learn optimal policies to control real robotic systems. However, it is common to encounter failing behaviors as the learning loop progresses. Specifically, in robot applications where failing is undesired but not catastrophic, many algorithms struggle with leveraging data obtained from failures. This is usuall…

    Submitted 27 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 8 pages, 4 figures, 1 table, 1 algorithm. Accepted for publication in IEEE Robotics and Automation Letters (RA-L). Video demonstration of the experiments available at https://youtu.be/RAiIo0l6_rE . Algorithm implementation available at https://github.com/alonrot/classified_regression.git

    Journal ref: IEEE Robotics and Automation Letters, Vol 6(2), pp. 1439-1446, 2021

  16. arXiv:2010.02720  [pdf, other]

    cs.LG

    Learnable Uncertainty under Laplace Approximations

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: Laplace approximations are classic, computationally lightweight means for constructing Bayesian neural networks (BNNs). As in other approximate BNNs, one cannot necessarily expect the induced predictive uncertainty to be calibrated. Here we develop a formalism to explicitly "train" the uncertainty in a decoupled way to the prediction itself. To this end, we introduce uncertainty units for Laplace-…

    Submitted 7 June, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: UAI 2021

  17. arXiv:2010.02709  [pdf, other]

    cs.LG stat.ML

    An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: A Bayesian treatment can mitigate overconfidence in ReLU nets around the training data. But far away from them, ReLU Bayesian neural networks (BNNs) can still underestimate uncertainty and thus be asymptotically overconfident. This issue arises since the output variance of a BNN with finitely many features is quadratic in the distance from the data region. Meanwhile, Bayesian linear models with Re…

    Submitted 24 January, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2021

  18. arXiv:2007.15386  [pdf, other]

    cs.LG stat.ML

    ResNet After All? Neural ODEs and Their Numerical Solution

    Authors: Katharina Ott, Prateek Katiyar, Philipp Hennig, Michael Tiemann

    Abstract: A key appeal of the recently proposed Neural Ordinary Differential Equation (ODE) framework is that it seems to provide a continuous-time extension of discrete residual neural networks. As we show herein, though, trained Neural ODE models actually depend on the specific numerical method used during training. If the trained model is supposed to be a flow generated from an ODE, it should be possible…

    Submitted 10 September, 2023; v1 submitted 30 July, 2020; originally announced July 2020.

  19. arXiv:2007.01547  [pdf, other]

    cs.LG stat.ML

    Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers

    Authors: Robin M. Schmidt, Frank Schneider, Philipp Hennig

    Abstract: Choosing the optimizer is considered to be among the most crucial design decisions in deep learning, and it is not an easy one. The growing literature now lists hundreds of optimization methods. In the absence of clear theoretical guidance and conclusive empirical evidence, the decision is often made based on anecdotes. In this work, we aim to replace these anecdotes, if not with a conclusive rank…

    Submitted 10 August, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Raw results: https://github.com/SirRob1997/Crowded-Valley---Results

  20. arXiv:2004.00623  [pdf, other]

    math.NA stat.ME stat.ML

    Bayesian ODE Solvers: The Maximum A Posteriori Estimate

    Authors: Filip Tronarp, Simo Sarkka, Philipp Hennig

    Abstract: It has recently been established that the numerical solution of ordinary differential equations can be posed as a nonlinear Bayesian inference problem, which can be approximately solved via Gaussian filtering and smoothing, whenever a Gauss--Markov prior is used. In this paper the class of $ν$ times differentiable linear time invariant Gauss--Markov priors is considered. A taxonomy of Gaussian est…

    Submitted 12 January, 2021; v1 submitted 1 April, 2020; originally announced April 2020.

  21. arXiv:2003.01227  [pdf, other]

    cs.LG stat.ML

    Fast Predictive Uncertainty for Classification with Bayesian Deep Networks

    Authors: Marius Hobbhahn, Agustinus Kristiadi, Philipp Hennig

    Abstract: In Bayesian Deep Learning, distributions over the output of classification neural networks are often approximated by first constructing a Gaussian distribution over the weights, then sampling from it to receive a distribution over the softmax outputs. This is costly. We reconsider old work (Laplace Bridge) to construct a Dirichlet approximation of this softmax output distribution, which yields an…

    Submitted 31 May, 2022; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Updated version. Accepted for publication at UAI2022

  22. arXiv:2002.10118  [pdf, other]

    stat.ML cs.LG

    Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: The point estimates of ReLU classification networks---arguably the most widely used neural network architecture---have been shown to yield arbitrarily high confidence far away from the training data. This architecture, in conjunction with a maximum a posteriori estimation scheme, is thus neither calibrated nor robust. Approximate Bayesian inference has been empirically demonstrated to improve predicti…

    Submitted 17 July, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  23. arXiv:2002.09301  [pdf, other]

    stat.ML cs.LG math.NA stat.ME

    Differentiable Likelihoods for Fast Inversion of 'Likelihood-Free' Dynamical Systems

    Authors: Hans Kersting, Nicholas Krämer, Martin Schiegg, Christian Daniel, Michael Tiemann, Philipp Hennig

    Abstract: Likelihood-free (a.k.a. simulation-based) inference problems are inverse problems with expensive, or intractable, forward models. ODE inverse problems are commonly treated as likelihood-free, as their forward map has to be numerically approximated by an ODE solver. This, however, is not a fundamental constraint but just a lack of functionality in classic ODE solvers, which do not return a likeliho…

    Submitted 29 June, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 11 pages (+ 5 pages appendix), 6 figures

    Report number: Published at ICML 2020

  24. arXiv:2001.04884  [pdf, other]

    physics.med-ph

    Analytical probabilistic modeling of dose-volume histograms

    Authors: Niklas Wahl, Philipp Hennig, Hans-Peter Wieser, Mark Bangert

    Abstract: Radiotherapy is sensitive to executional and preparational uncertainties that propagate to uncertainty in dose and plan quality indicators like dose-volume histograms (DVHs). Current approaches to quantify and mitigate such uncertainties rely on explicitly computed error scenarios and are thus subject to statistical uncertainty and limitations regarding the underlying uncertainty model. Here we pr…

    Submitted 16 June, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 25 pages, 5 figures

    Journal ref: Med. Phys. (2020), 47(10):5260-5273

  25. arXiv:1912.10985  [pdf, other]

    cs.LG stat.ML

    BackPACK: Packing more into backprop

    Authors: Felix Dangel, Frederik Kunstner, Philipp Hennig

    Abstract: Automatic differentiation frameworks are optimized for exactly one thing: computing the average mini-batch gradient. Yet, other quantities such as the variance of the mini-batch gradients or many approximations to the Hessian can, in theory, be computed efficiently, and at the same time as the gradient. While these quantities are of great interest to researchers and practitioners, current deep-lea…

    Submitted 15 February, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: Main text: 10 pages, 7 figures, 1 table; Supplements: 10 pages, 4 figures, 3 tables

  26. arXiv:1911.06048  [pdf, other]

    cs.LG stat.ML

    Conjugate Gradients for Kernel Machines

    Authors: Simon Bartels, Philipp Hennig

    Abstract: Regularized least-squares (kernel-ridge / Gaussian process) regression is a fundamental algorithm of statistics and machine learning. Because generic algorithms for the exact solution have cubic complexity in the number of datapoints, large datasets require resorting to approximations. In this work, the computation of the least-squares prediction is itself treated as a probabilistic inference prob…

    Submitted 14 November, 2019; originally announced November 2019.

  27. arXiv:1910.09328  [pdf, other]

    cs.LG stat.ML

    Integrals over Gaussians under Linear Domain Constraints

    Authors: Alexandra Gessner, Oindrila Kanjilal, Philipp Hennig

    Abstract: Integrals of linearly constrained multivariate Gaussian densities are a frequent problem in machine learning and statistics, arising in tasks like generalized linear models and Bayesian optimization. Yet they are notoriously hard to compute, and to further complicate matters, the numerical values of such integrals may be very small. We present an efficient black-box algorithm that exploits geometr…

    Submitted 2 March, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  28. arXiv:1909.02345  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Phase-Field Modelling of Interface Failure in Brittle Materials

    Authors: Arne Claus Hansen-Dörr, René de Borst, Paul Hennig, Markus Kästner

    Abstract: A phase-field approach is proposed for interface failure between two possibly dissimilar materials. The discrete adhesive interface is regularised over a finite width. Due to the use of a regularised crack model for the bulk material, an interaction between the length scales of the crack and the interface can occur. An analytic one-dimensional analysis has been carried out to quantify this effect…

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: 18 pages, 15 figures

    Journal ref: Computer Methods in Applied Mechanics and Engineering Volume 346, 1 April 2019, Pages 25-42

  29. arXiv:1907.10383  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Classified Regression for Bayesian Optimization: Robot Learning with Unknown Penalties

    Authors: Alonso Marco, Dominik Baumann, Philipp Hennig, Sebastian Trimpe

    Abstract: Learning robot controllers by minimizing a black-box objective cost using Bayesian optimization (BO) can be time-consuming and challenging. It is very often the case that some roll-outs result in failure behaviors, causing premature experiment detention. In such cases, the designer is forced to decide on heuristic cost penalties because the acquired data is often scarce, or not comparable with tha…

    Submitted 9 November, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: This paper was submitted to JMLR in 2018 and rejected. Currently, it is not published, nor under review in any conference or journal venue

  30. arXiv:1906.11655  [pdf, other]

    cs.LG stat.ML

    Uncertainty Estimates for Ordinal Embeddings

    Authors: Michael Lohaus, Philipp Hennig, Ulrike von Luxburg

    Abstract: To investigate objects without a describable notion of distance, one can gather ordinal information by asking triplet comparisons of the form "Is object $x$ closer to $y$ or is $x$ closer to $z$?" In order to learn from such data, the objects are typically embedded in a Euclidean space while satisfying as many triplet comparisons as possible. In this paper, we introduce empirical uncertainty estim…

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: 16 pages

  31. arXiv:1905.12558  [pdf, other]

    cs.LG stat.ML

    Limitations of the Empirical Fisher Approximation for Natural Gradient Descent

    Authors: Frederik Kunstner, Lukas Balles, Philipp Hennig

    Abstract: Natural gradient descent, which preconditions a gradient descent update with the Fisher information matrix of the underlying statistical model, is a way to capture partial second-order information. Several highly visible works have advocated an approximation known as the empirical Fisher, drawing connections between approximate second-order methods and heuristics like Adam. We dispute this argumen…

    Submitted 8 June, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: V3: Minor corrections (typographic errors)

  32. arXiv:1905.10271  [pdf, other]

    stat.ML cs.LG math.NA stat.CO

    Convergence Guarantees for Adaptive Bayesian Quadrature Methods

    Authors: Motonobu Kanagawa, Philipp Hennig

    Abstract: Adaptive Bayesian quadrature (ABQ) is a powerful approach to numerical integration that empirically compares favorably with Monte Carlo integration on problems of medium dimensionality (where non-adaptive quadrature is not competitive). Its key ingredient is an acquisition function that changes as a function of previously collected values of the integrand. While this adaptivity appears to be empir…

    Submitted 28 October, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: To appear in NeurIPS 2019

  33. arXiv:1903.05499  [pdf, other]

    cs.LG stat.ML

    DeepOBS: A Deep Learning Optimizer Benchmark Suite

    Authors: Frank Schneider, Lukas Balles, Philipp Hennig

    Abstract: Because the choice and tuning of the optimizer affect the speed, and ultimately the performance of deep learning, there is significant past and recent research in this area. Yet, perhaps surprisingly, there is no generally agreed-upon protocol for the quantitative and reproducible evaluation of optimization strategies for deep learning. We suggest routines and benchmarks for stochastic optimizati…

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: Accepted at ICLR 2019. 9 pages, 3 figures, 2 tables

  34. arXiv:1902.07557  [pdf, other]

    cs.LG stat.ML

    Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization

    Authors: Filip de Roos, Philipp Hennig

    Abstract: Pre-conditioning is a well-known concept that can significantly improve the convergence of optimization algorithms. For noise-free problems, where good pre-conditioners are not known a priori, iterative linear algebra methods offer one way to efficiently construct them. For the stochastic optimization problems that dominate contemporary machine learning, however, this approach is not readily avail…

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: Conference

  35. arXiv:1902.01813  [pdf, other]

    cs.LG stat.ML

    Modular Block-diagonal Curvature Approximations for Feedforward Architectures

    Authors: Felix Dangel, Stefan Harmeling, Philipp Hennig

    Abstract: We propose a modular extension of backpropagation for the computation of block-diagonal approximations to various curvature matrices of the training objective (in particular, the Hessian, generalized Gauss-Newton, and positive-curvature Hessian). The approach reduces the otherwise tedious manual derivation of these matrices into local modules, and is easy to integrate into existing machine learnin…

    Submitted 28 February, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: 9 pages, 5 figures, 1 table, supplements included (13 pages, 6 figures, 2 tables)

  36. arXiv:1901.07229  [pdf, other]

    stat.ML cs.LG

    Fast and Robust Shortest Paths on Manifolds Learned from Data

    Authors: Georgios Arvanitidis, Søren Hauberg, Philipp Hennig, Michael Schober

    Abstract: We propose a fast, simple and robust algorithm for computing shortest paths and distances on Riemannian manifolds learned from data. This amounts to solving a system of ordinary differential equations (ODEs) subject to boundary conditions. Here standard solvers perform poorly because they require well-behaved Jacobians of the ODE, and usually, manifolds learned from data imply unstable and ill-con…

    Submitted 22 January, 2019; originally announced January 2019.

    Comments: Accepted at Artificial Intelligence and Statistics (AISTATS) 2019

  37. arXiv:1812.04346  [pdf]

    cs.SI cs.LG stat.ML

    Towards Automatic Personality Prediction Using Facebook Like Categories

    Authors: Raad Bin Tareaf, Philipp Berger, Patrick Hennig, Christoph Meinel

    Abstract: We demonstrate that effortlessly accessible digital records of behavior such as Facebook Likes can be obtained and utilized to automatically distinguish a wide range of highly delicate personal traits including: life satisfaction, cultural ethnicity, political views, age, gender and personality traits. The analysis presented is based on a dataset of over 738,000 users who conferred their Facebook Lik…

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: 14 pages, 6 figures, conference

  38. arXiv:1810.03440  [pdf, other]

    stat.ME stat.CO stat.ML

    Probabilistic Solutions To Ordinary Differential Equations As Non-Linear Bayesian Filtering: A New Perspective

    Authors: Filip Tronarp, Hans Kersting, Simo Särkkä, Philipp Hennig

    Abstract: We formulate probabilistic numerical approximations to solutions of ordinary differential equations (ODEs) as problems in Gaussian process (GP) regression with non-linear measurement functions. This is achieved by defining the measurement sequence to consist of the observations of the difference between the derivative of the GP and the vector field evaluated at the GP---which are all identically z…

    Submitted 24 April, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

  39. arXiv:1810.03398  [pdf, other]

    stat.CO math.NA

    Probabilistic Linear Solvers: A Unifying View

    Authors: Simon Bartels, Jon Cockayne, Ilse C. F. Ipsen, Philipp Hennig

    Abstract: Several recent works have developed a new, probabilistic interpretation for numerical algorithms solving linear systems in which the solution is inferred in a Bayesian framework, either directly or by inferring the unknown action of the matrix inverse. These approaches have typically focused on replicating the behavior of the conjugate gradient method as a prototypical iterative method. In this wo…

    Submitted 17 October, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

  40. arXiv:1807.09737  [pdf, other]

    math.NA cs.LG math.ST stat.CO stat.ML

    Convergence Rates of Gaussian ODE Filters

    Authors: Hans Kersting, T. J. Sullivan, Philipp Hennig

    Abstract: A recently-introduced class of probabilistic (uncertainty-aware) solvers for ordinary differential equations (ODEs) applies Gaussian (Kalman) filtering to initial value problems. These methods model the true solution $x$ and its first $q$ derivatives \emph{a priori} as a Gauss--Markov process $\boldsymbol{X}$, which is then iteratively conditioned on information about $\dot{x}$. This article estab…

    Submitted 17 July, 2020; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: 26 pages, 5 figures

    MSC Class: 60G15; 60J70; 62G20; 62M05; 65C20; 65L05

  41. arXiv:1807.02582  [pdf, other]

    stat.ML cs.LG

    Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences

    Authors: Motonobu Kanagawa, Philipp Hennig, Dino Sejdinovic, Bharath K Sriperumbudur

    Abstract: This paper is an attempt to bridge the conceptual gaps between researchers working on the two widely used approaches based on positive definite kernels: Bayesian learning or inference using Gaussian processes on the one side, and frequentist kernel methods based on reproducing kernel Hilbert spaces on the other. It is widely known in machine learning that these two formalisms are closely related;…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: 64 pages

  42. arXiv:1709.08471  [pdf, other]

    math.NA cs.AI

    Bayesian Filtering for ODEs with Bounded Derivatives

    Authors: Emilia Magnani, Hans Kersting, Michael Schober, Philipp Hennig

    Abstract: Recently there has been increasing interest in probabilistic solvers for ordinary differential equations (ODEs) that return full probability measures, instead of point estimates, over the solution and can incorporate uncertainty over the ODE at hand, e.g. if the vector field or the initial value is only approximately known or evaluable. The ODE filter proposed in recent work models the solution of…

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 14 pages, 9 figures

  43. arXiv:1709.07089  [pdf, other]

    eess.SY cs.LG stat.ML

    On the Design of LQR Kernels for Efficient Controller Learning

    Authors: Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe

    Abstract: Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As…

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: 8 pages, 5 figures, to appear in 56th IEEE Conference on Decision and Control (CDC 2017)

  44. arXiv:1706.10234  [pdf, other]

    stat.ML cs.AI cs.LG

    Probabilistic Active Learning of Functions in Structural Causal Models

    Authors: Paul K. Rubenstein, Ilya Tolstikhin, Philipp Hennig, Bernhard Schoelkopf

    Abstract: We consider the problem of learning the functions computing children from parents in a Structural Causal Model once the underlying causal graph has been identified. This is in some sense the second step after causal discovery. Taking a probabilistic approach to estimating these functions, we derive a natural myopic active learning scheme that identifies the intervention which is optimally informat…

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: 9 pages main text + 4 pages supplement

  45. arXiv:1706.00241  [pdf, other]

    cs.LG math.NA stat.ML

    Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning

    Authors: Filip de Roos, Philipp Hennig

    Abstract: Solving symmetric positive definite linear problems is a fundamental computational task in machine learning. The exact solution, famously, is cubically expensive in the size of the matrix. To alleviate this problem, several linear-time approximations, such as spectral and inducing-point methods, have been suggested and are now in wide use. These are low-rank approximations that choose the low-rank s…

    Submitted 1 June, 2017; originally announced June 2017.

  46. arXiv:1705.07774  [pdf, other]

    cs.LG stat.ML

    Dissecting Adam: The Sign, Magnitude and Variance of Stochastic Gradients

    Authors: Lukas Balles, Philipp Hennig

    Abstract: The ADAM optimizer is exceedingly popular in the deep learning community. Often it works very well, sometimes it doesn't. Why? We interpret ADAM as a combination of two aspects: for each weight, the update direction is determined by the sign of stochastic gradients, whereas the update magnitude is determined by an estimate of their relative variance. We disentangle these two aspects and analyze th…

    Submitted 13 December, 2020; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: Presented at the 35th International Conference on Machine Learning (ICML), 2018

  47. arXiv:1703.10034  [pdf, other]

    cs.LG stat.ML

    Probabilistic Line Searches for Stochastic Optimization

    Authors: Maren Mahsereci, Philipp Hennig

    Abstract: In deterministic optimization, line searches are a standard tool ensuring stability and efficiency. Where only stochastic gradients are available, no direct equivalent has so far been formulated, because uncertain gradients do not allow for a strict sequence of decisions collapsing the search space. We construct a probabilistic line search by combining the structure of existing deterministic metho…

    Submitted 30 June, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

    Comments: Extended version of the NIPS '15 conference paper, includes detailed pseudo-code, 59 pages, 35 figures

  48. arXiv:1703.09580  [pdf, other]

    cs.LG stat.ML

    Early Stopping without a Validation Set

    Authors: Maren Mahsereci, Lukas Balles, Christoph Lassner, Philipp Hennig

    Abstract: Early stopping is a widely used technique to prevent poor generalization performance when training an over-expressive model by means of gradient-based optimization. To find a good point to halt the optimizer, a common practice is to split the dataset into a training and a smaller validation set to obtain an ongoing estimate of the generalization performance. We propose a novel early stopping crite…

    Submitted 6 June, 2017; v1 submitted 28 March, 2017; originally announced March 2017.

    Comments: 16 pages, 10 figures

  49. arXiv:1703.01250  [pdf, other]

    cs.RO cs.LG eess.SY

    Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

    Authors: Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

    Abstract: In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robot…

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

  50. arXiv:1612.05086  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Coupling Adaptive Batch Sizes with Learning Rates

    Authors: Lukas Balles, Javier Romero, Philipp Hennig

    Abstract: Mini-batch stochastic gradient descent and variants thereof have become standard for large-scale empirical risk minimization like the training of neural networks. These methods are usually used with a constant batch size chosen by simple empirical inspection. The batch size significantly influences the behavior of the stochastic optimization algorithm, though, since it determines the variance of t…

    Submitted 28 June, 2017; v1 submitted 15 December, 2016; originally announced December 2016.

    Comments: Thirty-Third Conference on Uncertainty in Artificial Intelligence (UAI), 2017, (accepted)