Showing 1–25 of 25 results for author: Krämer, N

  1. arXiv:2503.10279  [pdf, other]

    stat.ME cs.LG math.NA stat.CO

    Numerically robust Gaussian state estimation with singular observation noise

    Authors: Nicholas Krämer, Filip Tronarp

    Abstract: This article proposes numerically robust algorithms for Gaussian state estimation with singular observation noise. Our approach combines a series of basis changes with Bayes' rule, transforming the singular estimation problem into a nonsingular one with reduced state dimension. In addition to ensuring low runtime and numerical stability, our proposal facilitates marginal-likelihood computations an…

    Submitted 13 March, 2025; originally announced March 2025.

  2. arXiv:2410.10530  [pdf, ps, other]

    math.NA cs.LG stat.ML

    Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements

    Authors: Nicholas Krämer

    Abstract: Despite substantial progress in recent years, probabilistic solvers with adaptive step sizes still cannot solve memory-demanding differential equations -- unless we care only about a single point in time (which is far too restrictive; we want the whole time series). Counterintuitively, the culprit is the adaptivity itself: Its unpredictable memory demands easily exceed our machine's capabilities,…

    Submitted 3 July, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2409.20004  [pdf, other]

    math.NA cs.LG eess.SY stat.CO stat.ML

    Numerically Robust Fixed-Point Smoothing Without State Augmentation

    Authors: Nicholas Krämer

    Abstract: Practical implementations of Gaussian smoothing algorithms have received a great deal of attention in the last 60 years. However, almost all work focuses on estimating complete time series ("fixed-interval smoothing", $\mathcal{O}(K)$ memory) through variations of the Rauch--Tung--Striebel smoother, rarely on estimating the initial states ("fixed-point smoothing", $\mathcal{O}(1)$ memory). Sin…

    Submitted 23 January, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

  4. arXiv:2409.06752  [pdf, ps, other]

    cs.MS cs.LG math.NA

    A tutorial on automatic differentiation with complex numbers

    Authors: Nicholas Krämer

    Abstract: Automatic differentiation is everywhere, but there exists only minimal documentation of how it works in complex arithmetic beyond stating "derivatives in $\mathbb{C}^d$" $\cong$ "derivatives in $\mathbb{R}^{2d}$" and, at best, shallow references to Wirtinger calculus. Unfortunately, the equivalence $\mathbb{C}^d \cong \mathbb{R}^{2d}$ becomes insufficient as soon as we need to derive custom gradie…

    Submitted 10 December, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

  5. arXiv:2405.17277  [pdf, other]

    cs.LG math.NA stat.ML

    Gradients of Functions of Large Matrices

    Authors: Nicholas Krämer, Pablo Moreno-Muñoz, Hrittik Roy, Søren Hauberg

    Abstract: Tuning scientific and probabilistic machine learning models (for example, partial differential equations, Gaussian processes, or Bayesian neural networks) often relies on evaluating functions of matrices whose size grows with the data set or the number of parameters. While the state-of-the-art for evaluating these quantities is almost always based on Lanczos and Arnoldi iterations, the prese…

    Submitted 24 October, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2311.12184  [pdf, ps, other]

    math.OC

    Continuity of Filters for Discrete-Time Control Problems Defined by Explicit Equations

    Authors: Eugene A. Feinberg, Sayaka Ishizawa, Pavlo O. Kasyanov, David N. Kraemer

    Abstract: Discrete-time control systems whose dynamics and observations are described by stochastic equations are common in engineering, operations research, health care, and economics. For example, stochastic filtering problems are usually defined via stochastic equations. These problems can be reduced to Markov decision processes (MDPs) whose states are posterior state distributions, and transition probab…

    Submitted 3 February, 2025; v1 submitted 20 November, 2023; originally announced November 2023.

    MSC Class: Primary 90C40; Secondary 62C05; 90C39

  7. arXiv:2208.01565  [pdf, other]

    cs.LG math.NA

    Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs

    Authors: Emilia Magnani, Nicholas Krämer, Runa Eschenhagen, Lorenzo Rosasco, Philipp Hennig

    Abstract: Neural operators are a type of deep architecture that learns to solve (i.e. learns the nonlinear solution operator of) partial differential equations (PDEs). The current state of the art for these models does not provide explicit uncertainty quantification. This is arguably even more of a problem for this kind of task than elsewhere in machine learning, because the dynamical systems typically des…

    Submitted 2 August, 2022; originally announced August 2022.

  8. arXiv:2112.14898  [pdf, other]

    math.OC

    Continuity of Discounted Values and the Structure of Optimal Policies for Periodic-Review Inventory Control with Setup Costs

    Authors: Eugene A. Feinberg, David N. Kraemer

    Abstract: This paper proves continuity of value functions in discounted periodic-review single-commodity total-cost inventory control problems with continuous inventory levels, fixed ordering costs, possibly bounded inventory storage capacity, and possibly bounded order sizes for finite and infinite horizons. In each of these constrained models, the finite and infinite-horizon value functions are…

    Submitted 26 July, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

    MSC Class: 90B05

  9. arXiv:2112.02100  [pdf, other]

    cs.MS cs.LG math.NA

    ProbNum: Probabilistic Numerics in Python

    Authors: Jonathan Wenger, Nicholas Krämer, Marvin Pförtner, Jonathan Schmidt, Nathanael Bosch, Nina Effenberger, Johannes Zenn, Alexandra Gessner, Toni Karvonen, François-Xavier Briol, Maren Mahsereci, Philipp Hennig

    Abstract: Probabilistic numerical methods (PNMs) solve numerical problems via probabilistic inference. They have been developed for linear algebra, optimization, integration and differential equation simulation. PNMs naturally incorporate prior information about a problem and quantify uncertainty due to finite computational resources as well as stochastic input. In this paper, we present ProbNum: a Python l…

    Submitted 3 December, 2021; originally announced December 2021.

  10. arXiv:2110.11847  [pdf, other]

    math.NA stat.ML

    Probabilistic Numerical Method of Lines for Time-Dependent Partial Differential Equations

    Authors: Nicholas Krämer, Jonathan Schmidt, Philipp Hennig

    Abstract: This work develops a class of probabilistic algorithms for the numerical solution of nonlinear, time-dependent partial differential equations (PDEs). Current state-of-the-art PDE solvers treat the space- and time-dimensions separately, serially, and with black-box algorithms, which obscures the interactions between spatial and temporal approximation errors and misguides the quantification of the o…

    Submitted 9 March, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

  11. arXiv:2110.11812  [pdf, other]

    stat.ML cs.LG math.NA

    Probabilistic ODE Solutions in Millions of Dimensions

    Authors: Nicholas Krämer, Nathanael Bosch, Jonathan Schmidt, Philipp Hennig

    Abstract: Probabilistic solvers for ordinary differential equations (ODEs) have emerged as an efficient framework for uncertainty quantification and inference on dynamical systems. In this work, we explain the mathematical assumptions and detailed implementation schemes behind solving high-dimensional ODEs with a probabilistic numerical algorithm. This has not been possible before due to matrix-matrix ope…

    Submitted 22 October, 2021; originally announced October 2021.

  12. arXiv:2109.06299  [pdf, other]

    math.OC

    Continuity of Parametric Optima for Possibly Discontinuous Functions and Noncompact Decision Sets

    Authors: Eugene A. Feinberg, Pavlo O. Kasyanov, David N. Kraemer

    Abstract: This paper investigates continuity properties of value functions and solutions for parametric optimization problems. These problems are important in operations research, control, and economics because optimality equations are their particular cases. The classic fact, Berge's maximum theorem, gives sufficient conditions for continuity of value functions and upper semicontinuity of solution multifun…

    Submitted 13 September, 2021; originally announced September 2021.

    MSC Class: 49K27; 49K35; 90B05

  13. arXiv:2106.07761  [pdf, other]

    stat.ML cs.LG math.NA

    Linear-Time Probabilistic Solutions of Boundary Value Problems

    Authors: Nicholas Krämer, Philipp Hennig

    Abstract: We propose a fast algorithm for the probabilistic solution of boundary value problems (BVPs), which are ordinary differential equations subject to boundary conditions. In contrast to previous work, we introduce a Gauss--Markov prior and tailor it specifically to BVPs, which allows computing a posterior distribution over the solution in linear time, at a quality and cost comparable to that of well-…

    Submitted 14 June, 2021; originally announced June 2021.

  14. arXiv:2012.10106  [pdf, other]

    stat.ML cs.LG math.NA

    Stable Implementation of Probabilistic ODE Solvers

    Authors: Nicholas Krämer, Philipp Hennig

    Abstract: Probabilistic solvers for ordinary differential equations (ODEs) provide efficient quantification of numerical uncertainty associated with simulation of dynamical systems. Their convergence rates have been established by a growing body of theoretical analysis. However, these algorithms suffer from numerical instability when run at high order or with small step-sizes -- that is, exactly in the regi…

    Submitted 18 December, 2020; originally announced December 2020.

  15. arXiv:2002.09301  [pdf, other]

    stat.ML cs.LG math.NA stat.ME

    Differentiable Likelihoods for Fast Inversion of 'Likelihood-Free' Dynamical Systems

    Authors: Hans Kersting, Nicholas Krämer, Martin Schiegg, Christian Daniel, Michael Tiemann, Philipp Hennig

    Abstract: Likelihood-free (a.k.a. simulation-based) inference problems are inverse problems with expensive, or intractable, forward models. ODE inverse problems are commonly treated as likelihood-free, as their forward map has to be numerically approximated by an ODE solver. This, however, is not a fundamental constraint but just a lack of functionality in classic ODE solvers, which do not return a likeliho…

    Submitted 29 June, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 11 pages (+ 5 pages appendix), 6 figures

    Report number: Published at ICML 2020

  16. arXiv:1607.02387  [pdf, ps, other]

    math.ST stat.ML

    Convergence rates of Kernel Conjugate Gradient for random design regression

    Authors: Gilles Blanchard, Nicole Krämer

    Abstract: We prove statistical rates of convergence for kernel-based least squares regression from i.i.d. data using a conjugate gradient algorithm, where regularization against overfitting is obtained by early stopping. This method is related to Kernel Partial Least Squares, a regression method that combines supervised dimensionality reduction with least squares projection. Following the setting introduced…

    Submitted 8 July, 2016; originally announced July 2016.

  17. arXiv:1209.5356  [pdf, other]

    math.ST stat.AP stat.ME

    Total loss estimation using copula-based regression models

    Authors: Nicole Kraemer, Eike C. Brechmann, Daniel Silvestrini, Claudia Czado

    Abstract: We present a joint copula-based model for insurance claims and sizes. It uses bivariate copulae to accommodate the dependence between these quantities. We derive the general distribution of the policy loss without the restrictive assumption of independence. We illustrate that this distribution tends to be skewed and multi-modal, and that an independence assumption can lead to substantial bias…

    Submitted 24 September, 2012; originally announced September 2012.

  18. arXiv:1207.6460  [pdf, ps, other]

    math.NT

    Imaginärquadratische Einbettung von Ordnungen rationaler Quaternionenalgebren, und die nichtzyklischen endlichen Untergruppen der Bianchi-Gruppen

    Authors: Norbert Krämer

    Abstract: Let k be an imaginary quadratic number field, let F be a rational quaternion algebra and M an extension of F as a quaternion k-algebra. We are going to classify the F-orders which arise as an intersection of F with a maximal M-order; and we are going to prove that the discriminant of such an intersection determines uniquely the isomorphism type of the corresponding maximal M-order. Building on thi…

    Submitted 20 February, 2017; v1 submitted 27 July, 2012; originally announced July 2012.

    Comments: in German

  19. arXiv:1009.5839  [pdf, ps, other]

    math.ST stat.ME stat.ML

    Optimal learning rates for Kernel Conjugate Gradient regression

    Authors: Gilles Blanchard, Nicole Kraemer

    Abstract: We prove rates of convergence in the statistical sense for kernel-based least squares regression using a conjugate gradient algorithm, where regularization against overfitting is obtained by early stopping. This method is directly related to Kernel Partial Least Squares, a regression method that combines supervised dimensionality reduction with least squares projection. The rates depend on two key…

    Submitted 29 September, 2010; originally announced September 2010.

    Comments: to appear in Neural Information Processing Systems 2010

  20. arXiv:1002.4112  [pdf, ps, other]

    stat.ME math.ST stat.ML

    The Degrees of Freedom of Partial Least Squares Regression

    Authors: Nicole Kraemer, Masashi Sugiyama

    Abstract: The derivation of statistical properties for Partial Least Squares regression can be a challenging task. The reason is that the construction of latent components from the predictor variables also depends on the response variable. While this typically leads to good performance and interpretable models in practice, it makes the statistical analysis more involved. In this work, we study the intrinsic…

    Submitted 9 February, 2011; v1 submitted 22 February, 2010; originally announced February 2010.

    Comments: to appear in the Journal of the American Statistical Association

  21. arXiv:0902.4380  [pdf, ps, other]

    stat.ME math.ST stat.ML

    Kernel Partial Least Squares is Universally Consistent

    Authors: Gilles Blanchard, Nicole Kraemer

    Abstract: We prove the statistical consistency of kernel Partial Least Squares Regression applied to a bounded regression learning problem on a reproducing kernel Hilbert space. Partial Least Squares stands out from well-known classical approaches such as Ridge Regression or Principal Components Regression, as it is not defined as the solution of a global cost minimization procedure over a fixed model nor i…

    Submitted 14 January, 2010; v1 submitted 25 February, 2009; originally announced February 2009.

    Comments: 18 pages, no figures

    Journal ref: JMLR Workshop and Conference Proceedings 9 (AISTATS 2010) 57-64, 2010

  22. Penalized Partial Least Squares Based on B-Splines Transformations

    Authors: Nicole Kraemer, Anne-Laure Boulesteix, Gerhard Tutz

    Abstract: We propose a novel method to model nonlinear regression problems by adapting the principle of penalization to Partial Least Squares (PLS). Starting with a generalized additive model, we expand the additive component of each variable in terms of a generous number of B-spline basis functions. In order to prevent overfitting and to obtain smooth functions, we estimate the regression model by apply…

    Submitted 23 August, 2006; originally announced August 2006.

    MSC Class: 62J02; 62G05; 65F10

    Journal ref: Chemometrics and Intelligent Laboratory Systems, 94, p 60 - 69, 2010

  23. arXiv:math/0605751  [pdf, ps, other]

    math.ST

    Boosting for Functional Data

    Authors: Nicole Kraemer

    Abstract: We deal with the task of supervised learning when the data is of functional type. The crucial point is the choice of the appropriate fitting method (learner). Boosting is a stepwise technique that combines learners in such a way that the composite learner outperforms the single learner. This can be done by either reweighting the examples or with the help of a gradient descent technique. In this pa…

    Submitted 30 May, 2006; originally announced May 2006.

    MSC Class: 62G05; 62M20; 62G08

    Journal ref: Proceedings of the 17th International Conference on Computational Statistics, 2006, pp. 1121-1128

  24. arXiv:math/0503495  [pdf, ps, other]

    math.ST

    On the shrinkage behavior of partial least squares regression

    Authors: Nicole Kraemer

    Abstract: We present a formula for the shrinkage factors of the Partial Least Squares regression estimator and deduce some of their properties, in particular the known fact that some of the factors are >1. We investigate the effect of shrinkage factors for the mean squared error of linear estimators and illustrate that we cannot extend the results to nonlinear estimators. In particular, shrinkage factors…

    Submitted 23 March, 2005; originally announced March 2005.

    MSC Class: 62J07; 62H99

  25. arXiv:math/0302025  [pdf, ps, other]

    math.AG

    Local models for ramified unitary groups

    Authors: Nicole Kraemer

    Abstract: In this article, we study local models associated to certain Shimura varieties. In particular, we present a resolution of their singularities. As a consequence, we are able to determine the alternating semisimple trace of the geometric Frobenius on the sheaf of nearby cycles.

    Submitted 3 February, 2003; originally announced February 2003.

    MSC Class: 14G35; 11G18

    Journal ref: Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg (2003), Vol. 73, 67-80