
Showing 1–18 of 18 results for author: Nüsken, N

  1. arXiv:2504.03461  [pdf, ps, other]

    stat.ML cs.LG math.PR

    Conditioning Diffusions Using Malliavin Calculus

    Authors: Jakiw Pidstrigach, Elizabeth Baker, Carles Domingo-Enrich, George Deligiannidis, Nikolas Nüsken

    Abstract: In generative modelling and stochastic optimal control, a central computational task is to modify a reference diffusion process to maximise a given terminal-time reward. Most existing methods require this reward to be differentiable, using gradients to steer the diffusion towards favourable outcomes. However, in many practical settings, like diffusion bridges, the reward is singular, taking an inf…

    Submitted 6 June, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    ACM Class: G.3

  2. arXiv:2409.01464  [pdf, other]

    stat.ML cs.LG math.NA math.ST stat.ME

    Stein transport for Bayesian inference

    Authors: Nikolas Nüsken

    Abstract: We introduce $\textit{Stein transport}$, a novel methodology for Bayesian inference designed to efficiently push an ensemble of particles along a predefined curve of tempered probability distributions. The driving vector field is chosen from a reproducing kernel Hilbert space and can be derived either through a suitable kernel ridge regression formulation or as an infinitesimal optimal transport m…

    Submitted 28 November, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

  3. arXiv:2405.14373  [pdf, ps, other]

    math.PR math.NA stat.CO

    Skew-symmetric schemes for stochastic differential equations with non-Lipschitz drift: an unadjusted Barker algorithm

    Authors: Yuga Iguchi, Samuel Livingstone, Nikolas Nüsken, Giorgos Vasdekis, Rui-Yang Zhang

    Abstract: We propose a new simple and explicit numerical scheme for time-homogeneous stochastic differential equations. The scheme is based on sampling increments at each time step from a skew-symmetric probability distribution, with the level of skewness determined by the drift and volatility of the underlying process. We show that as the step-size decreases the scheme converges weakly to the diffusion of…

    Submitted 7 July, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 42 pages, 3 figures. Keywords: Stochastic differential equations, Skew-symmetric distributions, Sampling algorithms, Markov Chain Monte Carlo

    MSC Class: 60H35; 65C05; 65C30; 65C40

  4. arXiv:2401.12967  [pdf, other]

    math.ST math.NA stat.ME

    Measure transport with kernel mean embeddings

    Authors: L. Wang, N. Nüsken

    Abstract: Kalman filters constitute a scalable and robust methodology for approximate Bayesian inference, matching first and second order moments of the target posterior. To improve the accuracy in nonlinear and non-Gaussian settings, we extend this principle to include more or different characteristics, based on kernel mean embeddings (KMEs) of probability measures into reproducing kernel Hilbert spaces. F…

    Submitted 2 September, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  5. arXiv:2308.07663  [pdf, ps, other]

    cs.IT math.DS

    Coherent set identification via direct low rank maximum likelihood estimation

    Authors: Robert Polzin, Ilja Klebanov, Nikolas Nüsken, Péter Koltai

    Abstract: We analyze connections between two low rank modeling approaches from the last decade for treating dynamical data. The first one is the coherence problem (or coherent set approach), where groups of states are sought that evolve under the action of a stochastic transition matrix in a way maximally distinguishable from other groups. The second one is a low rank factorization approach for stochastic m…

    Submitted 1 October, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    MSC Class: 65F55; 62M05; 37M10; 15A23; 60J22

  6. arXiv:2307.15496  [pdf, other]

    cs.LG math.NA math.PR stat.ML

    From continuous-time formulations to discretization schemes: tensor trains and robust regression for BSDEs and parabolic PDEs

    Authors: Lorenz Richter, Leon Sallandt, Nikolas Nüsken

    Abstract: The numerical approximation of partial differential equations (PDEs) poses formidable challenges in high dimensions since classical grid-based methods suffer from the so-called curse of dimensionality. Recent attempts rely on a combination of Monte Carlo methods and variational formulations, using neural networks for function approximation. Extending previous work (Richter et al., 2021), we argue…

    Submitted 28 July, 2023; originally announced July 2023.

  7. arXiv:2112.03749  [pdf, other]

    math.NA math.PR stat.ML

    Interpolating between BSDEs and PINNs: deep learning for elliptic and parabolic boundary value problems

    Authors: Nikolas Nüsken, Lorenz Richter

    Abstract: Solving high-dimensional partial differential equations is a recurrent challenge in economics, science and engineering. In recent years, a great number of computational approaches have been developed, most of them relying on a combination of Monte Carlo sampling and deep learning based approximation. For elliptic and parabolic problems, existing methods can broadly be classified into those resting…

    Submitted 29 January, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

  8. arXiv:2107.06621  [pdf, other]

    math.PR math.NA math.ST

    Rough McKean-Vlasov dynamics for robust ensemble Kalman filtering

    Authors: Michele Coghi, Torstein Nilssen, Nikolas Nüsken, Sebastian Reich

    Abstract: Motivated by the challenge of incorporating data into misspecified and multiscale dynamical models, we study a McKean-Vlasov equation that contains the data stream as a common driving rough path. This setting allows us to prove well-posedness as well as continuity with respect to the driver in an appropriate rough-path topology. The latter property is key in our subsequent development of a robust…

    Submitted 20 January, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: 44 pages, 7 figures

    MSC Class: 60L20; 60L90; 60H10; 60F99; 65C35; 62M05

  9. arXiv:2102.12956  [pdf, other]

    stat.ML cs.LG math.AP math.NA math.PR math.ST

    Stein Variational Gradient Descent: many-particle and long-time asymptotics

    Authors: Nikolas Nüsken, D. R. Michiel Renger

    Abstract: Stein variational gradient descent (SVGD) refers to a class of methods for Bayesian inference based on interacting particle systems. In this paper, we consider the originally proposed deterministic dynamics as well as a stochastic variant, each of which represents one of the two main paradigms in Bayesian computational statistics: variational inference and Markov chain Monte Carlo. As it turns out,…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 25 pages

  10. arXiv:2102.11830  [pdf, other]

    stat.ML cs.LG math.NA math.PR

    Solving high-dimensional parabolic PDEs using the tensor train format

    Authors: Lorenz Richter, Leon Sallandt, Nikolas Nüsken

    Abstract: High-dimensional partial differential equations (PDEs) are ubiquitous in economics, science and engineering. However, their numerical treatment poses formidable challenges since traditional grid-based methods tend to be frustrated by the curse of dimensionality. In this paper, we argue that tensor trains provide an appealing approximation framework for parabolic PDEs: the combination of reformulat…

    Submitted 17 July, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

  11. arXiv:2010.10436  [pdf, other]

    stat.ML cs.LG math.ST

    VarGrad: A Low-Variance Gradient Estimator for Variational Inference

    Authors: Lorenz Richter, Ayman Boustati, Nikolas Nüsken, Francisco J. R. Ruiz, Ömer Deniz Akyildiz

    Abstract: We analyse the properties of an unbiased gradient estimator of the ELBO for variational inference, based on the score function method with leave-one-out control variates. We show that this gradient estimator can be obtained using a new loss, defined as the variance of the log-ratio between the exact posterior and the variational approximation, which we call the $\textit{log-variance loss}$. Under…

    Submitted 29 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  12. arXiv:2005.05409  [pdf, other]

    math.OC cs.LG math.NA math.PR stat.ML

    Solving high-dimensional Hamilton-Jacobi-Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

    Authors: Nikolas Nüsken, Lorenz Richter

    Abstract: Optimal control of diffusion processes is intimately connected to the problem of solving certain Hamilton-Jacobi-Bellman equations. Building on recent machine learning inspired approaches towards high-dimensional PDEs, we investigate the potential of $\textit{iterative diffusion optimisation}$ techniques, in particular considering applications in importance sampling and rare event simulation, and…

    Submitted 29 January, 2023; v1 submitted 11 May, 2020; originally announced May 2020.

  13. arXiv:1912.02859  [pdf, ps, other]

    math.NA math.DS

    Affine invariant interacting Langevin dynamics for Bayesian inference

    Authors: Alfredo Garbuno-Inigo, Nikolas Nüsken, Sebastian Reich

    Abstract: We propose a computational method (with acronym ALDI) for sampling from a given target distribution based on first-order (overdamped) Langevin dynamics which satisfies the property of affine invariance. The central idea of ALDI is to run an ensemble of particles with their empirical covariance serving as a preconditioner for their underlying Langevin dynamics. ALDI does not require taking the inve…

    Submitted 9 April, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    MSC Class: 65N21; 62F15; 65N75; 65C30; 90C56

  14. arXiv:1912.00894  [pdf, other]

    stat.ML cs.LG math.AP math.ST

    On the geometry of Stein variational gradient descent

    Authors: A. Duncan, N. Nuesken, L. Szpruch

    Abstract: Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely on iterated steepest descent steps with respect to a reproducing kernel Hilbert space norm. This construction leads to interacting particle systems, the mean-fi…

    Submitted 12 February, 2023; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 40 pages, 4 figures

  15. arXiv:1908.10890  [pdf, ps, other]

    math.DS math.NA

    Note on Interacting Langevin Diffusions: Gradient Structure and Ensemble Kalman Sampler by Garbuno-Inigo, Hoffmann, Li and Stuart

    Authors: Nikolas Nüsken, Sebastian Reich

    Abstract: An interacting system of Langevin dynamics driven particles has been proposed for sampling from a given posterior density by Garbuno-Inigo, Hoffmann, Li and Stuart in Interacting Langevin Diffusions: Gradient Structure and Ensemble Kalman Sampler (arXiv:1903.08866v2). The proposed formulation is primarily studied from a formal mean-field limit perspective, while the theoretical behaviour under a f…

    Submitted 28 August, 2019; originally announced August 2019.

    MSC Class: 60H10; 82C22; 62F15; 35Q84

  16. State and Parameter Estimation from Observed Signal Increments

    Authors: Nikolas Nüsken, Sebastian Reich, Paul J. Rozdeba

    Abstract: The success of the ensemble Kalman filter has triggered a strong interest in expanding its scope beyond classical state estimation problems. In this paper, we focus on continuous-time data assimilation where the model and measurement errors are correlated and both states and parameters need to be identified. Such scenarios arise from noisy and partial observations of Lagrangian particles which mov…

    Submitted 1 May, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    MSC Class: 62M20; 93E11; 93E20; 65D10; 65C05; 65C35

  17. arXiv:1806.11026  [pdf, other]

    math.PR math.NA

    Constructing sampling schemes via coupling: Markov semigroups and optimal transport

    Authors: N. Nuesken, G. A. Pavliotis

    Abstract: In this paper we develop a general framework for constructing and analysing coupled Markov chain Monte Carlo samplers, allowing for both (possibly degenerate) diffusion and piecewise deterministic Markov processes. For many performance criteria of interest, including the asymptotic variance, the task of finding efficient couplings can be phrased in terms of problems related to optimal transport th…

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: 54 pages, 5 figures

  18. arXiv:1705.00170  [pdf, other]

    math.PR math-ph math.NA math.ST

    Using Perturbed Underdamped Langevin Dynamics to Efficiently Sample from Probability Distributions

    Authors: A. B. Duncan, N. Nuesken, G. A. Pavliotis

    Abstract: In this paper we introduce and analyse Langevin samplers that consist of perturbations of the standard underdamped Langevin dynamics. The perturbed dynamics is such that its invariant measure is the same as that of the unperturbed dynamics. We show that appropriate choices of the perturbations can lead to samplers that have improved properties, at least in terms of reducing the asymptotic variance…

    Submitted 29 April, 2017; originally announced May 2017.

    Comments: 45 pages, 4 figures