Showing 1–30 of 30 results for author: Sanz-Alonso, D

  1. arXiv:2412.14318  [pdf, other]

    math.DS math.NA stat.ML

    Long-time accuracy of ensemble Kalman filters for chaotic and machine-learned dynamical systems

    Authors: Daniel Sanz-Alonso, Nathan Waniorek

    Abstract: Filtering is concerned with online estimation of the state of a dynamical system from partial and noisy observations. In applications where the state is high dimensional, ensemble Kalman filters are often the method of choice. This paper establishes long-time accuracy of ensemble Kalman filters. We introduce conditions on the dynamics and the observations under which the estimation error remains s…

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 40 pages, 4 figures

    MSC Class: 62F15; 68Q25; 60G35; 62M05

  2. arXiv:2410.10523  [pdf, other]

    stat.ML cs.LG math.OC

    Inverse Problems and Data Assimilation: A Machine Learning Approach

    Authors: Eviatar Bach, Ricardo Baptista, Daniel Sanz-Alonso, Andrew Stuart

    Abstract: The aim of these notes is to demonstrate the potential for ideas in machine learning to impact on the fields of inverse problems and data assimilation. The perspective is one that is primarily aimed at researchers from inverse problems and/or data assimilation who wish to see a mathematical presentation of machine learning as it pertains to their fields. As a by-product, we include a succinct math…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 254 pages

  3. arXiv:2405.16359  [pdf, other]

    stat.CO math.HO math.NA

    A First Course in Monte Carlo Methods

    Authors: Daniel Sanz-Alonso, Omar Al-Ghattas

    Abstract: This is a concise mathematical introduction to Monte Carlo methods, a rich family of algorithms with far-reaching applications in science and engineering. Monte Carlo methods are an exciting subject for mathematical statisticians and computational and applied mathematicians: the design and analysis of modern algorithms are rooted in a broad mathematical toolbox that includes ergodic theory of Mark…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 150 pages, 21 figures

  4. arXiv:2405.13180  [pdf, other]

    eess.SP cs.LG nlin.CD physics.ao-ph stat.AP

    Data Assimilation with Machine Learning Surrogate Models: A Case Study with FourCastNet

    Authors: Melissa Adrian, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: Modern data-driven surrogate models for weather forecasting provide accurate short-term predictions but inaccurate and nonphysical long-term forecasts. This paper investigates online weather prediction using machine learning surrogates supplemented with partial and noisy observations. We empirically demonstrate and theoretically justify that, despite the long-time instability of the surrogates and…

    Submitted 10 February, 2025; v1 submitted 21 May, 2024; originally announced May 2024.

  5. arXiv:2401.17037  [pdf, other]

    cs.LG math.NA stat.ML

    Enhancing Gaussian Process Surrogates for Optimization and Posterior Approximation via Random Exploration

    Authors: Hwanwoo Kim, Daniel Sanz-Alonso

    Abstract: This paper proposes novel noise-free Bayesian optimization strategies that rely on a random exploration step to enhance the accuracy of Gaussian process surrogate models. The new algorithms retain the ease of implementation of the classical GP-UCB algorithm, but the additional random exploration step accelerates their convergence, nearly achieving the optimal convergence rate. Furthermore, to faci…

    Submitted 17 July, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  6. arXiv:2312.09225  [pdf, ps, other]

    math.NA math.ST stat.ML

    Gaussian Process Regression under Computational and Epistemic Misspecification

    Authors: Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: Gaussian process regression is a classical kernel method for function estimation and data interpolation. In large data applications, computational costs can be reduced using low-rank or sparse approximations of the kernel. This paper investigates the effect of such kernel approximations on the interpolation error. We introduce a unified framework to analyze Gaussian process regression under import…

    Submitted 3 October, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  7. arXiv:2304.09933  [pdf, ps, other]

    math.NA stat.CO

    Analysis of a Computational Framework for Bayesian Inverse Problems: Ensemble Kalman Updates and MAP Estimators Under Mesh Refinement

    Authors: Daniel Sanz-Alonso, Nathan Waniorek

    Abstract: This paper analyzes a popular computational framework to solve infinite-dimensional Bayesian inverse problems, discretizing the prior and the forward model in a finite-dimensional weighted inner product space. We demonstrate the benefit of working on a weighted space by establishing operator-norm bounds for finite element and graph-based discretizations of Matérn-type priors and deconvolution forw…

    Submitted 20 February, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 39 pages, 0 figures

    MSC Class: 65M32 (Primary) 68Q25; 35Q62 62F15 (Secondary)

  8. arXiv:2302.11449  [pdf, other]

    stat.CO

    From Optimization to Sampling Through Gradient Flows

    Authors: N. Garcia Trillos, B. Hosseini, D. Sanz-Alonso

    Abstract: This article overviews how gradient flows, and discretizations thereof, are useful to design and analyze optimization and sampling algorithms. The interplay between optimization, sampling, and gradient flows is an active research area; our goal is to provide an accessible and lively introduction to some core ideas, emphasizing that gradient flows uncover the conceptual unity behind many optimizati…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: This article will appear in the Notices of the American Mathematical Society

  9. arXiv:2301.11961  [pdf, other]

    stat.ML cs.LG math.DS stat.CO

    Reduced-Order Autodifferentiable Ensemble Kalman Filters

    Authors: Yuming Chen, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: This paper introduces a computational framework to reconstruct and forecast a partially observed state that evolves according to an unknown or expensive-to-simulate dynamical system. Our reduced-order autodifferentiable ensemble Kalman filters (ROAD-EnKFs) learn a latent low-dimensional surrogate model for the dynamics and a decoder that maps from the latent space to the state space. The learned d…

    Submitted 27 January, 2023; originally announced January 2023.

  10. arXiv:2210.10962  [pdf, other]

    stat.ML cs.LG math.OC

    Optimization on Manifolds via Graph Gaussian Processes

    Authors: Hwanwoo Kim, Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: This paper integrates manifold learning techniques within a \emph{Gaussian process upper confidence bound} algorithm to optimize an objective function on a manifold. Our approach is motivated by applications where a full representation of the manifold is not available and querying the objective is expensive. We rely on a point cloud of manifold samples to define a graph Gaussian process surrogate…

    Submitted 8 November, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  11. arXiv:2208.03246  [pdf, ps, other]

    stat.ML math.NA math.ST stat.ME

    Non-Asymptotic Analysis of Ensemble Kalman Updates: Effective Dimension and Localization

    Authors: Omar Al Ghattas, Daniel Sanz-Alonso

    Abstract: Many modern algorithms for inverse problems and data assimilation rely on ensemble Kalman updates to blend prior predictions with observed data. Ensemble Kalman methods often perform well with a small ensemble size, which is essential in applications where generating each particle is costly. This paper develops a non-asymptotic analysis of ensemble Kalman updates that rigorously explains why a sma…

    Submitted 5 October, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

  12. arXiv:2207.01093  [pdf, other]

    stat.ML cs.LG math.PR math.ST stat.ME

    Mathematical Foundations of Graph-Based Bayesian Semi-Supervised Learning

    Authors: Nicolas García Trillos, Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: In recent decades, science and engineering have been revolutionized by a momentous growth in the amount of available data. However, despite the unprecedented ease with which data are now collected and stored, labeling data by supplementing each feature with an informative tag remains to be challenging. Illustrative tasks where the labeling process requires expert knowledge or is tedious and time-c…

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: To appear in Notices of the AMS

  13. arXiv:2205.09322  [pdf, other]

    stat.CO math.NA math.OC stat.ME

    Hierarchical Ensemble Kalman Methods with Sparsity-Promoting Generalized Gamma Hyperpriors

    Authors: Hwanwoo Kim, Daniel Sanz-Alonso, Alexander Strang

    Abstract: This paper introduces a computational framework to incorporate flexible regularization techniques in ensemble Kalman methods for nonlinear inverse problems. The proposed methodology approximates the maximum a posteriori (MAP) estimate of a hierarchical Bayesian model characterized by a conditionally Gaussian prior and generalized gamma hyperpriors. Suitable choices of hyperparameters yield sparsit…

    Submitted 19 May, 2022; originally announced May 2022.

  14. arXiv:2111.13329  [pdf, other]

    stat.ME stat.CO stat.ML

    A Variational Inference Approach to Inverse Problems with Gamma Hyperpriors

    Authors: Shiv Agrawal, Hwanwoo Kim, Daniel Sanz-Alonso, Alexander Strang

    Abstract: Hierarchical models with gamma hyperpriors provide a flexible, sparse-promoting framework to bridge $L^1$ and $L^2$ regularizations in Bayesian formulations to inverse problems. Despite the Bayesian motivation for these models, existing methodologies are limited to \textit{maximum a posteriori} estimation. The potential to perform uncertainty quantification has not yet been realized. This paper in…

    Submitted 28 November, 2021; v1 submitted 26 November, 2021; originally announced November 2021.

  15. arXiv:2109.02777  [pdf, other]

    stat.CO math.NA

    Finite Element Representations of Gaussian Processes: Balancing Numerical and Statistical Accuracy

    Authors: Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: The stochastic partial differential equation approach to Gaussian processes (GPs) represents Matérn GP priors in terms of $n$ finite element basis functions and Gaussian coefficients with sparse precision matrix. Such representations enhance the scalability of GP regression and classification to datasets of large size $N$ by setting $n\approx N$ and exploiting sparsity. In this paper we reconsider…

    Submitted 8 April, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  16. arXiv:2107.07687  [pdf, other]

    stat.ML cs.LG stat.CO

    Auto-differentiable Ensemble Kalman Filters

    Authors: Yuming Chen, Daniel Sanz-Alonso, Rebecca Willett

    Abstract: Data assimilation is concerned with sequentially estimating a temporally-evolving state. This task, which arises in a wide range of scientific and engineering applications, is particularly challenging when the state is high-dimensional and the state-space dynamics are unknown. This paper introduces a machine learning framework for learning dynamical systems in data assimilation. Our auto-different…

    Submitted 19 July, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

  17. arXiv:2106.06787  [pdf, other]

    math.NA stat.CO stat.ME

    Graph-based Prior and Forward Models for Inverse Problems on Manifolds with Boundaries

    Authors: John Harlim, Shixiao Jiang, Hwanwoo Kim, Daniel Sanz-Alonso

    Abstract: This paper develops manifold learning techniques for the numerical solution of PDE-constrained Bayesian inverse problems on manifolds with boundaries. We introduce graphical Matérn-type Gaussian field priors that enable flexible modeling near the boundaries, representing boundary values by superposition of harmonic functions with appropriate Dirichlet boundary conditions. We also investigate the g…

    Submitted 12 June, 2021; originally announced June 2021.

  18. Bayesian Update with Importance Sampling: Required Sample Size

    Authors: Daniel Sanz-Alonso, Zijian Wang

    Abstract: Importance sampling is used to approximate Bayes' rule in many computational approaches to Bayesian inverse problems, data assimilation and machine learning. This paper reviews and further investigates the required sample size for importance sampling in terms of the $\chi^2$-divergence between target and proposal. We develop general abstract theory and illustrate through numerous examples the roles t…

    Submitted 22 September, 2020; originally announced September 2020.

    MSC Class: 62-08; 62F15; 65C05

  19. arXiv:2008.11809  [pdf, ps, other]

    math.ST stat.ML

    Unlabeled Data Help in Graph-Based Semi-Supervised Learning: A Bayesian Nonparametrics Perspective

    Authors: Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: In this paper we analyze the graph-based approach to semi-supervised learning under a manifold assumption. We adopt a Bayesian perspective and demonstrate that, for a suitable choice of prior constructed with sufficiently many unlabeled data, the posterior contracts around the truth at a rate that is minimax optimal up to a logarithmic factor. Our theory covers both regression and classification.

    Submitted 12 June, 2021; v1 submitted 26 August, 2020; originally announced August 2020.

  20. arXiv:2004.08000  [pdf, other]

    stat.ME math.NA stat.CO

    The SPDE Approach to Matérn Fields: Graph Representations

    Authors: Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: This paper investigates Gaussian Markov random field approximations to nonstationary Gaussian fields using graph representations of stochastic partial differential equations. We establish approximation error guarantees building on the theory of spectral convergence of graph Laplacians. The proposed graph representations provide a generalization of the Matérn model to unstructured point clouds, and…

    Submitted 26 April, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

  21. arXiv:2003.07991  [pdf, other]

    stat.CO math.NA stat.ME

    Data-Driven Forward Discretizations for Bayesian Inversion

    Authors: Daniele Bigoni, Yuming Chen, Nicolas Garcia Trillos, Youssef Marzouk, Daniel Sanz-Alonso

    Abstract: This paper suggests a framework for the learning of discretizations of expensive forward models in Bayesian inverse problems. The main idea is to incorporate the parameters governing the discretization as part of the unknown to be estimated within the Bayesian machinery. We numerically show that in a variety of inverse problems arising in mechanical engineering, signal processing and the geoscienc…

    Submitted 21 August, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

  22. arXiv:1912.03253  [pdf, other]

    stat.CO math.NA

    HMC: avoiding rejections by not using leapfrog and some results on the acceptance rate

    Authors: M. P. Calvo, D. Sanz-Alonso, J. M. Sanz-Serna

    Abstract: The leapfrog integrator is routinely used within the Hamiltonian Monte Carlo method and its variants. We give strong numerical evidence that alternative, easy to implement algorithms yield fewer rejections with a given computational effort. When the dimensionality of the target distribution is high, the number of accepted proposals may be multiplied by a factor of three or more. This increase in t…

    Submitted 2 April, 2021; v1 submitted 6 December, 2019; originally announced December 2019.

    Comments: 37 pages, 8 figures

  23. arXiv:1904.03335  [pdf, other]

    stat.ML cs.LG

    Local Regularization of Noisy Point Clouds: Improved Global Geometric Estimates and Data Analysis

    Authors: Nicolas Garcia Trillos, Daniel Sanz-Alonso, Ruiyi Yang

    Abstract: Several data analysis techniques employ similarity relationships between data points to uncover the intrinsic dimension and geometric structure of the underlying data-generating mechanism. In this paper we work under the model assumption that the data is made of random perturbations of feature vectors lying on a low-dimensional manifold. We study two questions: how to define the similarity relatio…

    Submitted 5 April, 2019; originally announced April 2019.

  24. arXiv:1901.10082  [pdf, other]

    stat.ML cs.LG stat.CO

    Variational Characterizations of Local Entropy and Heat Regularization in Deep Learning

    Authors: Nicolas Garcia Trillos, Zach Kaplan, Daniel Sanz-Alonso

    Abstract: The aim of this paper is to provide new theoretical and computational understanding on two loss regularizations employed in deep learning, known as local entropy and heat regularization. For both regularized losses we introduce variational characterizations that naturally suggest a two-step scheme for their optimization, based on the iterative shift of a probability density and the calculation of…

    Submitted 28 January, 2019; originally announced January 2019.

  25. arXiv:1810.06191  [pdf, other]

    stat.ME

    Inverse Problems and Data Assimilation

    Authors: Daniel Sanz-Alonso, Andrew M. Stuart, Armeen Taeb

    Abstract: We provide a clear and concise introduction to the subjects of inverse problems and data assimilation, and their inter-relations. The first part of our notes covers inverse problems; this refers to the study of how to estimate unknown model parameters from data. The second part of our notes covers data assimilation; this refers to a particular class of inverse problems in which the unknown paramet…

    Submitted 14 February, 2023; v1 submitted 15 October, 2018; originally announced October 2018.

  26. arXiv:1710.07702  [pdf, other]

    stat.ML cs.LG math.PR stat.CO

    On the Consistency of Graph-based Bayesian Learning and the Scalability of Sampling Algorithms

    Authors: Nicolas Garcia Trillos, Zachary Kaplan, Thabo Samakhoana, Daniel Sanz-Alonso

    Abstract: A popular approach to semi-supervised learning proceeds by endowing the input data with a graph structure in order to extract geometric information and incorporate it into a Bayesian framework. We introduce new theory that gives appropriate scalings of graph parameters that provably lead to a well-defined limiting posterior as the size of the unlabeled data set grows. Furthermore, we show that the…

    Submitted 12 January, 2020; v1 submitted 20 October, 2017; originally announced October 2017.

  27. arXiv:1706.07193  [pdf, ps, other]

    math.PR math.AP math.SP math.ST stat.ML

    Continuum Limit of Posteriors in Graph Bayesian Inverse Problems

    Authors: Nicolas Garcia Trillos, Daniel Sanz-Alonso

    Abstract: We consider the problem of recovering a function input of a differential equation formulated on an unknown domain $M$. We assume to have access to a discrete domain $M_n=\{x_1, \dots, x_n\} \subset M$, and to noisy measurements of the output solution at $p\le n$ of those points. We introduce a graph-based Bayesian inverse problem, and show that the graph-posterior measures over functions in $M_n$…

    Submitted 22 June, 2017; originally announced June 2017.

  28. arXiv:1705.07382  [pdf, ps, other]

    math.ST math.AP stat.CO

    The Bayesian update: variational formulations and gradient flows

    Authors: Nicolas Garcia Trillos, Daniel Sanz-Alonso

    Abstract: The Bayesian update can be viewed as a variational problem by characterizing the posterior as the minimizer of a functional. The variational viewpoint is far from new and is at the heart of popular methods for posterior approximation. However, some of its consequences seem largely unexplored. We focus on the following one: defining the posterior as the minimizer of a functional gives a natural pat…

    Submitted 1 November, 2018; v1 submitted 20 May, 2017; originally announced May 2017.

  29. arXiv:1608.08814  [pdf, ps, other]

    stat.CO

    Importance Sampling and Necessary Sample Size: an Information Theory Approach

    Authors: Daniel Sanz-Alonso

    Abstract: Importance sampling approximates expectations with respect to a target measure by using samples from a proposal measure. The performance of the method over large classes of test functions depends heavily on the closeness between both measures. We derive a general bound that needs to hold for importance sampling to be successful, and relates the $f$-divergence between the target and the proposal to…

    Submitted 31 August, 2016; originally announced August 2016.

  30. arXiv:1511.06196  [pdf, ps, other]

    stat.CO

    Importance Sampling: Intrinsic Dimension and Computational Cost

    Authors: S. Agapiou, O. Papaspiliopoulos, D. Sanz-Alonso, A. M. Stuart

    Abstract: The basic idea of importance sampling is to use independent samples from a proposal measure in order to approximate expectations with respect to a target measure. It is key to understand how many samples are required in order to guarantee accurate approximations. Intuitively, some notion of distance between the target and the proposal should determine the computational cost of the method. A major…

    Submitted 14 January, 2017; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Statistical Science