Showing 1–50 of 131 results for author: Stuart, A M

  1. arXiv:2505.24134  [pdf, ps, other]

    stat.ML cs.CV cs.LG

    A Mathematical Perspective On Contrastive Learning

    Authors: Ricardo Baptista, Andrew M. Stuart, Son Tran

    Abstract: Multimodal contrastive learning is a methodology for linking different data modalities; the canonical example is linking image and text data. The methodology is typically framed as the identification of a set of encoders, one for each modality, that align representations within a common latent space. In this work, we focus on the bimodal setting and interpret contrastive learning as the optimizati…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 44 pages, 15 figures

  2. arXiv:2505.19841  [pdf, ps, other]

    stat.ML cs.LG physics.comp-ph

    Efficient Deconvolution in Populational Inverse Problems

    Authors: Arnaud Vadeboncoeur, Mark Girolami, Andrew M. Stuart

    Abstract: This work is focussed on the inversion task of inferring the distribution over parameters of interest leading to multiple sets of observations. The potential to solve such distributional inversion problems is driven by increasing availability of data, but a major roadblock is blind deconvolution, arising when the observational noise distribution is unknown. However, when data originates from colle…

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2503.16154  [pdf, ps, other]

    math.ST

    Statistical accuracy of the ensemble Kalman filter in the near-linear setting

    Authors: E. Calvello, J. A. Carrillo, F. Hoffmann, P. Monmarché, A. M. Stuart, U. Vaes

    Abstract: Estimating the state of a dynamical system from partial and noisy observations is a ubiquitous problem in a large number of applications, such as probabilistic weather forecasting and prediction of epidemics. Particle filters are a widely adopted approach to the problem and provide provably accurate approximations of the statistics of the state, but they perform poorly in high dimensions because o…

    Submitted 20 March, 2025; originally announced March 2025.

    MSC Class: 60G35; 62F15; 65C35; 70F45; 93E11

  4. arXiv:2501.17110  [pdf, other]

    math.NA cs.LG

    Solving Roughly Forced Nonlinear PDEs via Misspecified Kernel Methods and Neural Networks

    Authors: Ricardo Baptista, Edoardo Calvello, Matthieu Darcy, Houman Owhadi, Andrew M. Stuart, Xianjin Yang

    Abstract: We consider the use of Gaussian Processes (GPs) or Neural Networks (NNs) to numerically approximate the solutions to nonlinear partial differential equations (PDEs) with rough forcing or source terms, which commonly arise as pathwise solutions to stochastic PDEs. Kernel methods have recently been generalized to solve nonlinear PDEs by approximating their solutions as the maximum a posteriori estim…

    Submitted 29 January, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

    Comments: 41 pages, 7 figures

  5. arXiv:2501.15785  [pdf, other]

    cs.LG math.DS math.OC

    Memorization and Regularization in Generative Diffusion Models

    Authors: Ricardo Baptista, Agnimitra Dasgupta, Nikola B. Kovachki, Assad Oberai, Andrew M. Stuart

    Abstract: Diffusion models have emerged as a powerful framework for generative modeling. At the heart of the methodology is score matching: learning gradients of families of log-densities for noisy versions of the data distribution at different scales. When the loss function adopted in score matching is evaluated using empirical data, rather than the population loss, the minimizer corresponds to the score o…

    Submitted 18 March, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: 59 pages, 20 figures

  6. arXiv:2409.09800  [pdf, ps, other]

    math.ST math.DS math.NA math.OC

    Accuracy of the Ensemble Kalman Filter in the Near-Linear Setting

    Authors: Edoardo Calvello, Pierre Monmarché, Andrew M. Stuart, Urbain Vaes

    Abstract: The filtering distribution captures the statistics of the state of a dynamical system from partial and noisy observations. Classical particle filters provably approximate this distribution in quite general settings; however they behave poorly for high dimensional problems, suffering weight collapse. This issue is circumvented by the ensemble Kalman filter which is an equal-weight interacting parti…

    Submitted 6 February, 2025; v1 submitted 15 September, 2024; originally announced September 2024.

  7. arXiv:2408.06526  [pdf, other]

    cs.LG math.NA stat.ML

    Operator Learning Using Random Features: A Tool for Scientific Computing

    Authors: Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: Supervised operator learning centers on the use of training data, in the form of input-output pairs, to estimate maps between infinite-dimensional spaces. It is emerging as a powerful tool to complement traditional scientific computing, which may often be framed in terms of operators mapping between spaces of functions. Building on the classical random features methodology for scalar regression, t…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 36 pages, 1 table, 9 figures. SIGEST version of SIAM J. Sci. Comput. Vol. 43 No. 5 (2021) pp. A3212-A3243, hence text overlap with arXiv:2005.10224

    MSC Class: 68T05; 65D40; 62J07; 62M45; 68W20; 35R60

    Journal ref: SIAM Review Vol. 66 No. 3 (2024) pp. 535-571

  8. arXiv:2408.01362  [pdf, other]

    stat.ML cs.LG

    Autoencoders in Function Space

    Authors: Justin Bunker, Mark Girolami, Hefin Lambley, Andrew M. Stuart, T. J. Sullivan

    Abstract: Autoencoders have found widespread application in both their original deterministic form and in their variational formulation (VAEs). In scientific applications and in image processing it is often of interest to consider data that are viewed as functions; while discretisation (of differential equations arising in the sciences) or pixellation (of images) renders problems finite dimensional in pract…

    Submitted 5 January, 2025; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: 53 pages, 24 figures

    MSC Class: 62G07 (Primary) 65M99; 68T07 (Secondary)

    ACM Class: I.2.6

  9. arXiv:2406.17263  [pdf, other]

    cs.LG math.DS math.NA

    Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: In this paper, we study efficient approximate sampling for probability distributions known up to normalization constants. We specifically focus on a problem class arising in Bayesian inference for large-scale inverse problems in science and engineering applications. The computational challenges we address with the proposed methodology are: (i) the need for repeated evaluations of expensive forward…

    Submitted 11 October, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 42 pages, 10 figures

  10. arXiv:2406.06486  [pdf, other]

    cs.LG math.NA

    Continuum Attention for Neural Operators

    Authors: Edoardo Calvello, Nikola B. Kovachki, Matthew E. Levine, Andrew M. Stuart

    Abstract: Transformers, and the attention mechanism in particular, have become ubiquitous in machine learning. Their success in modeling nonlocal, long-range correlations has led to their widespread adoption in natural language processing, computer vision, and time-series problems. Neural operators, which map spaces of functions into spaces of functions, are necessarily both nonlinear and nonlocal if they a…

    Submitted 10 June, 2024; originally announced June 2024.

  11. arXiv:2405.17955  [pdf, other]

    stat.ML cs.LG stat.CO

    Efficient Prior Calibration From Indirect Data

    Authors: O. Deniz Akyildiz, Mark Girolami, Andrew M. Stuart, Arnaud Vadeboncoeur

    Abstract: Bayesian inversion is central to the quantification of uncertainty within problems arising from numerous applications in science and engineering. To formulate the approach, four ingredients are required: a forward model mapping the unknown parameter to an element of a solution space, often the solution space for a differential equation; an observation operator mapping an element of the solution sp…

    Submitted 14 May, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

  12. arXiv:2405.13149  [pdf, other]

    stat.ML cs.LG math.NA math.PR stat.CO

    Gaussian Measures Conditioned on Nonlinear Observations: Consistency, MAP Estimators, and Simulation

    Authors: Yifan Chen, Bamdad Hosseini, Houman Owhadi, Andrew M Stuart

    Abstract: The article presents a systematic study of the problem of conditioning a Gaussian random variable $ξ$ on nonlinear observations of the form $F \circ φ(ξ)$ where $φ: \mathcal{X} \to \mathbb{R}^N$ is a bounded linear operator and $F$ is nonlinear. Such problems arise in the context of Bayesian inference and recent machine learning-inspired PDE solvers. We give a representer theorem for the condition…

    Submitted 21 May, 2024; originally announced May 2024.

  13. arXiv:2405.02221  [pdf, other]

    math.NA cs.LG

    Discretization Error of Fourier Neural Operators

    Authors: Samuel Lanthaler, Andrew M. Stuart, Margaret Trautner

    Abstract: Operator learning is a variant of machine learning that is designed to approximate maps between function spaces from data. The Fourier Neural Operator (FNO) is a common model architecture used for operator learning. The FNO combines pointwise linear and nonlinear operations in physical space with pointwise linear operations in Fourier space, leading to a parameterized map acting between function s…

    Submitted 3 May, 2024; originally announced May 2024.

    MSC Class: 41A35 (Primary) 65T50; 68T07 (Secondary)

  14. arXiv:2403.14934  [pdf, other]

    math.OC

    A Stochastic Model-Based Control Methodology for Glycemic Management in the Intensive Care Unit

    Authors: Melike Sirlanci, George Hripcsak, Cecilia C. Low Wang, J. N. Stroh, Yanran Wang, Tellen D. Bennett, Andrew M. Stuart, David J. Albers

    Abstract: Intensive care unit (ICU) patients exhibit erratic blood glucose (BG) fluctuations, including hypoglycemic and hyperglycemic episodes, and require exogenous insulin delivery to keep their BG in healthy ranges. Glycemic control via glycemic management (GM) is associated with reduced mortality and morbidity in the ICU, but GM increases the cognitive load on clinicians. The availability of robust, ac…

    Submitted 3 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 26 pages, 4 figures, 5 tables

    MSC Class: 49-11

    ACM Class: I.6.3

  15. arXiv:2402.15715  [pdf, other]

    cs.LG math.NA

    Operator Learning: Algorithms and Analysis

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Andrew M. Stuart

    Abstract: Operator learning refers to the application of ideas from machine learning to approximate (typically nonlinear) operators mapping between Banach spaces of functions. Such operators often arise from physical models expressed in terms of partial differential equations (PDEs). In this context, such approximate operators hold great potential as efficient surrogate models to complement traditional nume…

    Submitted 23 February, 2024; originally announced February 2024.

  16. arXiv:2402.01593  [pdf, ps, other]

    math.NA math.DS math.OC

    Statistical Accuracy of Approximate Filtering Methods

    Authors: J. A. Carrillo, F. Hoffmann, A. M. Stuart, U. Vaes

    Abstract: Estimating the statistics of the state of a dynamical system, from partial and noisy observations, is both mathematically challenging and finds wide application. Furthermore, the applications are of great societal importance, including problems such as probabilistic weather forecasting and prediction of epidemics. Particle filters provide a well-founded approach to the problem, leading to provably…

    Submitted 31 May, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: To appear in ICIAM proceedings

    MSC Class: 60G35; 62F15; 65C35; 70F45; 93E11

  17. arXiv:2310.14555  [pdf, other]

    physics.geo-ph cs.LG

    Modeling groundwater levels in California's Central Valley by hierarchical Gaussian process and neural network regression

    Authors: Anshuman Pradhan, Kyra H. Adams, Venkat Chandrasekaran, Zhen Liu, John T. Reager, Andrew M. Stuart, Michael J. Turmon

    Abstract: Modeling groundwater levels continuously across California's Central Valley (CV) hydrological system is challenging due to low-quality well data which is sparsely and noisily sampled across time and space. The lack of consistent well data makes it difficult to evaluate the impact of 2017 and 2019 wet years on CV groundwater following a severe drought during 2012-2015. A novel machine learning meth…

    Submitted 11 October, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  18. arXiv:2310.03597  [pdf, other]

    stat.ML cs.LG math.DS math.NA

    Sampling via Gradient Flows in the Space of Probability Measures

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M Stuart

    Abstract: Sampling a target probability distribution with an unknown normalization constant is a fundamental challenge in computational science and engineering. Recent work shows that algorithms derived by considering gradient flows in the space of probability measures open up new avenues for algorithm development. This paper makes three contributions to this sampling approach by scrutinizing the design com…

    Submitted 9 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Related and text overlap with arXiv:2302.11024

  19. arXiv:2306.15924  [pdf, ps, other]

    cs.LG math.NA

    The Parametric Complexity of Operator Learning

    Authors: Samuel Lanthaler, Andrew M. Stuart

    Abstract: Neural operator architectures employ neural networks to approximate operators mapping between Banach spaces of functions; they may be used to accelerate model evaluations via emulation, or to discover models from data. Consequently, the methodology has received increasing attention over recent years, giving rise to the rapidly growing field of operator learning. The first contribution of this pape…

    Submitted 9 March, 2025; v1 submitted 28 June, 2023; originally announced June 2023.

  20. arXiv:2306.12006  [pdf, other]

    math.NA cs.LG

    Learning Homogenization for Elliptic Operators

    Authors: Kaushik Bhattacharya, Nikola Kovachki, Aakila Rajan, Andrew M. Stuart, Margaret Trautner

    Abstract: Multiscale partial differential equations (PDEs) arise in various applications, and several schemes have been developed to solve them efficiently. Homogenization theory is a powerful methodology that eliminates the small-scale dependence, resulting in simplified equations that are computationally tractable while accurately predicting the macroscopic response. In the field of continuum mechanics, h…

    Submitted 4 January, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    MSC Class: 35B27; 35J47; 74H15

  21. arXiv:2305.04962  [pdf, other]

    math.NA stat.ML

    Error Analysis of Kernel/GP Methods for Nonlinear and Parametric PDEs

    Authors: Pau Batlle, Yifan Chen, Bamdad Hosseini, Houman Owhadi, Andrew M Stuart

    Abstract: We introduce a priori Sobolev-space error estimates for the solution of nonlinear, and possibly parametric, PDEs using Gaussian process and kernel based methods. The primary assumptions are: (1) a continuous embedding of the reproducing kernel Hilbert space of the kernel into a Sobolev space of sufficient regularity; and (2) the stability of the differential operator and the solution map of the PD…

    Submitted 8 May, 2023; originally announced May 2023.

    MSC Class: 60G15; 65M75; 65N75; 65N35; 47B34; 41A15; 35R30; 34B15

  22. arXiv:2304.13221  [pdf, other]

    math.NA cs.LG

    Nonlocality and Nonlinearity Implies Universality in Operator Learning

    Authors: Samuel Lanthaler, Zongyi Li, Andrew M. Stuart

    Abstract: Neural operator architectures approximate operators between infinite-dimensional Banach spaces of functions. They are gaining increased attention in computational science and engineering, due to their potential both to accelerate traditional numerical methods and to enable data-driven discovery. As the field is in its infancy basic questions about minimal requirements for universal approximation r…

    Submitted 14 June, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  23. arXiv:2302.11024  [pdf, other]

    stat.ML math.NA

    Gradient Flows for Sampling: Mean-Field Models, Gaussian Approximations and Affine Invariance

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: Sampling a probability distribution with an unknown normalization constant is a fundamental problem in computational science and engineering. This task may be cast as an optimization problem over all probability measures, and an initial distribution can be evolved to the desired minimizer dynamically via gradient flows. Mean-field models, whose law is governed by the gradient flow in the space of…

    Submitted 10 September, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 82 pages, 8 figures (Welcome any feedback!)

  24. arXiv:2212.13239  [pdf, ps, other]

    math.OC math.DS math.NA

    The Mean Field Ensemble Kalman Filter: Near-Gaussian Setting

    Authors: J. A. Carrillo, F. Hoffmann, A. M. Stuart, U. Vaes

    Abstract: The ensemble Kalman filter is widely used in applications because, for high dimensional filtering problems, it has a robustness that is not shared for example by the particle filter; in particular it does not suffer from weight collapse. However, there is no theory which quantifies its accuracy as an approximation of the true filtering distribution, except in the Gaussian setting. To address this…

    Submitted 27 August, 2024; v1 submitted 26 December, 2022; originally announced December 2022.

    MSC Class: 62F15; 65C35; 93E11; 70F45

  25. Learning macroscopic internal variables and history dependence from microscopic models

    Authors: Burigede Liu, Eric Ocegueda, Margaret Trautner, Andrew M. Stuart, Kaushik Bhattacharya

    Abstract: This paper concerns the study of history dependent phenomena in heterogeneous materials in a two-scale setting where the material is specified at a fine microscopic scale of heterogeneities that is much smaller than the coarse macroscopic scale of application. We specifically study a polycrystalline medium where each grain is governed by crystal plasticity while the solid is subjected to macroscop…

    Submitted 30 April, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  26. arXiv:2209.11371  [pdf, other]

    math.OC math.NA

    Ensemble Kalman Methods: A Mean Field Perspective

    Authors: Edoardo Calvello, Sebastian Reich, Andrew M. Stuart

    Abstract: Ensemble Kalman methods are widely used for state estimation in the geophysical sciences. Their success stems from the fact that they take an underlying (possibly noisy) dynamical system as a black box to provide a systematic, derivative-free methodology for incorporating noisy, partial and possibly indirect observations to update estimates of the state; furthermore the ensemble approach allows fo…

    Submitted 7 October, 2024; v1 submitted 22 September, 2022; originally announced September 2022.

  27. arXiv:2208.04506  [pdf, other]

    math.DS cs.LG math.NA stat.ME

    Second Order Ensemble Langevin Method for Sampling and Inverse Problems

    Authors: Ziming Liu, Andrew M. Stuart, Yixuan Wang

    Abstract: We propose a sampling method based on an ensemble approximation of second order Langevin dynamics. The log target density is appended with a quadratic term in an auxiliary momentum variable and damped-driven Hamiltonian dynamics introduced; the resulting stochastic differential equation is invariant to the Gibbs measure, with marginal on the position coordinates given by the target. A precondition…

    Submitted 24 October, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

  28. arXiv:2205.14139  [pdf, other]

    math.NA

    Learning Markovian Homogenized Models in Viscoelasticity

    Authors: Kaushik Bhattacharya, Burigede Liu, Andrew M. Stuart, Margaret Trautner

    Abstract: Fully resolving dynamics of materials with rapidly-varying features involves expensive fine-scale computations which need to be conducted on macroscopic scales. The theory of homogenization provides an approach to derive effective macroscopic equations which eliminates the small scales by exploiting scale separation. An accurate homogenized model avoids the computationally-expensive task of numeri…

    Submitted 4 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

  29. arXiv:2204.04386  [pdf, other]

    math.NA

    Efficient Derivative-free Bayesian Inference for Large-Scale Inverse Problems

    Authors: Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: We consider Bayesian inference for large scale inverse problems, where computational challenges arise from the need for repeated evaluations of an expensive forward model. This renders most Markov chain Monte Carlo approaches infeasible, since they typically require $O(10^4)$ model runs, or more. Moreover, the forward model is often given as a black box or is impractical to differentiate. Therefor…

    Submitted 11 August, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: 44 pages, 15 figures

  30. arXiv:2203.13181  [pdf, other]

    math.NA

    The Cost-Accuracy Trade-Off In Operator Learning With Neural Networks

    Authors: Maarten V. de Hoop, Daniel Zhengyu Huang, Elizabeth Qian, Andrew M. Stuart

    Abstract: The term `surrogate modeling' in computational science and engineering refers to the development of computationally efficient approximations for expensive simulations, such as those arising from numerical solution of partial differential equations (PDEs). Surrogate modeling is an enabling methodology for many-query computations in science and engineering, which include iterative methods in optimiz…

    Submitted 11 August, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: 48 pages, 19 figures

  31. Ensemble-Based Experimental Design for Targeting Data Acquisition to Inform Climate Models

    Authors: Oliver R. A. Dunbar, Michael F. Howland, Tapio Schneider, Andrew M. Stuart

    Abstract: Data required to calibrate uncertain GCM parameterizations are often only available in limited regions or time periods, for example, observational data from field campaigns, or data generated in local high-resolution simulations. This raises the question of where and when to acquire additional data to be maximally informative about parameterizations in a GCM. Here we construct a new ensemble-based…

    Submitted 27 June, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

  32. arXiv:2108.12515  [pdf, other]

    math.ST cs.LG stat.ME stat.ML

    Convergence Rates for Learning Linear Operators from Noisy Data

    Authors: Maarten V. de Hoop, Nikola B. Kovachki, Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues give…

    Submitted 2 November, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: To appear in SIAM/ASA Journal on Uncertainty Quantification (JUQ); 34 pages, 5 figures, 2 tables

    MSC Class: 62G20; 62C10; 68T05; 47A62

    Journal ref: SIAM/ASA J. Uncertainty Quantification Vol. 11 No. 2 (2023) pp. 480-513

  33. arXiv:2107.06658  [pdf, other]

    math.DS cs.LG stat.ML

    A Framework for Machine Learning of Model Error in Dynamical Systems

    Authors: Matthew E. Levine, Andrew M. Stuart

    Abstract: The development of data-informed predictive models for dynamical systems is of widespread interest in many disciplines. We present a unifying framework for blending mechanistic and machine-learning approaches to identify dynamical systems from noisily and partially observed data. We compare pure data-driven learning with hybrid models which incorporate imperfect domain knowledge. Our formulation i…

    Submitted 17 August, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

  34. arXiv:2106.02519  [pdf, other]

    math.DS math.NA

    Consensus Based Sampling

    Authors: J. A. Carrillo, F. Hoffmann, A. M. Stuart, U. Vaes

    Abstract: We propose a novel method for sampling and optimization tasks based on a stochastic interacting particle system. We explain how this method can be used for the following two goals: (i) generating approximate samples from a given target distribution; (ii) optimizing a given objective function. The approach is derivative-free and affine invariant, and is therefore well-suited for solving inverse pro…

    Submitted 4 November, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    MSC Class: 62F15; 65C35; 65N21; 35G25

  35. arXiv:2104.03384  [pdf, other]

    math.NA stat.ML

    Ensemble Inference Methods for Models With Noisy and Expensive Likelihoods

    Authors: Oliver R. A. Dunbar, Andrew B. Duncan, Andrew M. Stuart, Marie-Therese Wolfram

    Abstract: The increasing availability of data presents an opportunity to calibrate unknown parameters which appear in complex models of phenomena in the biomedical, physical and social sciences. However, model complexity often leads to parameter-to-data maps which are expensive to evaluate and are only available through noisy approximations. This paper is concerned with the use of interacting particle syste…

    Submitted 22 January, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

    MSC Class: 65C05; 65C40; 60J22

  36. arXiv:2103.12959  [pdf, other]

    math.NA stat.ML

    Solving and Learning Nonlinear PDEs with Gaussian Processes

    Authors: Yifan Chen, Bamdad Hosseini, Houman Owhadi, Andrew M Stuart

    Abstract: We introduce a simple, rigorous, and unified framework for solving nonlinear partial differential equations (PDEs), and for solving inverse problems (IPs) involving the identification of parameters in PDEs, using the framework of Gaussian processes. The proposed approach: (1) provides a natural generalization of collocation kernel methods to nonlinear PDEs and IPs; (2) has guaranteed convergence f…

    Submitted 10 August, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: 41 pages

    MSC Class: 60G15; 65M75; 65N75; 65N35; 47B34; 41A15; 35R30; 34B15

  37. arXiv:2102.01580  [pdf, other]

    math.NA math.DS

    Iterated Kalman Methodology For Inverse Problems

    Authors: Daniel Zhengyu Huang, Tapio Schneider, Andrew M. Stuart

    Abstract: This paper is focused on the optimization approach to the solution of inverse problems. We introduce a stochastic dynamical system in which the parameter-to-data map is embedded, with the goal of employing techniques from nonlinear Kalman filtering to estimate the parameter given the data. The extended Kalman filter (which we refer to as ExKI in the context of inverse problems) can be effective fo…

    Submitted 28 April, 2022; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: 56 pages, 24 figures

  38. arXiv:2102.00540  [pdf, other]

    math.DS

    Derivative-free Bayesian Inversion Using Multiscale Dynamics

    Authors: G. A. Pavliotis, A. M. Stuart, U. Vaes

    Abstract: Inverse problems are ubiquitous because they formalize the integration of data with mathematical models. In many scientific applications the forward model is expensive to evaluate, and adjoint computations are difficult to employ; in this setting derivative-free methods which involve a small number of forward model evaluations are an attractive proposition. Ensemble Kalman based interacting partic…

    Submitted 4 November, 2021; v1 submitted 31 January, 2021; originally announced February 2021.

    MSC Class: 62F15; 65C35; 65C30; 65N21

  39. Calibration and Uncertainty Quantification of Convective Parameters in an Idealized GCM

    Authors: Oliver R. A. Dunbar, Alfredo Garbuno-Inigo, Tapio Schneider, Andrew M. Stuart

    Abstract: Parameters in climate models are usually calibrated manually, exploiting only small subsets of the available data. This precludes both optimal calibration and quantification of uncertainties. Traditional Bayesian calibration methods that allow uncertainty quantification are too expensive for climate models; they are also not robust in the presence of internal climate variability. For example, Mark…

    Submitted 19 August, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

  40. arXiv:2009.13457  [pdf, other]

    math.NA

    Drift Estimation of Multiscale Diffusions Based on Filtered Data

    Authors: Assyr Abdulle, Giacomo Garegnani, Grigorios A. Pavliotis, Andrew M. Stuart, Andrea Zanoni

    Abstract: We study the problem of drift estimation for two-scale continuous time series. We set ourselves in the framework of overdamped Langevin equations, for which a single-scale surrogate homogenized equation exists. In this setting, estimating the drift coefficient of the homogenized equation requires pre-processing of the data, often in the form of subsampling; this is because the two-scale equation a…

    Submitted 6 June, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  41. Posterior Consistency of Semi-Supervised Regression on Graphs

    Authors: Andrea L. Bertozzi, Bamdad Hosseini, Hao Li, Kevin Miller, Andrew M. Stuart

    Abstract: Graph-based semi-supervised regression (SSR) is the problem of estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered nodes. We present…

    Submitted 24 March, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

  42. arXiv:2007.06175  [pdf, other]

    math.OC math.DS

    Ensemble Kalman Inversion for Sparse Learning of Dynamical Systems from Time-Averaged Data

    Authors: Tapio Schneider, Andrew M. Stuart, Jin-Long Wu

    Abstract: Enforcing sparse structure within learning has led to significant advances in the field of data-driven discovery of dynamical systems. However, such methods require access not only to time-series of the state of the dynamical system, but also to the time derivative. In many applications, the data are available only in the form of time-averages such as moments and autocorrelation functions. We prop…

    Submitted 20 October, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 51 pages, 30 figures

  43. arXiv:2005.11375  [pdf, other]

    math.ST math.NA stat.ML

    Consistency of Empirical Bayes And Kernel Flow For Hierarchical Parameter Estimation

    Authors: Yifan Chen, Houman Owhadi, Andrew M. Stuart

    Abstract: Gaussian process regression has proven very powerful in statistics, machine learning and inverse problems. A crucial aspect of the success of this methodology, in a wide range of applications to complex and real-world problems, is hierarchical modeling and learning of hyperparameters. The purpose of this paper is to study two paradigms of learning hierarchical parameters: one is from the probabili…

    Submitted 16 March, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: to appear in Mathematics of Computation

    MSC Class: 65F12 62C10 41A05 35Q62

  44. arXiv:2005.10224  [pdf, other]

    math.NA cs.LG physics.comp-ph stat.ML

    The Random Feature Model for Input-Output Maps between Banach Spaces

    Authors: Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: Well known to the machine learning community, the random feature model is a parametric approximation to kernel interpolation or regression methods. It is typically used to approximate functions mapping a finite-dimensional input space to the real line. In this paper, we instead propose a methodology for use of the random feature model as a data-driven surrogate for operators that map an input Bana…

    Submitted 5 June, 2021; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: To appear in SIAM Journal on Scientific Computing; 32 pages, 9 figures

    MSC Class: 65D15; 65D40; 62M45; 35R60

    Journal ref: SIAM J. Sci. Comput. Vol. 43 No. 5 (2021) pp. A3212-A3243

  45. arXiv:2005.03180  [pdf, other]

    math.NA cs.LG stat.ML

    Model Reduction and Neural Networks for Parametric PDEs

    Authors: Kaushik Bhattacharya, Bamdad Hosseini, Nikola B. Kovachki, Andrew M. Stuart

    Abstract: We develop a general framework for data-driven approximation of input-output maps between infinite-dimensional spaces. The proposed approach is motivated by the recent successes of neural networks and deep learning, in combination with ideas from model reduction. This combination results in a neural network approximation which, in principle, is defined on infinite-dimensional spaces and, in practi…

    Submitted 17 June, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 39 pages, 13 figures

    MSC Class: 65N75; 62M45; 68T05; 60H30; 60H15

  46. arXiv:2004.08376  [pdf, other]

    stat.CO physics.comp-ph

    Learning Stochastic Closures Using Ensemble Kalman Inversion

    Authors: Tapio Schneider, Andrew M. Stuart, Jin-Long Wu

    Abstract: Although the governing equations of many systems, when derived from first principles, may be viewed as known, it is often too expensive to numerically simulate all the interactions they describe. Therefore researchers often seek simpler descriptions that describe complex phenomena without numerically resolving all the interacting components. Stochastic differential equations (SDEs) arise naturally…

    Submitted 30 April, 2021; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: 35 pages, 26 figures

  47. Calibrate, Emulate, Sample

    Authors: Emmet Cleary, Alfredo Garbuno-Inigo, Shiwei Lan, Tapio Schneider, Andrew M Stuart

    Abstract: Many parameter estimation problems arising in applications are best cast in the framework of Bayesian inversion. This allows not only for an estimate of the parameters, but also for the quantification of uncertainties in the estimates. Often in such problems the parameter-to-data map is very expensive to evaluate, and computing derivatives of the map, or derivative-adjoints, may not be feasible. A…

    Submitted 10 January, 2020; originally announced January 2020.

  48. arXiv:1910.14193  [pdf, other]

    q-bio.QM

    A Simple Modeling Framework For Prediction In The Human Glucose-Insulin System

    Authors: M. Sirlanci, M. E. Levine, C. C. Low Wang, D. J. Albers, A. M. Stuart

    Abstract: In this paper, we build a new, simple, and interpretable mathematical model to estimate and forecast physiology related to the human glucose-insulin system, constrained by available data. By constructing a simple yet flexible model class with interpretable parameters, this general model can be specialized to work in different settings, such as type 2 diabetes mellitus (T2DM) and intensive care uni…

    Submitted 20 September, 2022; v1 submitted 30 October, 2019; originally announced October 2019.

    Comments: 41 pages, 8 figures, 4 tables

    MSC Class: 92

  49. arXiv:1909.06389  [pdf, other]

    math.SP math.AP stat.ML

    Spectral Analysis Of Weighted Laplacians Arising In Data Clustering

    Authors: Franca Hoffmann, Bamdad Hosseini, Assad A. Oberai, Andrew M. Stuart

    Abstract: Graph Laplacians computed from weighted adjacency matrices are widely used to identify geometric structure in data, and clusters in particular; their spectral properties play a central role in a number of unsupervised and semi-supervised learning algorithms. When suitably scaled, graph Laplacians approach limiting continuum operators in the large data limit. Studying these limiting operators, ther…

    Submitted 13 July, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    MSC Class: 47A75; 62H30; 68T10; 35B20; 05C50

  50. arXiv:1906.07658  [pdf, other]

    stat.ML cs.LG math.NA math.OC

    Consistency of semi-supervised learning algorithms on graphs: Probit and one-hot methods

    Authors: Franca Hoffmann, Bamdad Hosseini, Zhi Ren, Andrew M. Stuart

    Abstract: Graph-based semi-supervised learning is the problem of propagating labels from a small number of labelled data points to a larger set of unlabelled data. This paper is concerned with the consistency of optimization-based techniques for such problems, in the limit where the labels have small noise and the underlying unlabelled data is well clustered. We study graph-based probit for binary classific…

    Submitted 9 March, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    MSC Class: 62H30; 68T10; 68Q87; 91C20