-
Adaptive tempering schedules with approximative intermediate measures for filtering problems
Authors:
Iris Rammelmüller,
Gottfried Hastermann,
Jana de Wiljes
Abstract:
Data assimilation algorithms integrate prior information from numerical model simulations with observed data. Ensemble-based filters, regarded as state-of-the-art, are widely employed for large-scale estimation tasks in disciplines such as geoscience and meteorology. Despite their inability to produce the true posterior distribution for nonlinear systems, their robustness and capacity for state tr…
▽ More
Data assimilation algorithms integrate prior information from numerical model simulations with observed data. Ensemble-based filters, regarded as state-of-the-art, are widely employed for large-scale estimation tasks in disciplines such as geoscience and meteorology. Despite their inability to produce the true posterior distribution for nonlinear systems, their robustness and capacity for state tracking are noteworthy. In contrast, Particle filters yield the correct distribution in the ensemble limit but require substantially larger ensemble sizes than ensemble-based filters to maintain stability in higher-dimensional spaces. It is essential to transcend traditional Gaussian assumptions to achieve realistic quantification of uncertainties. One approach involves the hybridisation of filters, facilitated by tempering, to harness the complementary strengths of different filters. A new adaptive tempering method is proposed to tune the underlying schedule, aiming to systematically surpass the performance previously achieved. Although promising numerical results for certain filter combinations in toy examples exist in the literature, the tuning of hyperparameters presents a considerable challenge. A deeper understanding of these interactions is crucial for practical applications.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
LDP polygons and the number 12 revisited
Authors:
Ulrike Bücking,
Christian Haase,
Karin Schaller,
Jan-Hendrik de Wiljes
Abstract:
We give a combinatorial proof of a lattice point identity involving a lattice polygon and its dual, generalizing the formula $area(Δ) + area(Δ^*) = 6$ for reflexive $Δ$. The identity is equivalent to the stringy Libgober-Wood identity for toric log del Pezzo surfaces.
We give a combinatorial proof of a lattice point identity involving a lattice polygon and its dual, generalizing the formula $area(Δ) + area(Δ^*) = 6$ for reflexive $Δ$. The identity is equivalent to the stringy Libgober-Wood identity for toric log del Pezzo surfaces.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
A continued learning approach for model-informed precision dosing: updating models in clinical practice
Authors:
Corinna Maier,
Jana de Wiljes,
Niklas Hartung,
Charlotte Kloft,
Wilhelm Huisinga
Abstract:
Model-informed precision dosing (MIPD) is a quantitative dosing framework that combines prior knowledge on the drug-disease-patient system with patient data from therapeutic drug/ biomarker monitoring (TDM) to support individualized dosing in ongoing treatment.Structural models and prior parameter distributions used in MIPD approaches typically build on prior clinical trials that involve only a li…
▽ More
Model-informed precision dosing (MIPD) is a quantitative dosing framework that combines prior knowledge on the drug-disease-patient system with patient data from therapeutic drug/ biomarker monitoring (TDM) to support individualized dosing in ongoing treatment.Structural models and prior parameter distributions used in MIPD approaches typically build on prior clinical trials that involve only a limited number of patients selected according to some exclusion/inclusion criteria. Compared to the prior clinical trial population, the patient population in clinical practice can be expected to include also altered behavior and/or increased interindividual variability, the extent of which, however, is typically unknown. Here, we address the question of how to adapt and refine models on the level of the model parameters to better reflect this real-world diversity. We propose an approach for continued learning across patients during MIPD using a sequential hierarchical Bayesian framework. The approach builds on two stages to separate the update of the individual patient parameters from updating the population parameters. Consequently, it enables continued learning across hospitals or study centers, since only summary patient data (on the level of model parameters) need to be shared, but no individual TDM data. We illustrate this continued learning approach with neutrophil-guided dosing of paclitaxel. The present study constitutes an important step towards building confidence in MIPD and eventually establishing MIPD increasingly in everyday therapeutic use.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Randomized maximum likelihood based posterior sampling
Authors:
Yuming Ba,
Jana de Wiljes,
Dean S. Oliver,
Sebastian Reich
Abstract:
Minimization of a stochastic cost function is commonly used for approximate sampling in high-dimensional Bayesian inverse problems with Gaussian prior distributions and multimodal posterior distributions. The density of the samples generated by minimization is not the desired target density, unless the observation operator is linear, but the distribution of samples is useful as a proposal density…
▽ More
Minimization of a stochastic cost function is commonly used for approximate sampling in high-dimensional Bayesian inverse problems with Gaussian prior distributions and multimodal posterior distributions. The density of the samples generated by minimization is not the desired target density, unless the observation operator is linear, but the distribution of samples is useful as a proposal density for importance sampling or for Markov chain Monte Carlo methods. In this paper, we focus on applications to sampling from multimodal posterior distributions in high dimensions. We first show that sampling from multimodal distributions is improved by computing all critical points instead of only minimizers of the objective function. For applications to high-dimensional geoscience problems, we demonstrate an efficient approximate weighting that uses a low-rank Gauss-Newton approximation of the determinant of the Jacobian. The method is applied to two toy problems with known posterior distributions and a Darcy flow problem with multiple modes in the posterior.
△ Less
Submitted 17 August, 2021; v1 submitted 10 January, 2021;
originally announced January 2021.
-
Reconstructing regime-dependent causal relationships from observational time series
Authors:
Elena Saggioro,
Jana de Wiljes,
Marlene Kretschmer,
Jakob Runge
Abstract:
Inferring causal relations from observational time series data is a key problem across science and engineering whenever experimental interventions are infeasible or unethical. Increasing data availability over the past decades has spurred the development of a plethora of causal discovery methods, each addressing particular challenges of this difficult task. In this paper we focus on an important c…
▽ More
Inferring causal relations from observational time series data is a key problem across science and engineering whenever experimental interventions are infeasible or unethical. Increasing data availability over the past decades has spurred the development of a plethora of causal discovery methods, each addressing particular challenges of this difficult task. In this paper we focus on an important challenge that is at the core of time series causal discovery: regime-dependent causal relations. Often dynamical systems feature transitions depending on some, often persistent, unobserved background regime, and different regimes may exhibit different causal relations. Here, we assume a persistent and discrete regime variable leading to a finite number of regimes within which we may assume stationary causal relations. To detect regime-dependent causal relations, we combine the conditional independence-based PCMCI method with a regime learning optimisation approach. PCMCI allows for linear and nonlinear, high-dimensional time series causal discovery. Our method, Regime-PCMCI, is evaluated on a number of numerical experiments demonstrating that it can distinguish regimes with different causal directions, time lags, effects and sign of causal links, as well as changes in the variables' autocorrelation. Further, Regime-PCMCI is employed to observations of El Niño Southern Oscillation and Indian rainfall, demonstrating skill also in real-world datasets.
△ Less
Submitted 1 July, 2020;
originally announced July 2020.
-
Thermophysical modelling and parameter estimation of small solar system bodies via data assimilation
Authors:
M. Hamm,
I. Pelivan,
M. Grott,
J. de Wiljes
Abstract:
Deriving thermophysical properties such as thermal inertia from thermal infrared observations provides useful insights into the structure of the surface material on planetary bodies. The estimation of these properties is usually done by fitting temperature variations calculated by thermophysical models to infrared observations. For multiple free model parameters, traditional methods such as Least-…
▽ More
Deriving thermophysical properties such as thermal inertia from thermal infrared observations provides useful insights into the structure of the surface material on planetary bodies. The estimation of these properties is usually done by fitting temperature variations calculated by thermophysical models to infrared observations. For multiple free model parameters, traditional methods such as Least-Squares fitting or Markov-Chain Monte-Carlo methods become computationally too expensive. Consequently, the simultaneous estimation of several thermophysical parameters together with their corresponding uncertainties and correlations is often not computationally feasible and the analysis is usually reduced to fitting one or two parameters. Data assimilation methods have been shown to be robust while sufficiently accurate and computationally affordable even for a large number of parameters. This paper will introduce a standard sequential data assimilation method, the Ensemble Square Root Filter, to thermophysical modelling of asteroid surfaces. This method is used to re-analyse infrared observations of the MARA instrument, which measured the diurnal temperature variation of a single boulder on the surface of near-Earth asteroid (162173) Ryugu. The thermal inertia is estimated to be $295 \pm 18$ $\mathrm{J\,m^{-2}\,K^{-1}\,s^{-1/2}}$, while all five free parameters of the initial analysis are varied and estimated simultaneously. Based on this thermal inertia estimate the thermal conductivity of the boulder is estimated to be between 0.07 and 0.12 $\mathrm{W\,m^{-1}\,K^{-1}}$ and the porosity to be between 0.30 and 0.52. For the first time in thermophysical parameter derivation, correlations and uncertainties of all free model parameters are incorporated in the estimation procedure which is more than 5000 times more efficient than a comparable parameter sweep.
△ Less
Submitted 7 July, 2020; v1 submitted 30 March, 2020;
originally announced March 2020.
-
Analysis of a localised nonlinear Ensemble Kalman Bucy Filter with complete and accurate observations
Authors:
Jana de Wiljes,
Xin T. Tong
Abstract:
Concurrent observation technologies have made high-precision real-time data available in large quantities. Data assimilation (DA) is concerned with how to combine this data with physical models to produce accurate predictions. For spatial-temporal models, the Ensemble Kalman Filter with proper localization techniques is considered to be a state-of-the-art DA methodology. This article proposes and…
▽ More
Concurrent observation technologies have made high-precision real-time data available in large quantities. Data assimilation (DA) is concerned with how to combine this data with physical models to produce accurate predictions. For spatial-temporal models, the Ensemble Kalman Filter with proper localization techniques is considered to be a state-of-the-art DA methodology. This article proposes and investigates a localized Ensemble Kalman Bucy Filter (l-EnKBF) for nonlinear models with short-range interactions. We derive dimension-independent and component-wise error bounds and show the long time path-wise error only has logarithmic dependence on the time range. The theoretical results are verified through some simple numerical tests.
△ Less
Submitted 7 July, 2020; v1 submitted 28 August, 2019;
originally announced August 2019.
-
Ensemble transform algorithms for nonlinear smoothing problems
Authors:
Jana de Wiljes,
Sahani Pathiraja,
Sebastian Reich
Abstract:
Several numerical tools designed to overcome the challenges of smoothing in a nonlinear and non-Gaussian setting are investigated for a class of particle smoothers. The considered family of smoothers is induced by the class of linear ensemble transform filters which contains classical filters such as the stochastic ensemble Kalman filter, the ensemble square root filter and the recently introduced…
▽ More
Several numerical tools designed to overcome the challenges of smoothing in a nonlinear and non-Gaussian setting are investigated for a class of particle smoothers. The considered family of smoothers is induced by the class of linear ensemble transform filters which contains classical filters such as the stochastic ensemble Kalman filter, the ensemble square root filter and the recently introduced nonlinear ensemble transform filter. Further the ensemble transform particle smoother is introduced and particularly highlighted as it is consistent in the particle limit and does not require assumptions with respect to the family of the posterior distribution. The linear update pattern of the considered class of linear ensemble transform smoothers allows one to implement important supplementary techniques such as adaptive spread corrections, hybrid formulations, and localization in order to facilitate their application to complex estimation problems. These additional features are derived and numerically investigated for a sequence of increasingly challenging test problems.
△ Less
Submitted 28 October, 2019; v1 submitted 18 January, 2019;
originally announced January 2019.
-
Interacting particle filters for simultaneous state and parameter estimation
Authors:
Angwenyi David,
Jana de Wiljes,
Sebastian Reich
Abstract:
Simultaneous state and parameter estimation arises from various applicational areas but presents a major computational challenge. Most available Markov chain or sequential Monte Carlo techniques are applicable to relatively low dimensional problems only. Alternative methods, such as the ensemble Kalman filter or other ensemble transform filters have, on the other hand, been successfully applied to…
▽ More
Simultaneous state and parameter estimation arises from various applicational areas but presents a major computational challenge. Most available Markov chain or sequential Monte Carlo techniques are applicable to relatively low dimensional problems only. Alternative methods, such as the ensemble Kalman filter or other ensemble transform filters have, on the other hand, been successfully applied to high dimensional state estimation problems. In this paper, we propose an extension of these techniques to high dimensional state space models which depend on a few unknown parameters. More specifically, we combine the ensemble Kalman-Bucy filter for the continuous-time filtering problem with a generalized ensemble transform particle filter for intermittent parameter updates. We demonstrate the performance of this two stage update filter for a wave equation with unknown wave velocity parameter.
△ Less
Submitted 26 September, 2017;
originally announced September 2017.
-
Complete Subgraphs of the Coprime Hypergraph of Integers III: Construction
Authors:
Jan-Hendrik de Wiljes
Abstract:
The coprime hypergraph of integers on $n$ vertices $CHI_k(n)$ is defined via vertex set $\{1,2,\dots,n\}$ and hyperedge set $\{\{v_1,v_2,\dots,v_{k+1}\}\subseteq\{1,2,\dots,n\}:\gcd(v_1,v_2,\dots,v_{k+1})=1\}$. In this article we present ideas on how to construct maximal subgraphs in $CHI_k(n)$. This continues the author's earlier work, which dealt with bounds on the size and structural properties…
▽ More
The coprime hypergraph of integers on $n$ vertices $CHI_k(n)$ is defined via vertex set $\{1,2,\dots,n\}$ and hyperedge set $\{\{v_1,v_2,\dots,v_{k+1}\}\subseteq\{1,2,\dots,n\}:\gcd(v_1,v_2,\dots,v_{k+1})=1\}$. In this article we present ideas on how to construct maximal subgraphs in $CHI_k(n)$. This continues the author's earlier work, which dealt with bounds on the size and structural properties of these subgraphs. We succeed in the cases $k\in\{1,2,3\}$ and give promising ideas for $k\geq 4$.
△ Less
Submitted 12 August, 2017;
originally announced August 2017.
-
Kalman Filter and its Modern Extensions for the Continuous-time Nonlinear Filtering Problem
Authors:
Amirhossein Taghvaei,
Jana de Wiljes,
Prashant G. Mehta,
Sebastian Reich
Abstract:
This paper is concerned with the filtering problem in continuous-time. Three algorithmic solution approaches for this problem are reviewed: (i) the classical Kalman-Bucy filter which provides an exact solution for the linear Gaussian problem, (ii) the ensemble Kalman-Bucy filter (EnKBF) which is an approximate filter and represents an extension of the Kalman-Bucy filter to nonlinear problems, and…
▽ More
This paper is concerned with the filtering problem in continuous-time. Three algorithmic solution approaches for this problem are reviewed: (i) the classical Kalman-Bucy filter which provides an exact solution for the linear Gaussian problem, (ii) the ensemble Kalman-Bucy filter (EnKBF) which is an approximate filter and represents an extension of the Kalman-Bucy filter to nonlinear problems, and (iii) the feedback particle filter (FPF) which represents an extension of the EnKBF and furthermore provides for an consistent solution in the general nonlinear, non-Gaussian case. The common feature of the three algorithms is the gain times error formula to implement the update step (to account for conditioning due to the observations) in the filter. In contrast to the commonly used sequential Monte Carlo methods, the EnKBF and FPF avoid the resampling of the particles in the importance sampling update step. Moreover, the feedback control structure provides for error correction potentially leading to smaller simulation variance and improved stability properties. The paper also discusses the issue of non-uniqueness of the filter update formula and formulates a novel approximation algorithm based on ideas from optimal transport and coupling of measures. Performance of this and other algorithms is illustrated for a numerical example.
△ Less
Submitted 21 December, 2017; v1 submitted 21 February, 2017;
originally announced February 2017.
-
Long-time stability and accuracy of the ensemble Kalman-Bucy filter for fully observed processes and small measurement noise
Authors:
Jana de Wiljes,
Sebastian Reich,
Wilhelm Stannat
Abstract:
The ensemble Kalman filter has become a popular data assimilation technique in the geosciences. However, little is known theoretically about its long term stability and accuracy. In this paper, we investigate the behavior of an ensemble Kalman-Bucy filter applied to continuous-time filtering problems. We derive mean field limiting equations as the ensemble size goes to infinity as well as uniform-…
▽ More
The ensemble Kalman filter has become a popular data assimilation technique in the geosciences. However, little is known theoretically about its long term stability and accuracy. In this paper, we investigate the behavior of an ensemble Kalman-Bucy filter applied to continuous-time filtering problems. We derive mean field limiting equations as the ensemble size goes to infinity as well as uniform-in-time accuracy and stability results for finite ensemble sizes. The later results require that the process is fully observed and that the measurement noise is small. We also demonstrate that our ensemble Kalman-Bucy filter is consistent with the classic Kalman-Bucy filter for linear systems and Gaussian processes. We finally verify our theoretical findings for the Lorenz-63 system.
△ Less
Submitted 24 November, 2017; v1 submitted 19 December, 2016;
originally announced December 2016.
-
Second-order accurate ensemble transform particle filters
Authors:
Walter Acevedo,
Jana de Wiljes,
Sebastian Reich
Abstract:
Particle filters (also called sequential Monte Carlo methods) are widely used for state and parameter estimation problems in the context of nonlinear evolution equations. The recently proposed ensemble transform particle filter (ETPF) (S.~Reich, {\it A non-parametric ensemble transform method for Bayesian inference}, SIAM J.~Sci.~Comput., 35, (2013), pp. A2013--A2014) replaces the resampling step…
▽ More
Particle filters (also called sequential Monte Carlo methods) are widely used for state and parameter estimation problems in the context of nonlinear evolution equations. The recently proposed ensemble transform particle filter (ETPF) (S.~Reich, {\it A non-parametric ensemble transform method for Bayesian inference}, SIAM J.~Sci.~Comput., 35, (2013), pp. A2013--A2014) replaces the resampling step of a standard particle filter by a linear transformation which allows for a hybridization of particle filters with ensemble Kalman filters and renders the resulting hybrid filters applicable to spatially extended systems. However, the linear transformation step is computationally expensive and leads to an underestimation of the ensemble spread for small and moderate ensemble sizes. Here we address both of these shortcomings by developing second-order accurate extensions of the ETPF. These extensions allow one in particular to replace the exact solution of a linear transport problem by its Sinkhorn approximation. It is also demonstrated that the nonlinear ensemble transform filter (NETF) arises as a special case of our general framework. We illustrate the performance of the second-order accurate filters for the chaotic Lorenz-63 and Lorenz-96 models and a dynamic scene-viewing model. The numerical results for the Lorenz-63 and Lorenz-96 models demonstrate that significant accuracy improvements can be achieved in comparison to a standard ensemble Kalman filter and the ETPF for small to moderate ensemble sizes. The numerical results for the scene-viewing model reveal, on the other hand, that second-order corrections can lead to statistically inconsistent samples from the posterior parameter distribution.
△ Less
Submitted 10 April, 2017; v1 submitted 29 August, 2016;
originally announced August 2016.