-
Learning algorithms for mean field optimal control
Authors:
H. Mete Soner,
Josef Teichmann,
Qinxin Yan
Abstract:
We analyze an algorithm to numerically solve mean-field optimal control problems by approximating the optimal feedback controls using neural networks with problem-specific architectures. We approximate the model by an $N$-particle system and leverage the exchangeability of the particles to obtain substantial computational efficiency. In addition to several numerical examples, a convergence analysis is provided. We also develop a universal approximation theorem on Wasserstein spaces.
Submitted 22 March, 2025;
originally announced March 2025.
-
Universal approximation property of neural stochastic differential equations
Authors:
Anna P. Kwossek,
David J. Prömel,
Josef Teichmann
Abstract:
We identify various classes of neural networks that are able to approximate continuous functions locally uniformly subject to fixed global linear growth constraints. For such neural networks the associated neural stochastic differential equations can approximate general stochastic differential equations, both of Itô diffusion type, arbitrarily well. Moreover, quantitative error estimates are derived for stochastic differential equations with sufficiently regular coefficients.
Submitted 20 March, 2025;
originally announced March 2025.
-
Signature Reconstruction from Randomized Signatures
Authors:
Mie Glückstad,
Nicola Muca Cirone,
Josef Teichmann
Abstract:
Controlled ordinary differential equations driven by continuous bounded variation curves can be considered a continuous time analogue of recurrent neural networks for the construction of expressive features of the input curves. We ask to what extent well-known signature features of such curves can be reconstructed from controlled ordinary differential equations with (untrained) random vector fields. The answer turns out to be algebraically involved, but essentially the number of signature features that can be reconstructed from the non-linear flow of the controlled ordinary differential equation is exponential in its hidden dimension when the vector fields are chosen to be neural with depth two. Moreover, we characterize a general linear independence condition on arbitrary vector fields, under which the signature features up to some fixed order can always be reconstructed. Algebraically speaking, this complements in a quantitative manner several well-known results from the theory of Lie algebras of vector fields and puts them in a context of machine learning.
Submitted 5 February, 2025;
originally announced February 2025.
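For readers unfamiliar with the signature features mentioned in the abstract above, the following is a minimal sketch, not taken from the paper, of how the first two signature levels of a piecewise-linear (hence bounded variation) curve can be computed. The function name `signature_level2` and the example path are illustrative assumptions; the update rule is Chen's identity applied one linear segment at a time.

```python
def signature_level2(path):
    """Levels 1 and 2 of the signature of a piecewise-linear path.

    `path` is a list of points, each a list of d floats.
    (Hypothetical helper for illustration only.)
    """
    d = len(path[0])
    s1 = [0.0] * d                       # level 1: total increment
    s2 = [[0.0] * d for _ in range(d)]   # level 2: iterated integrals
    for p, q in zip(path, path[1:]):
        dx = [qi - pi for pi, qi in zip(p, q)]
        # Chen's identity for appending one linear segment:
        # S2 <- S2 + S1 (x) dx + (dx (x) dx) / 2
        for i in range(d):
            for j in range(d):
                s2[i][j] += s1[i] * dx[j] + 0.5 * dx[i] * dx[j]
        # Level 1 is updated last so the line above uses the old S1.
        for i in range(d):
            s1[i] += dx[i]
    return s1, s2


# An L-shaped path in the plane: right by 1, then up by 1.
s1, s2 = signature_level2([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]])
# The antisymmetric part of level 2 is the Levy area of the path.
levy_area = 0.5 * (s2[0][1] - s2[1][0])
```

The symmetric part of level 2 carries no new information (it equals half the outer product of the level-1 increment with itself), which is one reason reconstruction questions such as those in the abstract are algebraically delicate.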
-
Learning Chaotic Systems and Long-Term Predictions with Neural Jump ODEs
Authors:
Florian Krach,
Josef Teichmann
Abstract:
The Path-dependent Neural Jump ODE (PD-NJ-ODE) is a model for online prediction of generic (possibly non-Markovian) stochastic processes with irregular (in time) and potentially incomplete (with respect to coordinates) observations. It is a model for which convergence to the $L^2$-optimal predictor, which is given by the conditional expectation, is established theoretically. Thereby, the training of the model is solely based on a dataset of realizations of the underlying stochastic process, without the need of knowledge of the law of the process. In the case where the underlying process is deterministic, the conditional expectation coincides with the process itself. Therefore, this framework can equivalently be used to learn the dynamics of ODE or PDE systems solely from realizations of the dynamical system with different initial conditions. We showcase the potential of our method by applying it to the chaotic system of a double pendulum. When training the standard PD-NJ-ODE method, we see that the prediction starts to diverge from the true path after about half of the evaluation time. In this work we enhance the model with two novel ideas, which independently of each other improve the performance of our modelling setup. The resulting dynamics match the true dynamics of the chaotic system very closely. The same enhancements can be used to provably enable the PD-NJ-ODE to learn long-term predictions for general stochastic datasets, where the standard model fails. This is verified in several experiments.
Submitted 26 July, 2024;
originally announced July 2024.
-
Ramifications of generalized Feller theory
Authors:
Christa Cuchiero,
Tonio Möllmann,
Josef Teichmann
Abstract:
Generalized Feller theory provides an important analog to Feller theory beyond locally compact state spaces. This is very useful for solutions of certain stochastic partial differential equations, Markovian lifts of fractional processes, or infinite dimensional affine and polynomial processes which appear prominently in the theory of signature stochastic differential equations. We extend several folklore results related to generalized Feller processes, in particular on their construction and path properties, and provide the often quite sophisticated proofs in full detail. We also introduce the new concept of extended Feller processes and compare them with standard and generalized ones. A key example relates generalized Feller semigroups of algebra homomorphisms via the method of characteristics to transport equations and continuous semiflows on weighted spaces, i.e. a remarkably generic way to treat differential equations on weighted spaces. We also provide a counterexample, which shows that no condition of the basic definition of generalized Feller semigroups can be dropped.
Submitted 7 August, 2023;
originally announced August 2023.
-
Extending Path-Dependent NJ-ODEs to Noisy Observations and a Dependent Observation Framework
Authors:
William Andersson,
Jakob Heiss,
Florian Krach,
Josef Teichmann
Abstract:
The Path-Dependent Neural Jump Ordinary Differential Equation (PD-NJ-ODE) is a model for predicting continuous-time stochastic processes with irregular and incomplete observations. In particular, the method learns optimal forecasts given irregularly sampled time series of incomplete past observations. So far the process itself and the coordinate-wise observation times were assumed to be independent and observations were assumed to be noiseless. In this work we discuss two extensions to lift these restrictions and provide theoretical guarantees as well as empirical examples for them. In particular, we can lift the assumption of independence by extending the theory to much more realistic settings of conditional independence without any need to change the algorithm. Moreover, we introduce a new loss function, which allows us to deal with noisy observations and explain why the previously used loss function did not lead to a consistent estimator.
Submitted 5 February, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Global universal approximation of functional input maps on weighted spaces
Authors:
Christa Cuchiero,
Philipp Schmocker,
Josef Teichmann
Abstract:
We introduce so-called functional input neural networks defined on a possibly infinite dimensional weighted space with values also in a possibly infinite dimensional output space. To this end, we use an additive family to map the input weighted space to the hidden layer, on which a non-linear scalar activation function is applied to each neuron, and finally return the output via some linear readouts. Relying on Stone-Weierstrass theorems on weighted spaces, we can prove a global universal approximation result on weighted spaces for continuous functions going beyond the usual approximation on compact sets. This then applies in particular to approximation of (non-anticipative) path space functionals via functional input neural networks. As a further application of the weighted Stone-Weierstrass theorem we prove a global universal approximation result for linear functions of the signature. We also introduce the viewpoint of Gaussian process regression in this setting and emphasize that the reproducing kernel Hilbert spaces of the signature kernels are Cameron-Martin spaces of certain Gaussian processes. This paves a way towards uncertainty quantification for signature kernel regression.
Submitted 2 February, 2025; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Signature SDEs from an affine and polynomial perspective
Authors:
Christa Cuchiero,
Sara Svaluto-Ferro,
Josef Teichmann
Abstract:
Signature stochastic differential equations (SDEs) constitute a large class of stochastic processes, here driven by Brownian motions, whose characteristics are linear maps of their own signature, i.e. of iterated integrals of the process with itself, and allow therefore for a generic path dependence. We show that their prolongation with the corresponding signature is an affine and polynomial process taking values in the set of group-like elements of the extended tensor algebra. By relying on the duality theory for affine or polynomial processes, we obtain explicit formulas in terms of converging power series for the Fourier-Laplace transform and the expected value of entire functions of the signature process' marginals. The coefficients of these power series are solutions of extended tensor algebra valued Riccati and linear ordinary differential equations (ODEs), respectively, whose vector fields can be expressed in terms of the characteristics of the corresponding SDEs. We thus construct a class of stochastic processes which is universal (in a sense specified in the introduction) within Ito-diffusions with path-dependent characteristics and allows for an explicit characterization of the Fourier-Laplace transform and hence the full law on path space. The practical applicability of this affine and polynomial approach is illustrated by several numerical examples.
Submitted 3 February, 2025; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Ergodic robust maximization of asymptotic growth under stochastic volatility
Authors:
David Itkin,
Benedikt Koch,
Martin Larsson,
Josef Teichmann
Abstract:
We consider an asymptotic robust growth problem under model uncertainty and in the presence of (non-Markovian) stochastic covariance. We fix two inputs representing the instantaneous covariance for the asset process $X$, which depends on an additional stochastic factor process $Y$, as well as the invariant density of $X$ together with $Y$. The stochastic factor process $Y$ has continuous trajectories but is not even required to be a semimartingale. Our setup allows for drift uncertainty in $X$ and model uncertainty for the local dynamics of $Y$. This work builds upon a recent paper of Kardaras & Robertson, where the authors consider an analogous problem, however, without the additional stochastic factor process. Under suitable, quite weak assumptions we are able to characterize the robust optimal trading strategy and the robust optimal growth rate. The optimal strategy is shown to be functionally generated and, remarkably, does not depend on the factor process $Y$. Our result provides a comprehensive answer to a question proposed by Fernholz in 2002. Mathematically, we use a combination of partial differential equation (PDE), calculus of variations and generalized Dirichlet form techniques.
Submitted 28 November, 2022;
originally announced November 2022.
-
Optimal Estimation of Generic Dynamics by Path-Dependent Neural Jump ODEs
Authors:
Florian Krach,
Marc Nübel,
Josef Teichmann
Abstract:
This paper studies the problem of forecasting general stochastic processes using a path-dependent extension of the Neural Jump ODE (NJ-ODE) framework \citep{herrera2021neural}. While NJ-ODE was the first framework to establish convergence guarantees for the prediction of irregularly observed time series, these results were limited to data stemming from Itô-diffusions with complete observations, in particular Markov processes, where all coordinates are observed simultaneously. In this work, we generalise these results to generic, possibly non-Markovian or discontinuous, stochastic processes with incomplete observations, by utilising the reconstruction properties of the signature transform. These theoretical results are supported by empirical studies, where it is shown that the path-dependent NJ-ODE outperforms the original NJ-ODE framework in the case of non-Markovian data. Moreover, we show that PD-NJ-ODE can be applied successfully to classical stochastic filtering problems and to limit order book (LOB) data.
Submitted 4 July, 2024; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Optimal Stopping via Randomized Neural Networks
Authors:
Calypso Herrera,
Florian Krach,
Pierre Ruyssen,
Josef Teichmann
Abstract:
This paper presents the benefits of using randomized neural networks instead of standard basis functions or deep neural networks to approximate the solutions of optimal stopping problems. The key idea is to use neural networks, where the parameters of the hidden layers are generated randomly and only the last layer is trained, in order to approximate the continuation value. Our approaches are applicable to high dimensional problems where the existing approaches become increasingly impractical. In addition, since our approaches can be optimized using simple linear regression, they are easy to implement and theoretical guarantees can be provided. We test our approaches for American option pricing on Black--Scholes, Heston and rough Heston models and for optimally stopping a fractional Brownian motion. In all cases, our algorithms outperform the state-of-the-art and other relevant machine learning approaches in terms of computation time while achieving comparable results. Moreover, we show that they can also be used to efficiently compute Greeks of American options.
Submitted 1 December, 2023; v1 submitted 28 April, 2021;
originally announced April 2021.
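The key idea in the abstract above (hidden-layer parameters drawn at random, only the last layer trained by linear regression) can be illustrated with a toy sketch. This is not the authors' implementation: the helper names, the tanh activation, the ridge penalty, the one-dimensional state grid and the put-style target standing in for a continuation value are all assumptions made for illustration, and the standard library is used only to keep the example self-contained.

```python
import math
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[piv] = m[piv], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][k] * x[k] for k in range(r + 1, n))) / m[r][r]
    return x


def random_features(x, weights):
    """Hidden layer with fixed random parameters, plus a constant feature."""
    return [1.0] + [math.tanh(a * x + b) for a, b in weights]


def fit_readout(xs, ys, weights, ridge=1e-3):
    """Train only the last (linear) layer, here by ridge regression."""
    feats = [random_features(x, weights) for x in xs]
    k = len(feats[0])
    gram = [[sum(f[i] * f[j] for f in feats) + (ridge if i == j else 0.0)
             for j in range(k)] for i in range(k)]
    rhs = [sum(f[i] * y for f, y in zip(feats, ys)) for i in range(k)]
    return solve(gram, rhs)


def predict(x, weights, readout):
    return sum(w * f for w, f in zip(readout, random_features(x, weights)))


rng = random.Random(0)
# Hidden-layer parameters are sampled once and never trained.
weights = [(rng.gauss(0, 2), rng.gauss(0, 2)) for _ in range(15)]
xs = [0.5 + 0.02 * i for i in range(76)]   # toy states in [0.5, 2.0]
ys = [max(1.0 - x, 0.0) for x in xs]       # toy put-style target, strike 1
readout = fit_readout(xs, ys, weights)

mse_fit = sum((predict(x, weights, readout) - y) ** 2
              for x, y in zip(xs, ys)) / len(xs)
mse_zero = sum(y ** 2 for y in ys) / len(xs)  # error of the all-zero predictor
```

Because the only trained parameters enter linearly, fitting reduces to solving one small linear system, which is the property the abstract credits for ease of implementation. In an actual stopping algorithm a regression of this kind would be repeated backward in time on simulated paths.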
-
A Sobolev rough path extension theorem via regularity structures
Authors:
Chong Liu,
David J. Prömel,
Josef Teichmann
Abstract:
We show that every $\mathbb{R}^d$-valued Sobolev path with regularity $α$ and integrability $p$ can be lifted to a Sobolev rough path provided $α< 1/p<1/3$. The novelty of our approach is its use of ideas underlying Hairer's reconstruction theorem generalized to a framework allowing for Sobolev models and Sobolev modelled distributions. Moreover, we show that the corresponding lifting map is locally Lipschitz continuous with respect to the inhomogeneous Sobolev metric.
Submitted 10 November, 2022; v1 submitted 13 April, 2021;
originally announced April 2021.
-
Discrete-time signatures and randomness in reservoir computing
Authors:
Christa Cuchiero,
Lukas Gonon,
Lyudmila Grigoryeva,
Juan-Pablo Ortega,
Josef Teichmann
Abstract:
A new explanation of the geometric nature of the reservoir computing phenomenon is presented. Reservoir computing is understood in the literature as the possibility of approximating input/output systems with randomly chosen recurrent neural systems and a trained linear readout layer. Light is shed on this phenomenon by constructing what is called strongly universal reservoir systems as random projections of a family of state-space systems that generate Volterra series expansions. This procedure yields a state-affine reservoir system with randomly generated coefficients in a dimension that is logarithmically reduced with respect to the original system. This reservoir system is able to approximate any element in the fading memory filters class just by training a different linear readout for each different filter. Explicit expressions for the probability distributions needed in the generation of the projected reservoir system are stated and bounds for the committed approximation error are provided.
Submitted 17 September, 2020;
originally announced October 2020.
-
Stopper-Controller Games embedded in Single-Player Control Problems
Authors:
Martin Larsson,
Marvin S. Mueller,
Josef Teichmann
Abstract:
In 2002, Benjamin Jourdain and Claude Martini discovered that for a class of payoff functions, the pricing problem for American options can be reduced to pricing of European options for an appropriately associated payoff, all within a Black-Scholes framework. This discovery has been investigated in great detail by Sören Christensen, Jan Kallsen and Matthias Lenga in a recent work in 2020. In the present work we prove that this phenomenon can be observed in a wider context, and even holds true in a setup of non-linear stochastic processes. We analyse this problem from both probabilistic and analytic viewpoints. In the classical situation, Jourdain and Martini used this method to approximate prices of American put options. The broader applicability now potentially covers non-linear frameworks such as model uncertainty and controller-and-stopper-games.
Submitted 16 June, 2020;
originally announced June 2020.
-
Neural Jump Ordinary Differential Equations: Consistent Continuous-Time Prediction and Filtering
Authors:
Calypso Herrera,
Florian Krach,
Josef Teichmann
Abstract:
Combinations of neural ODEs with recurrent neural networks (RNN), such as GRU-ODE-Bayes or ODE-RNN, are well suited to model irregularly observed time series. While those models outperform existing discrete-time approaches, no theoretical guarantees for their predictive capabilities are available. Assuming that the irregularly-sampled time series data originates from a continuous stochastic process, the $L^2$-optimal online prediction is the conditional expectation given the currently available information. We introduce the Neural Jump ODE (NJ-ODE) that provides a data-driven approach to learn, continuously in time, the conditional expectation of a stochastic process. Our approach models the conditional expectation between two observations with a neural ODE and jumps whenever a new observation is made. We define a novel training framework, which allows us to prove theoretical guarantees for the first time. In particular, we show that the output of our model converges to the $L^2$-optimal prediction. This can be interpreted as a solution to a special filtering problem. We provide experiments showing that the theoretical results also hold empirically. Moreover, we experimentally show that our model outperforms the baselines in more complex learning tasks and give comparisons on real-world datasets.
Submitted 16 April, 2021; v1 submitted 8 June, 2020;
originally announced June 2020.
-
On Sobolev rough paths
Authors:
Chong Liu,
David J. Prömel,
Josef Teichmann
Abstract:
We introduce the space of rough paths with Sobolev regularity and the corresponding concept of controlled Sobolev paths. Based on these notions, we study rough path integration and rough differential equations. As the main result, we prove that the solution map associated to differential equations driven by rough paths is a locally Lipschitz continuous map on the Sobolev rough path space for arbitrarily low regularity $α$ and integrability $p$ provided $α>1/p$.
Submitted 3 October, 2020; v1 submitted 5 June, 2020;
originally announced June 2020.
-
A generative adversarial network approach to calibration of local stochastic volatility models
Authors:
Christa Cuchiero,
Wahid Khosrawi,
Josef Teichmann
Abstract:
We propose a fully data-driven approach to calibrate local stochastic volatility (LSV) models, circumventing in particular the ad hoc interpolation of the volatility surface. To achieve this, we parametrize the leverage function by a family of feed-forward neural networks and learn their parameters directly from the available market option prices. This should be seen in the context of neural SDEs and (causal) generative adversarial networks: we generate volatility surfaces by specific neural SDEs, whose quality is assessed by quantifying, possibly in an adversarial manner, distances to market prices. The minimization of the calibration functional relies strongly on a variance reduction technique based on hedging and deep hedging, which is interesting in its own right: it allows the calculation of model prices and model implied volatilities in an accurate way using only small sets of sample paths. For numerical illustration we implement a SABR-type LSV model and conduct a thorough statistical performance analysis on many samples of implied volatility smiles, showing the accuracy and stability of the method.
Submitted 29 September, 2020; v1 submitted 5 May, 2020;
originally announced May 2020.
-
Denise: Deep Robust Principal Component Analysis for Positive Semidefinite Matrices
Authors:
Calypso Herrera,
Florian Krach,
Anastasis Kratsios,
Pierre Ruyssen,
Josef Teichmann
Abstract:
The robust PCA of covariance matrices plays an essential role when isolating key explanatory features. The currently available methods for performing such a low-rank plus sparse decomposition are matrix specific, meaning that those algorithms must be re-run for every new matrix. Since these algorithms are computationally expensive, it is preferable to learn and store a function that nearly instantaneously performs this decomposition when evaluated. Therefore, we introduce Denise, a deep learning-based algorithm for robust PCA of covariance matrices, or more generally, of symmetric positive semidefinite matrices, which learns precisely such a function. Theoretical guarantees for Denise are provided. These include a novel universal approximation theorem adapted to our geometric deep learning problem and convergence to an optimal solution to the learning problem. Our experiments show that Denise matches state-of-the-art performance in terms of decomposition quality, while being approximately $2000\times$ faster than the state-of-the-art, principal component pursuit (PCP), and $200 \times$ faster than the current speed-optimized method, fast PCP.
Submitted 6 June, 2023; v1 submitted 28 April, 2020;
originally announced April 2020.
-
How Implicit Regularization of ReLU Neural Networks Characterizes the Learned Function -- Part I: the 1-D Case of Two Layers with Random First Layer
Authors:
Jakob Heiss,
Josef Teichmann,
Hanna Wutte
Abstract:
In this paper, we consider one dimensional (shallow) ReLU neural networks in which weights are chosen randomly and only the terminal layer is trained. First, we mathematically show that for such networks L2-regularized regression corresponds in function space to regularizing the estimate's second derivative for fairly general loss functionals. For least squares regression, we show that the trained network converges to the smooth spline interpolation of the training data as the number of hidden nodes tends to infinity. Moreover, we derive a novel correspondence between the early stopped gradient descent (without any explicit regularization of the weights) and the smoothing spline regression.
Submitted 4 October, 2023; v1 submitted 7 November, 2019;
originally announced November 2019.
-
Deep neural networks, generic universal interpolation, and controlled ODEs
Authors:
Christa Cuchiero,
Martin Larsson,
Josef Teichmann
Abstract:
A recent paradigm views deep neural networks as discretizations of certain controlled ordinary differential equations, sometimes called neural ordinary differential equations. We make use of this perspective to link expressiveness of deep networks to the notion of controllability of dynamical systems. Using this connection, we study an expressiveness property that we call universal interpolation, and show that it is generic in a certain sense. The universal interpolation property is slightly weaker than universal approximation, and disentangles supervised learning on finite training sets from generalization properties. We also show that universal interpolation holds for certain deep neural networks even if large numbers of parameters are left untrained, and are instead chosen randomly. This lends theoretical support to the observation that training with random initialization can be successful even when most parameters are largely unchanged through the training. Our results also explore what a minimal amount of trainable parameters in neural ordinary differential equations could be without giving up on expressiveness.
Submitted 16 July, 2020; v1 submitted 15 August, 2019;
originally announced August 2019.
-
Markovian lifts of positive semidefinite affine Volterra type processes
Authors:
Christa Cuchiero,
Josef Teichmann
Abstract:
We consider stochastic partial differential equations appearing as Markovian lifts of matrix valued (affine) Volterra type processes from the point of view of the generalized Feller property (see e.g., \cite{doetei:10}). We introduce in particular Volterra Wishart processes with fractional kernels and values in the cone of positive semidefinite matrices. They are constructed from matrix products of infinite dimensional Ornstein-Uhlenbeck processes whose state space consists of matrix valued measures. Parallel to that we also consider positive definite Volterra pure jump processes, giving rise to multivariate Hawkes type processes. We apply these affine covariance processes to multivariate (rough) volatility modeling and introduce a (rough) multivariate Volterra Heston type model.
Submitted 4 September, 2019; v1 submitted 1 July, 2019;
originally announced July 2019.
-
An elementary proof of the reconstruction theorem
Authors:
Harprit Singh,
Josef Teichmann
Abstract:
The reconstruction theorem, a cornerstone of Martin Hairer's theory of regularity structures, appears in this article as the unique extension of the explicitly given reconstruction operator on the set of smooth models due to its inherent Lipschitz properties. This new proof is a direct consequence of constructions of mollification procedures on spaces of models and modelled distributions: more precisely, for an abstract model $Z$ of a given regularity structure, a mollified model is constructed, and additionally, any modelled distribution $f$ can be approximated by elements of a universal subspace of modelled distribution spaces. These considerations yield in particular non-standard approximation results for rough path theory. All results are formulated in a generic $(p,q)$ Besov setting.
Submitted 7 December, 2018;
originally announced December 2018.
-
Optimal extension to Sobolev rough paths
Authors:
Chong Liu,
David J. Prömel,
Josef Teichmann
Abstract:
We show that every $\mathbb{R}^d$-valued Sobolev path with regularity $α$ and integrability $p$ can be lifted to a Sobolev rough path in the sense of T. Lyons provided $α>1/p>0$. Moreover, we prove the existence of unique rough path lifts which are optimal w.r.t. strictly convex functionals among all possible rough path lifts given a Sobolev path. As examples, we consider the rough path lift with minimal Sobolev norm and characterize the Stratonovich rough path lift of a Brownian motion as the optimal lift w.r.t. a suitable convex functional. Generalizations of the results to Besov spaces are briefly discussed.
Submitted 28 April, 2022; v1 submitted 13 November, 2018;
originally announced November 2018.
-
Characterization of non-linear Besov spaces
Authors:
Chong Liu,
David J. Prömel,
Josef Teichmann
Abstract:
The canonical generalizations of two classical norms on Besov spaces are shown to be equivalent even in the case of non-linear Besov spaces, that is, function spaces consisting of functions taking values in a metric space and equipped with some Besov-type topology. The proofs are based on atomic decomposition techniques and metric embeddings. Additionally, we provide embedding results showing how non-linear Besov spaces embed into non-linear $p$-variation spaces and vice versa. We emphasize that we neither assume the UMD property of the involved spaces nor their separability.
Submitted 8 August, 2019; v1 submitted 12 June, 2018;
originally announced June 2018.
-
Generalized Feller processes and Markovian lifts of stochastic Volterra processes: the affine case
Authors:
Christa Cuchiero,
Josef Teichmann
Abstract:
We consider stochastic (partial) differential equations appearing as Markovian lifts of affine Volterra processes with jumps from the point of view of the generalized Feller property which was introduced in e.g.~\cite{doetei:10}. In particular we provide new existence, uniqueness and approximation results for Markovian lifts of affine rough volatility models of general jump diffusion type. We demonstrate that in this Markovian light the theory of stochastic Volterra processes becomes almost classical.
Submitted 2 August, 2019; v1 submitted 27 April, 2018;
originally announced April 2018.
-
Deep Hedging
Authors:
Hans Bühler,
Lukas Gonon,
Josef Teichmann,
Ben Wood
Abstract:
We present a framework for hedging a portfolio of derivatives in the presence of market frictions such as transaction costs, market impact, liquidity constraints or risk limits using modern deep reinforcement machine learning methods.
We discuss how standard reinforcement learning methods can be applied to non-linear reward structures, i.e. in our case convex risk measures. As a general contribution to the use of deep learning for stochastic processes, we also show that the set of constrained trading strategies used by our algorithm is large enough to $ε$-approximate any optimal solution.
Our algorithm can be implemented efficiently even in high-dimensional situations using modern machine learning tools. Its structure does not depend on specific market dynamics, and generalizes across hedging instruments including the use of liquid derivatives. Its computational performance is largely invariant in the size of the portfolio as it depends mainly on the number of hedging instruments available.
We illustrate our approach by showing the effect on hedging under transaction costs in a synthetic market driven by the Heston model, where we outperform the standard "complete market" solution.
Submitted 8 February, 2018;
originally announced February 2018.
-
Linearized Filtering of Affine Processes Using Stochastic Riccati Equations
Authors:
Lukas Gonon,
Josef Teichmann
Abstract:
We consider an affine process $X$ which is only observed up to an additive white noise, and we ask for its law, for some time $t > 0 $, conditional on all observations up to this time $ t $. This is a general, possibly high dimensional filtering problem which is not even locally approximately Gaussian, whence essentially only particle filtering methods remain as solution techniques. In this work we present an efficient numerical solution by introducing an approximate filter for which conditional characteristic functions can be calculated by solving a system of generalized Riccati differential equations depending on the observation and the process characteristics of the signal $X$. The quality of the approximation can be controlled by easily observable quantities in terms of a macro location of the signal in state space. Asymptotic techniques as well as maximization techniques can be directly applied to the solutions of the Riccati equations leading to novel very tractable filtering formulas. The efficiency of the method is illustrated with numerical experiments for Cox-Ingersoll-Ross and Wishart processes, for which Gaussian approximations usually fail.
Submitted 23 January, 2018;
originally announced January 2018.
-
A fundamental theorem of asset pricing for continuous time large financial markets in a two filtration setting
Authors:
Christa Cuchiero,
Irene Klein,
Josef Teichmann
Abstract:
We present a version of the fundamental theorem of asset pricing (FTAP) for continuous time large financial markets with two filtrations in an $L^p$-setting for $1 \leq p < \infty$. This extends the results of Yuri Kabanov and Christophe Stricker \cite{KS:06} to continuous time and to a large financial market setting, while still preserving the simplicity of the discrete time setting. On the other hand it generalizes Stricker's $L^p$-version of FTAP \cite{S:90} towards a setting with two filtrations. We assume neither that price processes are semi-martingales (and this does not follow, due to trading with respect to the \emph{smaller} filtration), nor that price processes have any path properties, nor any other particular property of the two filtrations in question, nor admissibility of portfolio wealth processes. Rather, we aim for a completely general (and realistic) result, where trading strategies are just predictable with respect to a smaller filtration than the one generated by the price processes. Applications include modeling trading with delayed information, trading on different time grids, dealing with inaccurate price information, and randomization approaches to uncertainty.
Submitted 5 May, 2017;
originally announced May 2017.
-
Stochastic Analysis with Modelled Distributions
Authors:
Chong Liu,
David J. Prömel,
Josef Teichmann
Abstract:
Using a Besov topology on spaces of modelled distributions in the framework of Hairer's regularity structures, we prove the reconstruction theorem on these Besov spaces with negative regularity. The Besov spaces of modelled distributions are shown to be UMD Banach spaces and of martingale type $2$. As a consequence, this gives access to a rich stochastic integration theory and to existence and uniqueness results for mild solutions of semilinear stochastic partial differential equations in these spaces of modelled distributions and for distribution-valued SDEs. Furthermore, we provide a Fubini type theorem that allows one to interchange the order of stochastic integration and reconstruction.
Submitted 5 February, 2020; v1 submitted 13 September, 2016;
originally announced September 2016.
-
Parabolic free boundary price formation models under market size fluctuations
Authors:
Peter A. Markowich,
Josef Teichmann,
Marie-Therese Wolfram
Abstract:
In this paper we propose an extension of the Lasry-Lions price formation model which includes fluctuations of the numbers of buyers and vendors. We analyze the model in the case of deterministic and stochastic market size fluctuations and present results on the long time asymptotic behavior and numerical evidence and conjectures on periodic, almost periodic and stochastic fluctuations. The numerical simulations extend the theoretical statements and give further insights into price formation dynamics.
Submitted 15 March, 2016;
originally announced March 2016.
-
Pathwise construction of affine processes
Authors:
Nicoletta Gabrielli,
Josef Teichmann
Abstract:
Based on the theory of multivariate time changes for Markov processes, we show how to identify affine processes as solutions of certain time change equations. The result is a strong version of a theorem presented by J. Kallsen (2006) which provides a representation in law of an affine process as a time-change transformation of a family of independent Lévy processes.
Submitted 25 December, 2014;
originally announced December 2014.
-
A new perspective on the fundamental theorem of asset pricing for large financial markets
Authors:
Christa Cuchiero,
Irene Klein,
Josef Teichmann
Abstract:
In the context of large financial markets we formulate the notion of \emph{no asymptotic free lunch with vanishing risk} (NAFLVR), under which we can prove a version of the fundamental theorem of asset pricing (FTAP) in markets with an (even uncountably) infinite number of assets, as it is for instance the case in bond markets. We work in the general setting of admissible portfolio wealth processes as laid down by Y. Kabanov \cite{kab:97} under a substantially relaxed concatenation property and adapt the FTAP proof variant obtained in \cite{CT:14} for the classical small market situation to large financial markets. In the case of countably many assets, our setting includes the large financial market model considered by M. De Donno et al. \cite{DGP:05} and its abstract integration theory.
The notion of (NAFLVR) turns out to be an economically meaningful "no arbitrage" condition (in particular not involving weak-$*$-closures), and, (NAFLVR) is equivalent to the existence of a separating measure. Furthermore we show -- by means of a counterexample -- that the existence of an equivalent separating measure does not lead to an equivalent $σ$-martingale measure, even in a countable large financial market situation.
Submitted 9 October, 2023; v1 submitted 23 December, 2014;
originally announced December 2014.
-
Discrete Time Term Structure Theory and Consistent Recalibration Models
Authors:
Anja Richter,
Josef Teichmann
Abstract:
We develop theory and applications of forward characteristic processes in discrete time following a seminal paper of Jan Kallsen and Paul Krühner. Particular emphasis is placed on the dynamics of volatility surfaces which can be easily formulated and implemented from the chosen discrete point of view. In mathematical terms we provide an algorithmic answer to the following question: describe a rich, still tractable class of discrete time stochastic processes, whose marginal distributions are given at initial time and which are free of arbitrage. In terms of mathematical finance we can construct models with pre-described (implied) volatility surface and quite general volatility surface dynamics. In terms of the works of Rene Carmona and Sergey Nadtochiy, we analyze the dynamics of tangent affine models. We believe that the discrete approach due to its technical simplicity will be important in term structure modelling.
Submitted 5 September, 2014;
originally announced September 2014.
-
Exotic one-parameter semigroups of endomorphisms of a symmetric cone
Authors:
Bojan Kuzma,
Matjaž Omladič,
Klemen Šivic,
Josef Teichmann
Abstract:
We construct an exotic one-parameter semigroup of endomorphisms of a symmetric cone $C$, whose generator is not the sum of a Lie group generator and an endomorphism of $C$. The question is motivated by the theory of affine processes on symmetric cones, which plays an important role in mathematical finance. On the other hand, the theoretical question that we solve in this paper seems to have been implicitly open for much longer than this motivation suggests.
Submitted 13 August, 2014;
originally announced August 2014.
-
A convergence result for the Emery topology and a variant of the proof of the fundamental theorem of asset pricing
Authors:
Christa Cuchiero,
Josef Teichmann
Abstract:
We show that \emph{No unbounded profit with bounded risk} (NUPBR) implies \emph{predictable uniform tightness} (P-UT), a boundedness property in the Emery topology which has been introduced by C. Stricker \cite{S:85}. Combining this insight with well known results from J. Mémin and L. Słominski \cite{MS:91} leads to a short variant of the proof of the fundamental theorem of asset pricing initially proved by F. Delbaen and W. Schachermayer \cite{DS:94}. The results are formulated in the general setting of admissible portfolio wealth processes as laid down by Y. Kabanov in \cite{kab:97}.
Submitted 10 July, 2014; v1 submitted 20 June, 2014;
originally announced June 2014.
-
The Gärtner-Ellis theorem, homogenization, and affine processes
Authors:
Archil Gulisashvili,
Josef Teichmann
Abstract:
We obtain a first order extension of the large deviation estimates in the Gärtner-Ellis theorem. In addition, for a given family of measures, we find a special family of functions having a similar Laplace principle expansion up to order one to that of the original family of measures. The construction of the special family of functions mentioned above is based on heat kernel expansions. Some of the ideas employed in the paper come from the theory of affine stochastic processes. For instance, we provide an explicit expansion with respect to the homogenization parameter of the rescaled cumulant generating function in the case of a generic continuous affine process. We also compute the coefficients in the homogenization expansion for the Heston model that is one of the most popular stock price models with stochastic volatility.
Submitted 14 June, 2014;
originally announced June 2014.
-
When roll-overs do not qualify as numéraire: bond markets beyond short rate paradigms
Authors:
Irene Klein,
Thorsten Schmidt,
Josef Teichmann
Abstract:
We investigate default-free bond markets where the standard relationship between a possibly existing bank account process and the term structure of bond prices is broken, i.e. the bank account process is not a valid numéraire. We argue that this feature is not the exception but rather the rule in bond markets when starting with, e.g., terminal bonds as numéraires.
Our setting is that of general càdlàg processes as bond prices, where we directly employ methods from large financial markets. Moreover, we do not restrict price processes to be semimartingales, which allows us, for example, to consider markets driven by fractional Brownian motion. In the core of the article we relate the appropriate no arbitrage assumption (NAFL), i.e. no asymptotic free lunch, to the existence of an equivalent local martingale measure with respect to the terminal bond as numéraire, and no arbitrage opportunities of the first kind (NAA1) to the existence of a supermartingale deflator, respectively. In all settings we obtain existence of a generalized bank account as a limit of convex combinations of roll-over bonds.
Additionally we provide an alternative definition of the concept of a numéraire, leading to a possibly interesting connection to bubbles. If we can construct a bank account process through roll-overs, we can relate the impossibility of taking the bank account as numéraire to liquidity effects. Here we enter endogenously the arena of multiple yield curves.
The theory is illustrated by several examples.
Submitted 30 September, 2013;
originally announced October 2013.
-
Fourier transform methods for pathwise covariance estimation in the presence of jumps
Authors:
Christa Cuchiero,
Josef Teichmann
Abstract:
We provide a new non-parametric Fourier procedure to estimate the trajectory of the instantaneous covariance process (from discrete observations of a multidimensional price process) in the presence of jumps, extending the seminal work of Malliavin and Mancino~\cite{MM:02, MM:09}. Our approach relies on a modification of (classical) jump-robust estimators of integrated realized covariance to estimate the Fourier coefficients of the covariance trajectory. Using Fourier-Fejér inversion we reconstruct the path of the instantaneous covariance. We prove consistency and a central limit theorem (CLT), and in particular that the asymptotic estimator variance is smaller by a factor $2/3$ in comparison to classical local estimators.
The procedure is robust enough to allow for an iteration and we can show theoretically and empirically how to estimate the integrated realized covariance of the instantaneous stochastic covariance process. We apply these techniques to robust calibration problems for multivariate modeling in finance, i.e., the selection of a pricing measure by using time series and derivatives' price information simultaneously.
Submitted 20 June, 2014; v1 submitted 16 January, 2013;
originally announced January 2013.
-
Generalizations of Matérn's hard-core point processes
Authors:
Jakob Teichmann,
Felix Ballani,
Karl Gerald van den Boogaart
Abstract:
Matérn's hard-core processes are valuable point process models in spatial statistics. In order to extend their field of application, Matérn's original models are generalized here, both as point processes and particle processes. The thinning rule uses a distance-dependent probability function, which controls deletion of points close together. For this general setting, explicit formulas for first- and second-order characteristics can be given. Two examples from materials science illustrate the application of the models.
Submitted 12 September, 2012;
originally announced September 2012.
-
Invariant manifolds with boundary for jump-diffusions
Authors:
Damir Filipovic,
Stefan Tappe,
Josef Teichmann
Abstract:
We provide necessary and sufficient conditions for stochastic invariance of finite dimensional submanifolds with boundary in Hilbert spaces for stochastic partial differential equations driven by Wiener processes and Poisson random measures.
Submitted 20 June, 2014; v1 submitted 6 February, 2012;
originally announced February 2012.
-
Cubature Methods For Stochastic (Partial) Differential Equations In Weighted Spaces
Authors:
Philipp Doersek,
Josef Teichmann,
Dejan Veluscek
Abstract:
The cubature on Wiener space method, a high-order weak approximation scheme, is established for SPDEs in the case of unbounded characteristics and unbounded payoffs. We first introduce a recently described flexible functional analytic framework, so called weighted spaces, where Feller-like properties hold. A refined analysis of vector fields on weighted spaces then yields optimal convergence rates of cubature methods for stochastic partial differential equations of Da Prato-Zabczyk type. The ubiquitous stability for the local approximation operator within the functional analytic setting is proved for SPDEs, however, in the infinite dimensional case we need a newly introduced assumption on weak symmetry of the cubature formula. In finite dimensions, we use the UFG condition to obtain optimal rates of convergence on non-uniform meshes for nonsmooth payoffs with exponential growth.
Submitted 19 January, 2012;
originally announced January 2012.
-
Efficient simulation and calibration of general HJM models by splitting schemes
Authors:
Philipp Doersek,
Josef Teichmann
Abstract:
We introduce efficient numerical methods for generic HJM equations of interest rate theory by means of high-order weak approximation schemes. These schemes allow for QMC implementations due to the relatively low dimensional integration space. The complexity of the resulting algorithm is considerably lower than the complexity of multi-level MC algorithms as long as the optimal order of QMC-convergence is guaranteed. In order to make the methods applicable to real world problems, we introduce and use the setting of weighted function spaces, such that unbounded payoffs and unbounded characteristics of the equations in question are still allowed. We also provide an implementation, where we efficiently calibrate an HJM equation to caplet data.
Submitted 22 December, 2011;
originally announced December 2011.
-
Affine processes on symmetric cones
Authors:
Christa Cuchiero,
Martin Keller-Ressel,
Eberhard Mayerhofer,
Josef Teichmann
Abstract:
We consider affine Markov processes taking values in convex cones. In particular, we characterize all affine processes taking values in an irreducible symmetric cone in terms of certain Lévy-Khintchine triplets. This is the complete classification of affine processes on these conic state spaces, thus extending the theory of Wishart processes on positive semidefinite matrices, as put forward by Bru (1991).
Submitted 6 December, 2011;
originally announced December 2011.
-
Path properties and regularity of affine processes on general state spaces
Authors:
Christa Cuchiero,
Josef Teichmann
Abstract:
We provide a new proof for regularity of affine processes on general state spaces by methods from the theory of Markovian semimartingales. On the way to this result we also show that the definition of an affine process, namely as a stochastically continuous time-homogeneous Markov process with exponentially affine Fourier-Laplace transform, already implies the existence of a càdlàg version. This was one of the last open issues in the foundations of affine processes.
Submitted 16 January, 2013; v1 submitted 8 July, 2011;
originally announced July 2011.
-
Regularity of affine processes on general state spaces
Authors:
Martin Keller-Ressel,
Walter Schachermayer,
Josef Teichmann
Abstract:
We consider a stochastically continuous, affine Markov process in the sense of Duffie, Filipović and Schachermayer, with càdlàg paths, on a general state space $D$, i.e. an arbitrary Borel subset of $\mathbb{R}^d$. We show that such a process is always regular, meaning that its Fourier-Laplace transform is differentiable in time, with derivatives that are continuous in the transform variable. As a consequence, we show that generalized Riccati equations and Lévy-Khintchine parameters for the process can be derived, as in the case of $D = \mathbb{R}_+^m \times \mathbb{R}^n$ studied in Duffie, Filipović and Schachermayer (2003). Moreover, we show that when the killing rate is zero, the affine process is a semimartingale with absolutely continuous characteristics up to its time of explosion. Our results generalize the results of Keller-Ressel, Schachermayer and Teichmann (2011) for the state space $\mathbb{R}_+^m \times \mathbb{R}^n$ and provide a new probabilistic approach to regularity.
Submitted 22 May, 2012; v1 submitted 3 May, 2011;
originally announced May 2011.
-
A Semigroup Point Of View On Splitting Schemes For Stochastic (Partial) Differential Equations
Authors:
Philipp Doersek,
Josef Teichmann
Abstract:
We construct normed spaces of real-valued functions with controlled growth on possibly infinite-dimensional state spaces such that semigroups of positive, bounded operators $(P_t)_{t\ge 0}$ thereon with $\lim_{t\to 0+}P_t f(x)=f(x)$ are in fact strongly continuous. This result is used to prove optimal rates of convergence of splitting schemes for stochastic (partial) differential equations with linearly growing characteristics and for sets of functions with controlled growth. Applications are general Da Prato-Zabczyk type equations and the HJM equations from interest rate theory.
Submitted 11 November, 2010;
originally announced November 2010.
-
A new extrapolation method for weak approximation schemes with applications
Authors:
Kojiro Oshima,
Josef Teichmann,
Dejan Veluscek
Abstract:
We review Fujiwara's scheme, a sixth-order weak approximation scheme for the numerical approximation of SDEs, and embed it into a general method to construct weak approximation schemes of order $2m$ for $m \in \mathbf{N}$. Those schemes cannot be seen as cubature schemes, but rather as universal ways to extrapolate from a lower-order weak approximation scheme, namely the Ninomiya-Victoir scheme, to higher orders.
Submitted 23 November, 2009;
originally announced November 2009.
-
Affine processes on positive semidefinite matrices
Authors:
Christa Cuchiero,
Damir Filipović,
Eberhard Mayerhofer,
Josef Teichmann
Abstract:
This article provides the mathematical foundation for stochastically continuous affine processes on the cone of positive semidefinite symmetric matrices. This analysis has been motivated by a large and growing use of matrix-valued affine processes in finance, including multi-asset option pricing with stochastic volatility and correlation structures, and fixed-income models with stochastically correlated risk factors and default intensities.
Submitted 11 April, 2011; v1 submitted 1 October, 2009;
originally announced October 2009.
-
Another approach to some rough and stochastic partial differential equations
Authors:
Josef Teichmann
Abstract:
In this note we introduce a new approach to rough and stochastic partial differential equations (RPDEs and SPDEs): we consider general Banach spaces as state spaces and -- for the sake of simplicity -- finite-dimensional sources of noise, either rough or stochastic. By means of a time-dependent transformation of state space and rough path theory we are able to construct unique solutions of the respective RPDEs and SPDEs. As a consequence of our construction we can apply the pool of results of rough path theory; in particular, we obtain strong and weak numerical schemes of high order converging to the solution process.
Submitted 19 August, 2009;
originally announced August 2009.
-
Affine processes are regular
Authors:
Martin Keller-Ressel,
Walter Schachermayer,
Josef Teichmann
Abstract:
We show that stochastically continuous, time-homogeneous affine processes on the canonical state space $\mathbb{R}_+^m \times \mathbb{R}^n$ are always regular. In the paper of Duffie, Filipović and Schachermayer (2003) regularity was used as a crucial basic assumption. It was left open whether this regularity condition is automatically satisfied for stochastically continuous affine processes. We now show that the regularity assumption is indeed superfluous, since regularity follows from stochastic continuity and the exponentially affine behavior of the characteristic function. For the proof we combine classic results on the differentiability of transformation semigroups with the method of the moving frame which has been recently found to be useful in the theory of SPDEs.
Submitted 11 February, 2010; v1 submitted 18 June, 2009;
originally announced June 2009.