-
Convergence of the Markovian iteration for coupled FBSDEs via a differentiation approach
Authors:
Zhipeng Huang,
Cornelis W. Oosterlee
Abstract:
In this paper, we investigate the Markovian iteration method for solving coupled forward-backward stochastic differential equations (FBSDEs) featuring a fully coupled forward drift, meaning the drift term explicitly depends on both the forward and backward processes. An FBSDE system typically involves three stochastic processes: the forward process $X$, the backward process $Y$ representing the solution, and the $Z$ process corresponding to the scaled derivative of $Y$. Prior research by Bender and Zhang (2008) has established convergence results for iterative schemes dealing with $Y$-coupled FBSDEs. However, extending these results to equations with $Z$ coupling poses significant challenges, especially in uniformly controlling the Lipschitz constant of the decoupling fields across iterations and time steps within a fixed-point framework.
To overcome this issue, we propose a novel differentiation-based method for handling the $Z$ process. This approach enables improved management of the Lipschitz continuity of decoupling fields, facilitating the well-posedness of the discretized FBSDE system with fully coupled drift. We rigorously prove the convergence of our Markovian iteration method in this more complex setting. Finally, numerical experiments confirm our theoretical insights, showcasing the effectiveness and accuracy of the proposed methodology.
Submitted 3 April, 2025;
originally announced April 2025.
-
The deep multi-FBSDE method: a robust deep learning method for coupled FBSDEs
Authors:
Kristoffer Andersson,
Adam Andersson,
Cornelis W. Oosterlee
Abstract:
We introduce the deep multi-FBSDE method for robust approximation of coupled forward-backward stochastic differential equations (FBSDEs), focusing on cases where the deep BSDE method of Han, Jentzen, and E (2018) fails to converge. To overcome the convergence issues, we consider a family of FBSDEs that are equivalent to the original problem in the sense that they satisfy the same associated partial differential equation (PDE). Our algorithm proceeds in two phases: first, we approximate the initial condition for the FBSDE family, and second, we approximate the original FBSDE using the initial condition approximated in the first phase. Numerical experiments show that our method converges even when the standard deep BSDE method does not.
Submitted 31 May, 2025; v1 submitted 17 March, 2025;
originally announced March 2025.
-
A deep BSDE approach for the simultaneous pricing and delta-gamma hedging of large portfolios consisting of high-dimensional multi-asset Bermudan options
Authors:
Balint Negyesi,
Cornelis W. Oosterlee
Abstract:
A deep BSDE approach is presented for the pricing and delta-gamma hedging of high-dimensional Bermudan options, with applications in portfolio risk management. Large portfolios of a mixture of multi-asset European and Bermudan derivatives are cast into the framework of discretely reflected BSDEs. This system is discretized by the One Step Malliavin scheme (Negyesi et al. [2024, 2025]) of discretely reflected Markovian BSDEs, which involves a $Γ$ process, corresponding to second-order sensitivities of the associated option prices. The discretized system is solved by a neural network regression Monte Carlo method, efficiently for a large number of underlyings. The resulting option Deltas and Gammas are used to discretely rebalance the corresponding replicating strategies. Numerical experiments are presented on both high-dimensional basket options and large portfolios consisting of multiple options with varying early exercise rights, moneyness and volatility. These examples demonstrate the robustness and accuracy of the method up to $100$ risk factors. The resulting hedging strategies significantly outperform benchmark methods both in the case of standard delta- and delta-gamma hedging.
Submitted 17 February, 2025;
originally announced February 2025.
-
A numerical Fourier cosine expansion method with higher order Taylor schemes for fully coupled FBSDEs
Authors:
Balint Negyesi,
Cornelis W. Oosterlee
Abstract:
A higher-order numerical method is presented for scalar-valued, coupled forward-backward stochastic differential equations. Unlike most classical references, the forward component is not only discretized by an Euler-Maruyama approximation but also by higher-order Taylor schemes. This includes the famous Milstein scheme, providing an improved strong convergence rate of order 1; and the simplified order 2.0 weak Taylor scheme exhibiting weak convergence rate of order 2. In order to have a fully implementable scheme in case of these higher-order Taylor approximations, which involve the derivatives of the decoupling fields, we use the COS method built on Fourier cosine expansions to approximate the conditional expectations arising from the numerical approximation of the backward component. Even though higher-order numerical approximations for the backward equation are deeply studied in the literature, to the best of our understanding, the present numerical scheme is the first that achieves strong convergence of order 1 for the whole coupled system, including the forward equation, which is often the main interest in applications such as stochastic control. Numerical experiments demonstrate the claimed higher-order convergence, both in case of strong and weak convergence rates, for various equations ranging from decoupled to the fully-coupled settings.
Submitted 19 January, 2025;
originally announced January 2025.
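As a rough illustration of the strong-order difference mentioned in the abstract above, the following sketch compares Euler-Maruyama and Milstein on a plain forward SDE. It is not the paper's coupled FBSDE scheme and does not use the COS method; the test equation (geometric Brownian motion), the parameter values, and the function names `simulate_gbm` and `strong_error` are assumptions chosen only for illustration.

```python
import math
import random


def simulate_gbm(mu, sigma, x0, T, n_steps, rng, scheme="milstein"):
    """Simulate dX = mu*X dt + sigma*X dW on [0, T].

    Returns (approximate X_T, exact X_T) driven by the same Brownian path.
    """
    dt = T / n_steps
    x = x0
    w = 0.0  # running Brownian motion W_t, needed for the exact solution
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))
        step = mu * x * dt + sigma * x * dw  # Euler-Maruyama increment
        if scheme == "milstein":
            # Milstein correction 0.5 * b(x) * b'(x) * (dW^2 - dt), b(x) = sigma*x
            step += 0.5 * sigma * sigma * x * (dw * dw - dt)
        x += step
        w += dw
    exact = x0 * math.exp((mu - 0.5 * sigma ** 2) * T + sigma * w)
    return x, exact


def strong_error(scheme, n_steps, n_paths=2000, seed=0):
    """Monte Carlo estimate of E|X_T^approx - X_T| for the given scheme."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_paths):
        approx, exact = simulate_gbm(0.05, 0.5, 1.0, 1.0, n_steps, rng, scheme)
        total += abs(approx - exact)
    return total / n_paths


if __name__ == "__main__":
    for n in (16, 32, 64, 128):
        print(n, strong_error("euler", n), strong_error("milstein", n))
```

With the same seed and step count both schemes see identical Brownian increments, so the two error columns are directly comparable; the Milstein error should shrink roughly in proportion to the step size, the Euler-Maruyama error roughly with its square root.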
-
Modeling and Replication of the Prepayment Option of Mortgages including Behavioral Uncertainty
Authors:
Leonardo Perotti,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
Prepayment risk embedded in fixed-rate mortgages forms a significant fraction of a financial institution's exposure, and it receives particular attention because of the magnitude of the underlying market. The embedded prepayment option (EPO) bears the same interest rate risk as an exotic interest rate swap (IRS) with a suitable stochastic notional. We investigate the effect of relaxing the assumption of a deterministic relationship between the market interest rate incentive and the prepayment rate. A non-hedgeable risk factor is modeled to capture the uncertainty in mortgage owners' behavior, leading to an incomplete market. We prove under natural assumptions that including behavioral uncertainty reduces the exposure's value. We statically replicate the exposure resulting from the EPO with IRSs and swaptions, and we show that a replication based on swaps solely cannot easily control the right tail of the exposure distribution, while including swaptions enables that. The replication framework is flexible and focuses on different regions in the exposure distribution. Since a non-hedgeable risk factor entails the existence of multiple equivalent martingale measures, pricing and optimal replication are not unique. We investigate the effect of a market price of risk misspecification and we provide a methodology to generate robust hedging strategies. Such strategies, obtained as solutions to a saddle-point problem, allow us to bound the exposure against a misspecification of the pricing measure.
Submitted 28 October, 2024;
originally announced October 2024.
-
The Deep Latent Space Particle Filter for Real-Time Data Assimilation with Uncertainty Quantification
Authors:
Nikolaj T. Mücke,
Sander M. Bohté,
Cornelis W. Oosterlee
Abstract:
In Data Assimilation, observations are fused with simulations to obtain an accurate estimate of the state and parameters for a given physical system. Combining data with a model, however, while accurately estimating uncertainty, is computationally expensive and infeasible to run in real-time for complex systems. Here, we present a novel particle filter methodology, the Deep Latent Space Particle filter or D-LSPF, that uses neural network-based surrogate models to overcome this computational challenge. The D-LSPF enables filtering in the low-dimensional latent space obtained using Wasserstein AEs with modified vision transformer layers for dimensionality reduction and transformers for parameterized latent space time stepping. As we demonstrate on three test cases, including leak localization in multi-phase pipe flow and seabed identification for fully nonlinear water waves, the D-LSPF runs orders of magnitude faster than a high-fidelity particle filter and 3-5 times faster than alternative methods while being up to an order of magnitude more accurate. The D-LSPF thus enables real-time data assimilation with uncertainty quantification for physical systems.
Submitted 4 June, 2024;
originally announced June 2024.
-
Parallel-in-Time Iterative Methods for Pricing American Options
Authors:
Xian-Ming Gu,
Jun Liu,
Cornelis W. Oosterlee
Abstract:
For pricing American options, after suitable discretization in space and time, a sequence of discrete linear complementarity problems (LCPs) or equivalently Hamilton-Jacobi-Bellman (HJB) equations needs to be solved in a sequential time-stepping manner. In each time step, the policy iteration or its penalty variant is often applied due to their fast convergence rates. In this paper, we aim to solve for all time steps simultaneously, by applying the policy iteration to an "all-at-once form" of the HJB equations, where two different parallel-in-time preconditioners are proposed to accelerate the solution of the linear systems within the policy iteration. Our proposed methods are generally applicable for such all-at-once forms of the HJB equation, arising from option pricing problems with optimal stopping and nontrivial underlying asset models. Numerical examples are presented to show the feasibility and robust convergence behavior of the proposed methodology.
Submitted 13 May, 2024;
originally announced May 2024.
-
Generalized convergence of the deep BSDE method: a step towards fully-coupled FBSDEs and applications in stochastic control
Authors:
Balint Negyesi,
Zhipeng Huang,
Cornelis W. Oosterlee
Abstract:
We are concerned with high-dimensional coupled FBSDE systems approximated by the deep BSDE method of Han et al. (2018). It was shown by Han and Long (2020) that the errors induced by the deep BSDE method admit an a posteriori estimate depending on the loss function, whenever the backward equation only couples into the forward diffusion through the Y process. We generalize this result to drift coefficients that may also depend on Z, and give sufficient conditions for convergence under standard assumptions. The resulting conditions are directly verifiable for any equation. Consequently, unlike in earlier theory, our convergence analysis enables the treatment of FBSDEs stemming from stochastic optimal control problems. In particular, we provide a theoretical justification for the non-convergence of the deep BSDE method observed in recent literature, and present direct guidelines for when convergence can be guaranteed in practice. Our theoretical findings are supported by several numerical experiments in high-dimensional settings.
Submitted 19 January, 2025; v1 submitted 27 March, 2024;
originally announced March 2024.
-
On the Hull-White model with volatility smile for Valuation Adjustments
Authors:
T. van der Zwaard,
L. A. Grzelak,
C. W. Oosterlee
Abstract:
Affine Diffusion dynamics are frequently used for Valuation Adjustments (xVA) calculations due to their analytic tractability. However, these models cannot capture the market-implied skew and smile, which are relevant when computing xVA metrics. Hence, additional degrees of freedom are required to capture these market features. In this paper, we address this through an SDE with state-dependent coefficients. The SDE is consistent with the convex combination of a finite number of different AD dynamics. We combine Hull-White one-factor models where one model parameter is varied. We use the Randomized AD (RAnD) technique to parameterize the combination of dynamics. We refer to our SDE with state-dependent coefficients and the RAnD parametrization of the original models as the rHW model. The rHW model allows for efficient semi-analytic calibration to European swaptions through the analytic tractability of the Hull-White dynamics. We use a regression-based Monte-Carlo simulation to calculate exposures. In this setting, we demonstrate the significant effect of skew and smile on exposures and xVAs of linear and early-exercise interest rate derivatives.
Submitted 21 March, 2024;
originally announced March 2024.
-
Convergence of the deep BSDE method for stochastic control problems formulated through the stochastic maximum principle
Authors:
Zhipeng Huang,
Balint Negyesi,
Cornelis W. Oosterlee
Abstract:
It is well-known that decision-making problems from stochastic control can be formulated by means of a forward-backward stochastic differential equation (FBSDE). Recently, Ji et al. (2022) proposed an efficient deep learning algorithm based on the stochastic maximum principle (SMP). In this paper, we provide a convergence result for this deep SMP-BSDE algorithm and compare its performance with other existing methods. In particular, by adopting a strategy as in Han and Long (2020), we derive an a posteriori estimate, and show that the total approximation error can be bounded by the value of the loss functional and the discretization error. We present numerical examples for high-dimensional stochastic control problems, both in case of drift- and diffusion control, which showcase superior performance compared to existing algorithms.
Submitted 31 July, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
A parallel preconditioner for the all-at-once linear system from evolutionary PDEs with Crank-Nicolson discretization
Authors:
Yong-Liang Zhao,
Xian-Ming Gu,
Cornelis W. Oosterlee
Abstract:
The Crank-Nicolson (CN) method is a well-known time integrator for evolutionary partial differential equations (PDEs) arising in many real-world applications. Since the solution at any time depends on the solution at previous time steps, the CN method is inherently difficult to parallelize. In this paper, we consider a parallel method for the solution of evolutionary PDEs with the CN scheme. Using an all-at-once approach, we can solve for all time steps simultaneously using a parallelizable-over-time preconditioner within a standard iterative method. Due to the diagonalization of the proposed preconditioner, we can prove that most eigenvalues of preconditioned matrices are equal to 1 and the others lie in the set $\left\{z\in\mathbb{C}: 1/(1 + α) < |z| < 1/(1 - α)~{\rm and}~\Re{\rm e}(z) > 0\right\}$, where $0 < α < 1$ is a free parameter. Besides, the efficient implementation of the proposed preconditioner is described. Given certain conditions, we prove that the preconditioned GMRES method exhibits a mesh-independent convergence rate. Finally, we verify both the theoretical findings and the efficacy of the proposed preconditioner via numerical experiments on financial option pricing PDEs.
Submitted 11 February, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
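To make the sequential dependence described in the abstract above concrete, here is a minimal sketch of classical Crank-Nicolson time stepping for the 1D heat equation. It shows only the standard step-by-step scheme that an all-at-once formulation aims to avoid; it is not the paper's preconditioner. The model problem, the grid sizes, and the names `solve_tridiagonal` and `crank_nicolson_heat` are assumptions made for illustration.

```python
import math


def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system.

    lower[0] and upper[-1] do not affect the result.
    """
    n = len(rhs)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def crank_nicolson_heat(n_space, n_time, T):
    """Solve u_t = u_xx on (0, 1) with u(0) = u(1) = 0 and u(x, 0) = sin(pi x).

    Returns (interior grid points, solution values at time T).
    """
    h = 1.0 / n_space
    dt = T / n_time
    r = dt / h ** 2
    xs = [i * h for i in range(1, n_space)]  # interior nodes only
    u = [math.sin(math.pi * x) for x in xs]
    m = len(u)
    # Left-hand matrix I - (r/2) * A, with A = tridiag(1, -2, 1)
    lower = [-0.5 * r] * m
    diag = [1.0 + r] * m
    upper = [-0.5 * r] * m
    for _ in range(n_time):  # each step needs the previous one: sequential in time
        rhs = []
        for i in range(m):
            left = u[i - 1] if i > 0 else 0.0
            right = u[i + 1] if i < m - 1 else 0.0
            # Right-hand side (I + (r/2) * A) u^n
            rhs.append(u[i] + 0.5 * r * (left - 2.0 * u[i] + right))
        u = solve_tridiagonal(lower, diag, upper, rhs)
    return xs, u


if __name__ == "__main__":
    xs, u = crank_nicolson_heat(50, 50, 0.1)
    exact = [math.exp(-math.pi ** 2 * 0.1) * math.sin(math.pi * x) for x in xs]
    print(max(abs(a - b) for a, b in zip(u, exact)))
```

The time loop cannot start step n+1 before step n has finished, which is the bottleneck the abstract refers to; stacking all time levels into one large linear system is what opens the door to parallel-in-time preconditioning.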
-
Principal Component Copulas for Capital Modelling and Systemic Risk
Authors:
K. B. Gubbels,
J. Y. Ypma,
C. W. Oosterlee
Abstract:
We introduce a class of copulas that we call Principal Component Copulas (PCCs). This class combines the strong points of copula-based techniques with principal component-based models, which results in flexibility when modelling tail dependence along the most important directions in high-dimensional data. We obtain theoretical results for PCCs that are important for practical applications. In particular, we derive tractable expressions for the high-dimensional copula density, which can be represented in terms of characteristic functions. We also develop algorithms to perform Maximum Likelihood and Generalized Method of Moment estimation in high-dimensions and show very good performance in simulation experiments. Finally, we apply the copula to the international stock market in order to study systemic risk. We find that PCCs lead to excellent performance on measures of systemic risk due to their ability to distinguish between parallel market movements, which increase systemic risk, and orthogonal movements, which reduce systemic risk. As a result, we consider the PCC promising for internal capital models, which financial institutions use to protect themselves against systemic risk.
Submitted 10 December, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Energy-stable discretization of the one-dimensional two-fluid model
Authors:
J. F. H. Buist,
B. Sanderse,
S. Dubinkina,
C. W. Oosterlee,
R. A. W. M. Henkes
Abstract:
In this paper we present a complete framework for the energy-stable simulation of stratified incompressible flow in channels, using the one-dimensional two-fluid model. Building on earlier energy-conserving work on the basic two-fluid model, our new framework includes diffusion, friction, and surface tension. We show that surface tension can be added in an energy-conserving manner, and that diffusion and friction have a strictly dissipative effect on the energy.
We then propose spatial discretizations for these terms such that a semi-discrete model is obtained that has the same conservation properties as the continuous model. Additionally, we propose a new energy-stable advective flux scheme that is energy-conserving in smooth regions of the flow and strictly dissipative where sharp gradients appear. This is obtained by combining, using flux limiters, a previously developed energy-conserving advective flux with a novel first-order upwind scheme that is shown to be strictly dissipative.
The complete framework, with diffusion, surface tension, and a bounded energy, is linearly stable to short wavelength perturbations, and exhibits nonlinear damping near shocks. The model yields smoothly converging numerical solutions, even under conditions for which the basic two-fluid model is ill-posed. With our explicit expressions for the dissipation rates, we are able to attribute the nonlinear damping to the different dissipation mechanisms, and compare their effects.
Submitted 21 October, 2023;
originally announced October 2023.
-
D-TIPO: Deep time-inconsistent portfolio optimization with stocks and options
Authors:
Kristoffer Andersson,
Cornelis W. Oosterlee
Abstract:
In this paper, we propose a machine learning algorithm for time-inconsistent portfolio optimization. The proposed algorithm builds upon neural network based trading schemes, in which the asset allocation at each time point is determined by a neural network. The loss function is given by an empirical version of the objective function of the portfolio optimization problem. Moreover, various trading constraints are naturally fulfilled by choosing appropriate activation functions in the output layers of the neural networks. Besides this, our main contribution is to add options to the portfolio of risky assets and a risk-free bond, and to use additional neural networks to determine the amount allocated into the options as well as their strike prices.
We consider objective functions more in line with the rational preference of an investor than the classical mean-variance, apply realistic trading constraints and model the assets with a correlated jump-diffusion SDE. With an incomplete market and a more involved objective function, we show that it is beneficial to add options to the portfolio. Moreover, it is shown that adding options leads to a more constant stock allocation with less demand for drastic re-allocations.
Submitted 5 September, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
GPU acceleration of the Seven-League Scheme for large time step simulations of stochastic differential equations
Authors:
Shuaiqiang Liu,
Graziana Colonna,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
Monte Carlo simulation is widely used to numerically solve stochastic differential equations. Although the method is flexible and easy to implement, it may be slow to converge. Moreover, an inaccurate solution will result when using large time steps. The Seven League scheme, a deep learning-based numerical method, has been proposed to address these issues. This paper generalizes the scheme regarding parallel computing, particularly on Graphics Processing Units (GPUs), improving the computational speed.
Submitted 10 February, 2023;
originally announced February 2023.
-
AIDA: Analytic Isolation and Distance-based Anomaly Detection Algorithm
Authors:
Luis Antonio Souto Arias,
Cornelis W. Oosterlee,
Pasquale Cirillo
Abstract:
We combine the metrics of distance and isolation to develop the Analytic Isolation and Distance-based Anomaly (AIDA) detection algorithm. AIDA is the first distance-based method that does not rely on the concept of nearest-neighbours, making it a parameter-free model.
Differently from the prevailing literature, in which the isolation metric is always computed via simulations, we show that AIDA admits an analytical expression for the outlier score, providing new insights into the isolation metric. Additionally, we present an anomaly explanation method based on AIDA, the Tempered Isolation-based eXplanation (TIX) algorithm, which finds the most relevant outlier features even in data sets with hundreds of dimensions. We test both algorithms on synthetic and empirical data: we show that AIDA is competitive when compared to other state-of-the-art methods, and it is superior in finding outliers hidden in multidimensional feature subspaces. Finally, we illustrate how the TIX algorithm is able to find outliers in multidimensional feature subspaces, and use these explanations to analyze common benchmarks used in anomaly detection.
Submitted 8 December, 2022; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Efficient Wrong-Way Risk Modelling for Funding Valuation Adjustments
Authors:
T. van der Zwaard,
L. A. Grzelak,
C. W. Oosterlee
Abstract:
Wrong-Way Risk (WWR) is an important component in Funding Valuation Adjustment (FVA) modelling. Yet, the standard assumption is independence between market risks and the counterparty defaults and funding costs. This typical industrial setting is our point of departure, where we aim to assess the impact of WWR without running a full Monte Carlo simulation with all credit and funding processes. We propose to split the exposure profile into two parts: an independent and a WWR-driven part. For the former, exposures can be re-used from the standard xVA calculation. We express the second part of the exposure profile in terms of the stochastic drivers and approximate these by a common Gaussian stochastic factor. Within the affine setting, the proposed approximation is generic, is an add-on to the existing xVA calculations and provides an efficient and robust way to include WWR in FVA modelling. Case studies for an interest rate swap and a representative multi-currency portfolio of swaps illustrate that the approximation method is applicable in a practical setting. We analyze the approximation error and use the approximation to compute WWR sensitivities, which are needed for risk management. The approach is equally applicable to other metrics such as Credit Valuation Adjustment.
Submitted 6 June, 2024; v1 submitted 25 September, 2022;
originally announced September 2022.
-
A new self-exciting jump-diffusion process for option pricing
Authors:
Luis A. Souto Arias,
Pasquale Cirillo,
Cornelis W. Oosterlee
Abstract:
We propose a new jump-diffusion process, the Heston-Queue-Hawkes (HQH) model, combining the well-known Heston model and the recently introduced Queue-Hawkes (Q-Hawkes) jump process. Like the Hawkes process, the HQH model can capture the effects of self-excitation and contagion. However, since the characteristic function of the HQH process is known in closed-form, Fourier-based fast pricing algorithms, like the COS method, can be fully exploited with this model. Furthermore, we show that by using partial integrals of the characteristic function, which are also explicitly known for the HQH process, we can reduce the dimensionality of the COS method, and so its numerical complexity. Numerical results for European and Bermudan options show that the HQH model offers a wider range of volatility smiles compared to the Bates model, while its computational burden is considerably smaller than that of the Heston-Hawkes (HH) process.
Submitted 10 February, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Relevance of Wrong-Way Risk in Funding Valuation Adjustments
Authors:
T. van der Zwaard,
L. A. Grzelak,
C. W. Oosterlee
Abstract:
In March 2020, the world was thrown into financial distress. This manifested itself in increased uncertainty in the financial markets. Many interest rates collapsed, and funding spreads surged significantly due to the market turmoil. In light of these events, it is essential to understand and model Wrong-Way Risk (WWR) in a Funding Valuation Adjustment (FVA) context. WWR may currently be absent from FVA calculations in banks' Valuation Adjustment (xVA) engines. However, in this letter, we demonstrate that WWR effects are non-negligible in FVA modelling from a risk-management perspective. We look at the impact of various modelling choices, such as including the default times of the relevant parties, as well as stochastic and deterministic funding spreads. A case study is presented for interest rate derivatives.
Submitted 15 June, 2022; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Solution of integrals with fractional Brownian motion for different Hurst indices
Authors:
Fei Gao,
Shuaiqiang Liu,
Cornelis W. Oosterlee,
Nico M. Temme
Abstract:
In this paper, we will evaluate integrals that define the conditional expectation, variance and characteristic function of stochastic processes with respect to fractional Brownian motion (fBm) for all relevant Hurst indices, i.e. $H \in (0,1)$. The fractional Ornstein-Uhlenbeck (fOU) process, for example, gives rise to highly nontrivial integration formulas that need careful analysis when considering the whole range of Hurst indices. We will show that the classical technique of analytic continuation, from complex analysis, provides a way of extending the domain of validity of an integral, from $H\in(1/2,1)$, to the larger domain, $H\in(0,1)$. Numerical experiments for different Hurst indices confirm the robustness and efficiency of the integral formulations presented here. Moreover, we provide accurate and highly efficient financial option pricing results for processes that are related to the fOU process, with the help of Fourier cosine expansions.
Submitted 11 March, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Convergence of a robust deep FBSDE method for stochastic control
Authors:
Kristoffer Andersson,
Adam Andersson,
Cornelis W. Oosterlee
Abstract:
In this paper, we propose a deep learning based numerical scheme for strongly coupled FBSDEs, stemming from stochastic control. It is a modification of the deep BSDE method in which the initial value to the backward equation is not a free parameter, and with a new loss function being the weighted sum of the cost of the control problem, and a variance term which coincides with the mean squared error in the terminal condition. We show by a numerical example that a direct extension of the classical deep BSDE method to FBSDEs fails for a simple linear-quadratic control problem, and motivate why the new method works. Under regularity and boundedness assumptions on the exact controls of time-continuous and time-discrete control problems, we provide an error analysis for our method. We show empirically that the method converges for three different problems, one being the one that failed for a direct extension of the deep BSDE method.
Submitted 9 February, 2023; v1 submitted 18 January, 2022;
originally announced January 2022.
-
Markov Chain Generative Adversarial Neural Networks for Solving Bayesian Inverse Problems in Physics Applications
Authors:
Nikolaj T. Mücke,
Benjamin Sanderse,
Sander Bohté,
Cornelis W. Oosterlee
Abstract:
In the context of solving inverse problems for physics applications within a Bayesian framework, we present a new approach, Markov Chain Generative Adversarial Neural Networks (MCGANs), to alleviate the computational costs associated with solving the Bayesian inference problem. GANs pose a very suitable framework to aid in the solution of Bayesian inference problems, as they are designed to generate samples from complicated high-dimensional distributions. By training a GAN to sample from a low-dimensional latent space and then embedding it in a Markov Chain Monte Carlo method, we can highly efficiently sample from the posterior, by replacing both the high-dimensional prior and the expensive forward map. We prove that the proposed methodology converges to the true posterior in the Wasserstein-1 distance and that sampling from the latent space is equivalent to sampling in the high-dimensional space in a weak sense. The method is showcased on two test cases where we perform both state and parameter estimation simultaneously. The approach is shown to be up to two orders of magnitude more accurate than alternative approaches while also being up to two orders of magnitude computationally faster, in multiple test cases, including the important engineering setting of detecting leaks in pipelines.
Submitted 6 September, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
The One Step Malliavin scheme: new discretization of BSDEs implemented with deep learning regressions
Authors:
Balint Negyesi,
Kristoffer Andersson,
Cornelis W. Oosterlee
Abstract:
A novel discretization is presented for forward-backward stochastic differential equations (FBSDE) with differentiable coefficients, simultaneously solving the BSDE and its Malliavin sensitivity problem. The control process is estimated by the corresponding linear BSDE driving the trajectories of the Malliavin derivatives of the solution pair, which implies the need to provide accurate $\Gamma$ estimates. The approximation is based on a merged formulation given by the Feynman-Kac formulae and the Malliavin chain rule. The continuous time dynamics is discretized with a theta-scheme. In order to allow for an efficient numerical solution of the arising semi-discrete conditional expectations in possibly high dimensions, it is fundamental that the chosen approach admits differentiable estimates. Two fully-implementable schemes are considered: the BCOS method as a reference in the one-dimensional framework and neural network Monte Carlo regressions in case of high-dimensional problems, similarly to the recently emerging class of Deep BSDE methods [Han et al. (2018), Huré et al. (2020)]. An error analysis is carried out to show $L^2$ convergence of order $1/2$, under standard Lipschitz assumptions and additive noise in the forward diffusion. Numerical experiments are provided for a range of different semi- and quasi-linear equations up to $50$ dimensions, demonstrating that the proposed scheme yields a significant improvement in the control estimations.
Submitted 11 October, 2021;
originally announced October 2021.
-
Pricing and Hedging Prepayment Risk in a Mortgage Portfolio
Authors:
Emanuele Casamassima,
Lech A. Grzelak,
Frank A. Mulder,
Cornelis W. Oosterlee
Abstract:
Understanding mortgage prepayment is crucial for any financial institution providing mortgages, and it is important for hedging the risk resulting from such unexpected cash flows. Here, in the setting of a Dutch mortgage provider, we propose to include non-linear financial instruments in the hedge portfolio when dealing with mortgages with the option to prepay part of the notional early. Based on the assumption that there is a correlation between prepayment and the interest rates in the market, a model is proposed which is based on a specific refinancing incentive. The linear and non-linear risks are addressed by a set of tradeable instruments in a static hedge strategy. We will show that a stochastic model for the notional of a mortgage unveils non-linear risk embedded in a prepayment option. Based on a calibration of the refinancing incentive on a data set of more than thirty million observations, a functional form of the prepayments is defined, which accurately reflects the borrowers' behaviour. We compare this functional form with a fully rational model, where the option to prepay is assumed to be exercised rationally.
Submitted 13 October, 2021; v1 submitted 30 September, 2021;
originally announced September 2021.
-
Positive Stochastic Collocation for the Collocated Local Volatility Model
Authors:
Fabien Le Floc'h,
Cornelis W. Oosterlee
Abstract:
This paper presents how to apply the stochastic collocation technique to assets that cannot move below a boundary. It shows that the polynomial collocation towards a lognormal distribution does not work well. Then, the potential issues of the related collocated local volatility model (CLV) are explored. Finally, a simple analytical expression for the Dupire local volatility derived from the option prices modelled by stochastic collocation is given.
Submitted 6 September, 2021;
originally announced September 2021.
-
Energy-conserving formulation of the two-fluid model for incompressible two-phase flow in channels and pipes
Authors:
J. F. H. Buist,
B. Sanderse,
S. Dubinkina,
R. A. W. M. Henkes,
C. W. Oosterlee
Abstract:
We show that the one-dimensional (1D) two-fluid model (TFM) for stratified flow in channels and pipes (in its incompressible, isothermal form) satisfies an energy conservation equation, which arises naturally from the mass and momentum conservation equations that constitute the model. This result extends upon earlier work on the shallow water equations (SWE), with the important difference that we include non-conservative pressure terms in the analysis, and that we propose a formulation that holds for ducts with an arbitrary cross-sectional shape, with the 2D channel and circular pipe geometries as special cases.
The second novel result of this work is the formulation of a finite volume scheme for the TFM that satisfies a discrete form of the continuous energy equation. This discretization is derived in a manner that runs parallel to the continuous analysis. Due to the non-conservative pressure terms it is essential to employ a staggered grid, which requires careful consideration in defining the discrete energy and energy fluxes, and the relations between them and the discrete model. Numerical simulations confirm that the discrete energy is conserved.
Submitted 21 December, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Monte Carlo Simulation of SDEs using GANs
Authors:
Jorino van Rhijn,
Cornelis W. Oosterlee,
Lech A. Grzelak,
Shuaiqiang Liu
Abstract:
Generative adversarial networks (GANs) have shown promising results when applied to partial differential equations and financial time series generation. We investigate if GANs can also be used to approximate one-dimensional Ito stochastic differential equations (SDEs). We propose a scheme that approximates the path-wise conditional distribution of SDEs for large time steps. Standard GANs are only able to approximate processes in distribution, yielding a weak approximation to the SDE. A conditional GAN architecture is proposed that enables strong approximation. We inform the discriminator of this GAN with the map between the prior input to the generator and the corresponding output samples, i.e. we introduce a 'supervised GAN'. We compare the input-output map obtained with the standard GAN and supervised GAN and show experimentally that the standard GAN may fail to provide a path-wise approximation. The GAN is trained on a dataset obtained with exact simulation. The architecture was tested on geometric Brownian motion (GBM) and the Cox-Ingersoll-Ross (CIR) process. The supervised GAN outperformed the Euler and Milstein schemes in strong error on a discretisation with large time steps. It also outperformed the standard conditional GAN when approximating the conditional distribution. We also demonstrate how standard GANs may give rise to non-parsimonious input-output maps that are sensitive to perturbations, which motivates the need for constraints and regularisation on GAN generators.
Submitted 3 April, 2021;
originally announced April 2021.
-
Valuation of electricity storage contracts using the COS method
Authors:
Boris C. Boonstra,
Cornelis W. Oosterlee
Abstract:
Storage of electricity has become increasingly important, due to the gradual replacement of fossil fuels by more variable and uncertain renewable energy sources. In this paper, we provide details on how to mathematically formalize a corresponding electricity storage contract, taking into account the physical limitations of a storage facility and the operational constraints of the electricity grid. We give details of a valuation technique to price these contracts, where the electricity prices follow a structural model based on a stochastic polynomial process. In particular, we show that the Fourier-based COS method can be used to price the contracts accurately and efficiently.
Submitted 8 January, 2021;
originally announced January 2021.
-
Reduced Order Modeling for Parameterized Time-Dependent PDEs using Spatially and Memory Aware Deep Learning
Authors:
Nikolaj T. Mücke,
Sander M. Bohté,
Cornelis W. Oosterlee
Abstract:
We present a novel reduced order model (ROM) approach for parameterized time-dependent PDEs based on modern learning. The ROM is suitable for multi-query problems and is nonintrusive. It is divided into two distinct stages: A nonlinear dimensionality reduction stage that handles the spatially distributed degrees of freedom based on convolutional autoencoders, and a parameterized time-stepping stage based on memory aware neural networks (NNs), specifically causal convolutional and long short-term memory NNs. Strategies to ensure generalization and stability are discussed. The methodology is tested on the heat equation, advection equation, and the incompressible Navier-Stokes equations, to show the variety of problems the ROM can handle.
Submitted 23 November, 2020;
originally announced November 2020.
-
Rule-based Strategies for Dynamic Life Cycle Investment
Authors:
T. R. B. den Haan,
K. W. Chau,
M. van der Schans,
C. W. Oosterlee
Abstract:
In this work, we consider rule-based investment strategies for managing a defined contribution saving scheme under the Dutch pension fund testing model. We found that dynamic rule-based investment can outperform traditional static strategies, by which we mean that the pensioner can achieve the target retirement income with higher probability and limit the shortfall when the target is not met. In comparison with the popular dynamic programming technique, the rule-based strategy has a more stable asset allocation throughout time and avoids excessive transactions, which may be hard to explain to the investor. We also study a combined strategy of rule-based target and dynamic programming in this work. Another key feature of this work is that there is no risk-free asset under our setting; instead, a matching portfolio is introduced for the investor to avoid unnecessary risk.
Submitted 4 November, 2020;
originally announced November 2020.
-
Deep learning for CVA computations of large portfolios of financial derivatives
Authors:
Kristoffer Andersson,
Cornelis W. Oosterlee
Abstract:
In this paper, we propose a neural network-based method for CVA computations of a portfolio of derivatives. In particular, we focus on portfolios consisting of a combination of derivatives, with and without true optionality, e.g., a portfolio of a mix of European- and Bermudan-type derivatives. CVA is computed, with and without netting, for different levels of WWR and for different levels of credit quality of the counterparty. We show that the CVA is overestimated by up to 25% when using the standard procedure of not adjusting the exercise strategy for the default risk of the counterparty. For the Expected Shortfall of the CVA dynamics, the overestimation was found to be more than 100% in some non-extreme cases.
Submitted 26 October, 2020;
originally announced October 2020.
-
On high-order schemes for tempered fractional partial differential equations
Authors:
Linlin Bu,
Cornelis W. Oosterlee
Abstract:
In this paper, we propose third-order semi-discretized schemes in space based on the tempered weighted and shifted Grünwald difference (tempered-WSGD) operators for the tempered fractional diffusion equation. We also show stability and convergence analysis for the fully discrete scheme based on a Crank--Nicolson scheme in time. A third-order scheme for the tempered Black--Scholes equation is also proposed and tested numerically. Some numerical experiments are carried out to confirm the accuracy and effectiveness of these proposed methods.
Submitted 15 September, 2020;
originally announced September 2020.
-
The Seven-League Scheme: Deep learning for large time step Monte Carlo simulations of stochastic differential equations
Authors:
Shuaiqiang Liu,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
We propose an accurate data-driven numerical scheme to solve Stochastic Differential Equations (SDEs), by taking large time steps. The SDE discretization is built up by means of a polynomial chaos expansion method, on the basis of accurately determined stochastic collocation (SC) points. By employing an artificial neural network to learn these SC points, we can perform Monte Carlo simulations with large time steps. Error analysis confirms that this data-driven scheme results in accurate SDE solutions in the sense of strong convergence, provided the learning methodology is robust and accurate. With a method variant called the compression-decompression collocation and interpolation technique, we can drastically reduce the number of neural network functions that have to be learned, so that computational speed is enhanced. Numerical experiments confirm a high-quality strong convergence error when using large time steps, and the novel scheme outperforms some classical numerical SDE discretizations. Some applications, here in financial option valuation, are also presented.
Submitted 23 September, 2021; v1 submitted 7 September, 2020;
originally announced September 2020.
-
Financial option valuation by unsupervised learning with artificial neural networks
Authors:
Beatriz Salvador,
Cornelis W. Oosterlee,
Remco van der Meer
Abstract:
Artificial neural networks (ANNs) have recently also been applied to solve partial differential equations (PDEs). In this work, the classical problem of pricing European and American financial options, based on the corresponding PDE formulations, is studied. Instead of using numerical techniques based on finite element or difference methods, we address the problem using ANNs in the context of unsupervised learning. As a result, the ANN learns the option values for all possible underlying stock values at future time points, based on the minimization of a suitable loss function. For the European option, we solve the linear Black-Scholes equation, whereas for the American option, we solve the linear complementarity problem formulation. Two-asset exotic option values are also computed, since ANNs enable the accurate valuation of high-dimensional options. The resulting errors of the ANN approach are assessed by comparing to the analytic option values or to numerical reference solutions (for American options, computed by finite elements).
Submitted 25 May, 2020;
originally announced May 2020.
-
A Computational Approach to Hedging Credit Valuation Adjustment in a Jump-Diffusion Setting
Authors:
T. van der Zwaard,
L. A. Grzelak,
C. W. Oosterlee
Abstract:
This study contributes to understanding Valuation Adjustments (xVA) by focussing on the dynamic hedging of Credit Valuation Adjustment (CVA), corresponding Profit & Loss (P&L) and the P&L explain. This is done in a Monte Carlo simulation setting, based on a theoretical hedging framework discussed in existing literature. We look at hedging CVA market risk for a portfolio with European options on a stock, first in a Black-Scholes setting, then in a Merton jump-diffusion setting. Furthermore, we analyze the trading business at a bank after including xVAs in pricing. We provide insights into the hedging of derivatives and their xVAs by analyzing and visualizing the cash-flows of a portfolio from a desk structure perspective. The case study shows that not charging CVA at trade inception results in an expected loss. Furthermore, hedging CVA market risk is crucial to end up with a stable trading strategy. In the Black-Scholes setting this can be done using the underlying stock, whereas in the Merton jump-diffusion setting we need to add extra options to the hedge portfolio to properly hedge the jump risk. In addition to the simulation, we derive analytical results that explain our observations from the numerical experiments. Understanding the hedging of CVA helps to deal with xVAs in a practical setting.
Submitted 14 September, 2020; v1 submitted 21 May, 2020;
originally announced May 2020.
-
On Calibration Neural Networks for extracting implied information from American options
Authors:
Shuaiqiang Liu,
Álvaro Leitao,
Anastasia Borovykh,
Cornelis W. Oosterlee
Abstract:
Extracting implied information, like volatility and/or dividend, from observed option prices is a challenging task when dealing with American options, because of the computational costs needed to solve the corresponding mathematical problem many thousands of times. We will employ a data-driven machine learning approach to estimate the Black-Scholes implied volatility and the dividend yield for American options in a fast and robust way. To determine the implied volatility, the inverse function is approximated by an artificial neural network on the computational domain of interest, which decouples the offline (training) and online (prediction) phases and thus eliminates the need for an iterative process. For the implied dividend yield, we formulate the inverse problem as a calibration problem and determine simultaneously the implied volatility and dividend yield. For this, a generic and robust calibration framework, the Calibration Neural Network (CaNN), is introduced to estimate multiple parameters. It is shown that machine learning can be used as an efficient numerical technique to extract implied information from American options.
Submitted 31 January, 2020;
originally announced January 2020.
-
Exploration of a Cosine Expansion Lattice Scheme
Authors:
Ki Wai Chau,
Cornelis W. Oosterlee
Abstract:
In this article, we combine a lattice sequence from Quasi-Monte Carlo rules with the philosophy of the Fourier-cosine method to design an approximation scheme for expectation computation. We study the error of this scheme and compare this scheme with our previous work on wavelets. Also, some numerical experiments are performed.
Submitted 5 July, 2019;
originally announced July 2019.
-
Efficient Computation of Various Valuation Adjustments Under Local Lévy Models
Authors:
Anastasia Borovykh,
Andrea Pascucci,
Cornelis W. Oosterlee
Abstract:
Various valuation adjustments, or XVAs, can be written in terms of non-linear PIDEs equivalent to FBSDEs. In this paper we develop a Fourier-based method for solving FBSDEs in order to efficiently and accurately price Bermudan derivatives, including options and swaptions, with XVA under the flexible dynamics of a local Lévy model: this framework includes a local volatility function and a local jump measure. Due to the unavailability of the characteristic function for such processes, we use an asymptotic approximation based on the adjoint formulation of the problem.
Submitted 5 May, 2019;
originally announced May 2019.
-
A neural network-based framework for financial model calibration
Authors:
Shuaiqiang Liu,
Anastasia Borovykh,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
A data-driven approach called CaNN (Calibration Neural Network) is proposed to calibrate financial asset price models using an Artificial Neural Network (ANN). Determining optimal values of the model parameters is formulated as training hidden neurons within a machine learning framework, based on available financial option prices. The framework consists of two parts: a forward pass in which we train the weights of the ANN off-line, valuing options under many different asset model parameter settings; and a backward pass, in which we evaluate the trained ANN-solver on-line, aiming to find the weights of the neurons in the input layer. The rapid on-line learning of implied volatility by ANNs, in combination with the use of an adapted parallel global optimization method, tackles the computation bottleneck and provides a fast and reliable technique for calibrating model parameters while avoiding, as much as possible, getting stuck in local minima. Numerical experiments confirm that this machine-learning framework can be employed to calibrate parameters of high-dimensional stochastic volatility models efficiently and accurately.
Submitted 23 April, 2019;
originally announced April 2019.
-
A parametric acceleration of multilevel Monte Carlo convergence for nonlinear variably saturated flow
Authors:
Prashant Kumar,
Carmen Rodrigo,
Francisco J. Gaspar,
Cornelis W. Oosterlee
Abstract:
We present a multilevel Monte Carlo (MLMC) method for the uncertainty quantification of variably saturated porous media flow that is modeled using Richards' equation. We propose a stochastic extension for the empirical models that are typically employed to close Richards' equation. This is achieved by treating the soil parameters in these models as spatially correlated random fields with appropriately defined marginal distributions. As some of these parameters can only take values in a specific range, non-Gaussian models are utilized. The randomness in these parameters may result in path-wise highly nonlinear systems, so that a robust solver with respect to the random input is required. For this purpose, a solution method based on a combination of the modified Picard iteration and a cell-centered multigrid method for heterogeneous diffusion coefficients is utilized. Moreover, we propose a non-standard MLMC estimator to solve the resulting high-dimensional stochastic Richards' equation. The improved efficiency of this multilevel estimator is achieved by parametric continuation that allows us to incorporate simpler nonlinear problems on coarser levels for variance reduction while the target strongly nonlinear problem is solved only on the finest level. Several numerical experiments are presented showing computational savings obtained by the new estimator compared to the original MC estimator.
Submitted 19 March, 2019;
originally announced March 2019.
-
Generalisation in fully-connected neural networks for time series forecasting
Authors:
Anastasia Borovykh,
Cornelis W. Oosterlee,
Sander M. Bohte
Abstract:
In this paper we study the generalization capabilities of fully-connected neural networks trained in the context of time series forecasting. Time series do not satisfy the typical assumption in statistical learning theory of the data being i.i.d. samples from some data-generating distribution. We use the input and weight Hessians, that is, the smoothness of the learned function with respect to the input and the width of the minimum in weight space, to quantify a network's ability to generalize to unseen data. While such generalization metrics have been studied extensively in the i.i.d. setting of, for example, image recognition, here we empirically validate their use in the task of time series forecasting. Furthermore, we discuss how one can control the generalization capability of the network by means of the training process, using the learning rate, batch size and the number of training iterations as controls. Using these hyperparameters, one can efficiently control the complexity of the output function without imposing explicit constraints.
Submitted 26 July, 2019; v1 submitted 14 February, 2019;
originally announced February 2019.
-
Pricing options and computing implied volatilities using neural networks
Authors:
Shuaiqiang Liu,
Cornelis W. Oosterlee,
Sander M. Bohte
Abstract:
This paper proposes a data-driven approach, by means of an Artificial Neural Network (ANN), to value financial options and to calculate implied volatilities with the aim of accelerating the corresponding numerical methods. With ANNs being universal function approximators, this method trains an optimized ANN on a data set generated by a sophisticated financial model, and runs the trained ANN as an agent of the original solver in a fast and efficient way. We test this approach on three different types of solvers, including the analytic solution for the Black-Scholes equation, the COS method for the Heston stochastic volatility model and Brent's iterative root-finding method for the calculation of implied volatilities. The numerical results show that the ANN solver can reduce the computing time significantly.
Submitted 23 April, 2019; v1 submitted 25 January, 2019;
originally announced January 2019.
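As a rough illustration of the kind of baseline solver the abstract above refers to, the following minimal sketch inverts the Black-Scholes call price for the implied volatility. It is not the authors' code: a plain bisection search stands in for Brent's method, and the function names (`bs_call_price`, `implied_vol`), the search bracket and the tolerance are assumptions chosen for this example only.

```python
import math


def norm_cdf(x: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call_price(spot, strike, rate, maturity, vol):
    """Black-Scholes price of a European call (no dividends)."""
    sqrt_t = math.sqrt(maturity)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol ** 2) * maturity) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    return spot * norm_cdf(d1) - strike * math.exp(-rate * maturity) * norm_cdf(d2)


def implied_vol(price, spot, strike, rate, maturity, lo=1e-6, hi=5.0, tol=1e-10):
    """Implied volatility by bisection (a simple stand-in for Brent's method).

    The bracket [lo, hi] is an assumption; the call price is increasing in
    the volatility, so a sign change guarantees a unique root inside it.
    """
    f_lo = bs_call_price(spot, strike, rate, maturity, lo) - price
    f_hi = bs_call_price(spot, strike, rate, maturity, hi) - price
    if f_lo * f_hi > 0:
        raise ValueError("price is not attainable inside the volatility bracket")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        f_mid = bs_call_price(spot, strike, rate, maturity, mid) - price
        if abs(f_mid) < tol or hi - lo < tol:
            return mid
        if f_lo * f_mid <= 0:
            hi = mid  # root lies in the lower half
        else:
            lo, f_lo = mid, f_mid  # root lies in the upper half
    return 0.5 * (lo + hi)


# Round trip: price an at-the-money call at 20% volatility, then recover it.
reference_price = bs_call_price(100.0, 100.0, 0.05, 1.0, 0.2)
recovered_vol = implied_vol(reference_price, 100.0, 100.0, 0.05, 1.0)
```

An iterative inversion of this kind is the sort of per-option cost that, according to the abstract, a trained ANN is meant to reduce.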
-
On local Fourier analysis of multigrid methods for PDEs with jumping and random coefficients
Authors:
Prashant Kumar,
Carmen Rodrigo,
Francisco J. Gaspar,
Cornelis W. Oosterlee
Abstract:
In this paper, we propose a novel non-standard Local Fourier Analysis (LFA) variant for accurately predicting the multigrid convergence of problems with random and jumping coefficients. This LFA method is based on a specific basis of the Fourier space rather than the commonly used Fourier modes. To show the utility of this analysis, we consider, as an example, a simple cell-centered multigrid method for solving a steady-state single-phase flow problem in a random porous medium. We successfully demonstrate the prediction capability of the proposed LFA using a number of challenging benchmark problems. The information provided by this analysis helps us to estimate a priori the time needed for solving certain uncertainty quantification problems by means of a multigrid multilevel Monte Carlo method.
Submitted 25 February, 2019; v1 submitted 14 March, 2018;
originally announced March 2018.
-
Stochastic grid bundling method for backward stochastic differential equations
Authors:
Ki Wai Chau,
Cornelis W. Oosterlee
Abstract:
In this work, we apply the Stochastic Grid Bundling Method (SGBM) to numerically solve backward stochastic differential equations (BSDEs). The SGBM algorithm is based on conditional expectations approximation by means of bundling of Monte Carlo sample paths and a local regress-later regression within each bundle. The basic algorithm for solving the backward stochastic differential equations will be introduced and an upper error bound is established for the local regression. A full error analysis is also conducted for the explicit version of our algorithm and numerical experiments are performed to demonstrate various properties of our algorithm.
Submitted 25 March, 2019; v1 submitted 16 January, 2018;
originally announced January 2018.
-
Conditional Time Series Forecasting with Convolutional Neural Networks
Authors:
Anastasia Borovykh,
Sander Bohte,
Cornelis W. Oosterlee
Abstract:
We present a method for conditional time series forecasting based on an adaptation of the recent deep convolutional WaveNet architecture. The proposed network contains stacks of dilated convolutions that allow it to access a broad range of history when forecasting, together with a ReLU activation function. Conditioning is performed by applying multiple convolutional filters in parallel to separate time series, which allows for the fast processing of data and the exploitation of the correlation structure between the multivariate time series. We test and analyze the performance of the convolutional network both unconditionally and conditionally for financial time series forecasting using the S&P500, the volatility index, the CBOE interest rate and several exchange rates, and extensively compare it to the performance of the well-known autoregressive model and a long short-term memory network. We show that a convolutional network is well-suited for regression-type problems, is able to effectively learn dependencies in and between the series without the need for long historical time series, is a time-efficient and easy-to-implement alternative to recurrent-type networks, and tends to outperform linear and recurrent models.
Submitted 17 September, 2018; v1 submitted 14 March, 2017;
originally announced March 2017.
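To make the phrase "stacks of dilated convolutions that allow it to access a broad range of history" concrete, here is a minimal pure-Python sketch of a single dilated causal convolution and of the receptive field of a stack of such layers. It is an illustration under stated assumptions, not the architecture of the paper: the helper names (`dilated_causal_conv`, `receptive_field`), the zero padding on the left, the kernel size of 2 and the doubling dilation schedule are all choices made for this example.

```python
def dilated_causal_conv(series, weights, dilation):
    """Dilated causal 1-D convolution with zero padding on the left.

    Output y[t] = sum_k weights[k] * series[t - k * dilation], where terms
    that would reach before the start of the series are treated as zero,
    so y[t] never depends on future values.
    """
    output = []
    for t in range(len(series)):
        total = 0.0
        for k, w in enumerate(weights):
            index = t - k * dilation
            if index >= 0:
                total += w * series[index]
        output.append(total)
    return output


def receptive_field(kernel_size, dilations):
    """Number of past-and-present inputs seen by one output of the stack."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


# One layer with dilation 2: each output adds the value two steps earlier.
layer_output = dilated_causal_conv([1, 2, 3, 4, 5], [1, 1], dilation=2)

# Four layers with doubling dilations (an assumed schedule) and kernel size 2.
history_length = receptive_field(kernel_size=2, dilations=[1, 2, 4, 8])
```

With doubling dilations the receptive field grows exponentially in the number of layers while the parameter count grows only linearly, which is the usual motivation for this kind of stack.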
-
On the wavelets-based SWIFT method for backward stochastic differential equations
Authors:
Ki Wai Chau,
Cornelis W. Oosterlee
Abstract:
We propose a numerical algorithm for backward stochastic differential equations based on time discretization and trigonometric wavelets. This method combines the effectiveness of Fourier-based methods and the simplicity of a wavelet-based formula, resulting in an algorithm that is both accurate and easy to implement. Furthermore, we mitigate the problem of errors near the computation boundaries by means of an antireflective boundary technique, giving an improved approximation. We test our algorithm with different numerical experiments.
Submitted 9 November, 2016;
originally announced November 2016.
-
Pricing Bermudan options under local Lévy models with default
Authors:
Anastasia Borovykh,
Cornelis W. Oosterlee,
Andrea Pascucci
Abstract:
We consider a defaultable asset whose risk-neutral pricing dynamics are described by an exponential Lévy-type martingale. This class of models allows for a local volatility, local default intensity and a locally dependent Lévy measure. We present a pricing method for Bermudan options based on an analytical approximation of the characteristic function combined with the COS method. Due to a special form of the obtained characteristic function the price can be computed using a Fast Fourier Transform-based algorithm resulting in a fast and accurate calculation. The Greeks can be computed at almost no additional computational cost. Error bounds for the approximation of the characteristic function as well as for the total option price are given.
Submitted 29 April, 2016;
originally announced April 2016.
-
Monte Carlo Calculation of Exposure Profiles and Greeks for Bermudan and Barrier Options under the Heston Hull-White Model
Authors:
Q. Feng,
C. W. Oosterlee
Abstract:
Valuation of Credit Valuation Adjustment (CVA) has become an important field as its calculation is required in Basel III, issued in 2010, in the wake of the credit crisis. Exposure, which is defined as the potential future loss of a default event without any recovery, is one of the key elements for pricing CVA. This paper provides a backward dynamics framework for assessing exposure profiles of European, Bermudan and barrier options under the Heston and Heston Hull-White asset dynamics. We discuss the potential of an efficient and adaptive Monte Carlo approach, the Stochastic Grid Bundling Method (SGBM), which employs the techniques of simulation, regression and bundling. Greeks of the exposure profiles can be calculated in the same backward iteration with little extra effort. Assuming independence between default event and exposure profiles, we give examples of calculating exposure, CVA and Greeks for Bermudan and barrier options.
Submitted 11 December, 2014;
originally announced December 2014.