-
Generating drawdown-realistic financial price paths using path signatures
Authors:
Emiel Lemahieu,
Kris Boudt,
Maarten Wyns
Abstract:
A novel generative machine learning approach for the simulation of sequences of financial price data with drawdowns quantifiably close to empirical data is introduced. Applications such as pricing drawdown insurance options or developing portfolio drawdown control strategies call for a host of drawdown-realistic paths. Historical scenarios may be insufficient to effectively train and backtest the…
▽ More
A novel generative machine learning approach for the simulation of sequences of financial price data with drawdowns quantifiably close to empirical data is introduced. Applications such as pricing drawdown insurance options or developing portfolio drawdown control strategies call for a host of drawdown-realistic paths. Historical scenarios may be insufficient to effectively train and backtest the strategy, while standard parametric Monte Carlo does not adequately preserve drawdowns. We advocate a non-parametric Monte Carlo approach combining a variational autoencoder generative model with a drawdown reconstruction loss function. To overcome issues of numerical complexity and non-differentiability, we approximate drawdown as a linear function of the moments of the path, known in the literature as path signatures. We prove the required regularity of drawdown function and consistency of the approximation. Furthermore, we obtain close numerical approximations using linear regression for fractional Brownian and empirical data. We argue that linear combinations of the moments of a path yield a mathematically non-trivial smoothing of the drawdown function, which gives one leeway to simulate drawdown-realistic price paths by including drawdown evaluation metrics in the learning objective. We conclude with numerical experiments on mixed equity, bond, real estate and commodity portfolios and obtain a host of drawdown-realistic paths.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
The R package sentometrics to compute, aggregate and predict with textual sentiment
Authors:
David Ardia,
Keven Bluteau,
Samuel Borms,
Kris Boudt
Abstract:
We provide a hands-on introduction to optimized textual sentiment indexation using the R package sentometrics. Textual sentiment analysis is increasingly used to unlock the potential information value of textual data. The sentometrics package implements an intuitive framework to efficiently compute sentiment scores of numerous texts, to aggregate the scores into multiple time series, and to use th…
▽ More
We provide a hands-on introduction to optimized textual sentiment indexation using the R package sentometrics. Textual sentiment analysis is increasingly used to unlock the potential information value of textual data. The sentometrics package implements an intuitive framework to efficiently compute sentiment scores of numerous texts, to aggregate the scores into multiple time series, and to use these time series to predict other variables. The workflow of the package is illustrated with a built-in corpus of news articles from two major U.S. journals to forecast the CBOE Volatility Index.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
The Minimum Regularized Covariance Determinant estimator
Authors:
Kris Boudt,
Peter J. Rousseeuw,
Steven Vanduffel,
Tim Verdonck
Abstract:
The Minimum Covariance Determinant (MCD) approach robustly estimates the location and scatter matrix using the subset of given size with lowest sample covariance determinant. Its main drawback is that it cannot be applied when the dimension exceeds the subset size. We propose the Minimum Regularized Covariance Determinant (MRCD) approach, which differs from the MCD in that the scatter matrix is a…
▽ More
The Minimum Covariance Determinant (MCD) approach robustly estimates the location and scatter matrix using the subset of given size with lowest sample covariance determinant. Its main drawback is that it cannot be applied when the dimension exceeds the subset size. We propose the Minimum Regularized Covariance Determinant (MRCD) approach, which differs from the MCD in that the scatter matrix is a convex combination of a target matrix and the sample covariance matrix of the subset. A data-driven procedure sets the weight of the target matrix, so that the regularization is only used when needed. The MRCD estimator is defined in any dimension, is well-conditioned by construction and preserves the good robustness properties of the MCD. We prove that so-called concentration steps can be performed to reduce the MRCD objective function, and we exploit this fact to construct a fast algorithm. We verify the accuracy and robustness of the MRCD estimator in a simulation study and illustrate its practical use for outlier detection and regression analysis on real-life high-dimensional data sets in chemistry and criminology.
△ Less
Submitted 1 December, 2018; v1 submitted 24 January, 2017;
originally announced January 2017.
-
Value-at-Risk Prediction in R with the GAS Package
Authors:
David Ardia,
Kris Boudt,
Leopoldo Catania
Abstract:
GAS models have been recently proposed in time-series econometrics as valuable tools for signal extraction and prediction. This paper details how financial risk managers can use GAS models for Value-at-Risk (VaR) prediction using the novel GAS package for R. Details and code snippets for prediction, comparison and backtesting with GAS models are presented. An empirical application considering Dow…
▽ More
GAS models have been recently proposed in time-series econometrics as valuable tools for signal extraction and prediction. This paper details how financial risk managers can use GAS models for Value-at-Risk (VaR) prediction using the novel GAS package for R. Details and code snippets for prediction, comparison and backtesting with GAS models are presented. An empirical application considering Dow Jones Index constituents investigates the VaR forecasting performance of GAS models.
△ Less
Submitted 18 November, 2016;
originally announced November 2016.
-
Generalized Autoregressive Score Models in R: The GAS Package
Authors:
David Ardia,
Kris Boudt,
Leopoldo Catania
Abstract:
This paper presents the R package GAS for the analysis of time series under the Generalized Autoregressive Score (GAS) framework of Creal et al. (2013) and Harvey (2013). The distinctive feature of the GAS approach is the use of the score function as the driver of time-variation in the parameters of nonlinear models. The GAS package provides functions to simulate univariate and multivariate GAS pr…
▽ More
This paper presents the R package GAS for the analysis of time series under the Generalized Autoregressive Score (GAS) framework of Creal et al. (2013) and Harvey (2013). The distinctive feature of the GAS approach is the use of the score function as the driver of time-variation in the parameters of nonlinear models. The GAS package provides functions to simulate univariate and multivariate GAS processes, estimate the GAS parameters and to make time series forecasts. We illustrate the use of the GAS package with a detailed case study on estimating the time-varying conditional densities of a set of financial assets.
△ Less
Submitted 8 September, 2016;
originally announced September 2016.