-
Analyzing Pension Fund Mortality with Gaussian Processes in a Sub Population Framework
Authors:
Eduardo F. L. de Melo,
Michael Ludkovski,
Rodrigo S. Targino
Abstract:
Pension fund populations often have mortality experiences that are substantially different from the national benchmark. In a motivating case study of Brazilian corporate pension funds, pensioners are observed to have mortality that is 40-55% below the national average, due to the underlying socioeconomic disparities. Direct analysis of a pension fund population is challenging due to very sparse da…
▽ More
Pension fund populations often have mortality experiences that are substantially different from the national benchmark. In a motivating case study of Brazilian corporate pension funds, pensioners are observed to have mortality that is 40-55% below the national average, due to the underlying socioeconomic disparities. Direct analysis of a pension fund population is challenging due to very sparse data, with age-specific annual death counts often in low single digits. We design and study a collection of stochastic sub-population frameworks that coherently capture and project pensioner mortality rates via deflator factors relative to a reference population. Superseding parametric approaches, we propose Gaussian process (GP) based models that flexibly estimate Age- and/or Year-specific deflators. We demonstrate that the GP models achieve better goodness of fit and uncertainty quantification. Our models are illustrated on two Brazilian pension funds in the context of exogenous national and insurance industry mortality tables. The GP models are implemented in R Stan using a fully Bayesian approach and take into account over-dispersion relative to the Poisson likelihood.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Intraday Battery Dispatch for Hybrid Renewable Energy Assets
Authors:
Thiha Aung,
Mike Ludkovski
Abstract:
We develop a mathematical model for intraday dispatch of co-located wind-battery energy assets. Focusing on the primary objective of firming grid-side actual production vis-a-vis the preset day-ahead hourly generation targets, we conduct a comprehensive study of the resulting stochastic control problem across different firming formulations and wind generation dynamics. Among others, we provide a c…
▽ More
We develop a mathematical model for intraday dispatch of co-located wind-battery energy assets. Focusing on the primary objective of firming grid-side actual production vis-a-vis the preset day-ahead hourly generation targets, we conduct a comprehensive study of the resulting stochastic control problem across different firming formulations and wind generation dynamics. Among others, we provide a closed-form solution in the special case of a quadratic objective and linear dynamics, as well as design a novel adaptation of a Gaussian Process-based Regression Monte Carlo algorithm for our setting. Extensions studied include an asymmetric loss function for peak shaving, capturing the cost of battery cycling, and the role of battery duration. In the applied portion of our work, we calibrate our model to a collection of 140+ wind-battery assets in Texas, benchmarking the economic benefits of firming based on outputs of a realistic unit commitment and economic dispatch solver.
△ Less
Submitted 15 March, 2025;
originally announced March 2025.
-
Selecting Critical Scenarios of DER Adoption in Distribution Grids Using Bayesian Optimization
Authors:
Olivier Mulkin,
Miguel Heleno,
Mike Ludkovski
Abstract:
We develop a new methodology to select scenarios of DER adoption most critical for distribution grids. Anticipating risks of future voltage and line flow violations due to additional PV adopters is central for utility investment planning but continues to rely on deterministic or ad hoc scenario selection. We propose a highly efficient search framework based on multi-objective Bayesian Optimization…
▽ More
We develop a new methodology to select scenarios of DER adoption most critical for distribution grids. Anticipating risks of future voltage and line flow violations due to additional PV adopters is central for utility investment planning but continues to rely on deterministic or ad hoc scenario selection. We propose a highly efficient search framework based on multi-objective Bayesian Optimization. We treat underlying grid stress metrics as computationally expensive black-box functions, approximated via Gaussian Process surrogates and design an acquisition function based on probability of scenarios being Pareto-critical across a collection of line- and bus-based violation objectives. Our approach provides a statistical guarantee and offers an order of magnitude speed-up relative to a conservative exhaustive search. Case studies on realistic feeders with 200-400 buses demonstrate the effectiveness and accuracy of our approach.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
A groundwater market model
Authors:
Igor Cialenco,
Michael Ludkovski
Abstract:
We introduce the problem of groundwater trading, capturing the emergent groundwater market setups among stakeholders in a given groundwater basin. The agents optimize their production, taking into account their available water rights, the requisite water consumption, and the opportunity to trade water among themselves. We study the resulting Nash equilibrium, providing a full characterization of t…
▽ More
We introduce the problem of groundwater trading, capturing the emergent groundwater market setups among stakeholders in a given groundwater basin. The agents optimize their production, taking into account their available water rights, the requisite water consumption, and the opportunity to trade water among themselves. We study the resulting Nash equilibrium, providing a full characterization of the 1-period setting and initial results about the features of the multi-period game driven by the ability of agents to bank their water rights in order to smooth out the intertemporal shocks.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
Probabilistic Spatiotemporal Modeling of Day-Ahead Wind Power Generation with Input-Warped Gaussian Processes
Authors:
Qiqi Li,
Mike Ludkovski
Abstract:
We design a Gaussian Process (GP) spatiotemporal model to capture features of day-ahead wind power forecasts. We work with hourly-scale day-ahead forecasts across hundreds of wind farm locations, with the main aim of constructing a fully probabilistic joint model across space and hours of the day. To this end, we design a separable space-time kernel, implementing both temporal and spatial input wa…
▽ More
We design a Gaussian Process (GP) spatiotemporal model to capture features of day-ahead wind power forecasts. We work with hourly-scale day-ahead forecasts across hundreds of wind farm locations, with the main aim of constructing a fully probabilistic joint model across space and hours of the day. To this end, we design a separable space-time kernel, implementing both temporal and spatial input warping to capture the non-stationarity in the covariance of wind power. We conduct synthetic experiments to validate our choice of the spatial kernel and to demonstrate the effectiveness of warping in addressing nonstationarity. The second half of the paper is devoted to a detailed case study using a realistic, fully calibrated dataset representing wind farms in the ERCOT region of Texas.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
Least-Cost Structuring of 24/7 Carbon-Free Electricity Procurements
Authors:
Mike Ludkovski,
Saad Mouti,
Glen Swindle
Abstract:
We consider the construction of renewable portfolios targeting specified carbon-free (CFE) hourly performance scores. We work in a probabilistic framework that uses a collection of simulation scenarios and imposes probability constraints on achieving the desired CFE score. In our approach there is a fixed set of available CFE generators and a given load customer who seeks to minimize annual procur…
▽ More
We consider the construction of renewable portfolios targeting specified carbon-free (CFE) hourly performance scores. We work in a probabilistic framework that uses a collection of simulation scenarios and imposes probability constraints on achieving the desired CFE score. In our approach there is a fixed set of available CFE generators and a given load customer who seeks to minimize annual procurement costs. We illustrate results using a realistic dataset of jointly calibrated solar and wind assets, and compare different approaches to handling multiple loads.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Analyzing State-Level Longevity Trends with the U.S. Mortality Database
Authors:
Mike Ludkovski,
Doris Padilla
Abstract:
We investigate state-level age-specific mortality trends based on the United States Mortality Database (USMDB) published by the Human Mortality Database. In tandem with looking at the longevity experience across the 51 states, we also consider a collection of socio-demographic, economic and educational covariates that correlate with mortality trends. To obtain smoothed mortality surfaces for each…
▽ More
We investigate state-level age-specific mortality trends based on the United States Mortality Database (USMDB) published by the Human Mortality Database. In tandem with looking at the longevity experience across the 51 states, we also consider a collection of socio-demographic, economic and educational covariates that correlate with mortality trends. To obtain smoothed mortality surfaces for each state, we implement the machine learning framework of Multi-Output Gaussian Process regression (Huynh & Ludkovski 2021) on targeted groupings of 3-6 states. Our detailed exploratory analysis shows that the mortality experience is highly inhomogeneous across states in terms of respective Age structures. We moreover document multiple divergent trends between best and worst states, between Females and Males, and between younger and older Ages. The comparisons across the 50+ fitted models offer opportunities for rich insights about drivers of mortality in the U.S. and are visualized through numerous figures and an online interactive dashboard.
△ Less
Submitted 3 January, 2024; v1 submitted 3 December, 2023;
originally announced December 2023.
-
Extreme Scenario Selection in Day-Ahead Power Grid Operational Planning
Authors:
Guillermo Terrén-Serrano,
Michael Ludkovski
Abstract:
We propose and analyze the application of statistical functional depth metrics for the selection of extreme scenarios in day-ahead grid planning. Our primary motivation is screening of probabilistic scenarios for realized load and renewable generation, in order to identify scenarios most relevant for operational risk mitigation. To handle the high-dimensionality of the scenarios across asset class…
▽ More
We propose and analyze the application of statistical functional depth metrics for the selection of extreme scenarios in day-ahead grid planning. Our primary motivation is screening of probabilistic scenarios for realized load and renewable generation, in order to identify scenarios most relevant for operational risk mitigation. To handle the high-dimensionality of the scenarios across asset classes and intra-day periods, we employ functional measures of depth to sub-select outlying scenarios that are most likely to be the riskiest for the grid operation. We investigate a range of functional depth measures, as well as a range of operational risks, including load shedding, operational costs, reserves shortfall and variable renewable energy curtailment. The effectiveness of the proposed screening approach is demonstrated through a case study on the realistic Texas-7k grid.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Expressive Mortality Models through Gaussian Process Kernels
Authors:
Mike Ludkovski,
Jimmy Risk
Abstract:
We develop a flexible Gaussian Process (GP) framework for learning the covariance structure of Age- and Year-specific mortality surfaces. Utilizing the additive and multiplicative structure of GP kernels, we design a genetic programming algorithm to search for the most expressive kernel for a given population. Our compositional search builds off the Age-Period-Cohort (APC) paradigm to construct a…
▽ More
We develop a flexible Gaussian Process (GP) framework for learning the covariance structure of Age- and Year-specific mortality surfaces. Utilizing the additive and multiplicative structure of GP kernels, we design a genetic programming algorithm to search for the most expressive kernel for a given population. Our compositional search builds off the Age-Period-Cohort (APC) paradigm to construct a covariance prior best matching the spatio-temporal dynamics of a mortality dataset. We apply the resulting genetic algorithm (GA) on synthetic case studies to validate the ability of the GA to recover APC structure, and on real-life national-level datasets from the Human Mortality Database. Our machine-learning based analysis provides novel insight into the presence/absence of Cohort effects in different populations, and into the relative smoothness of mortality surfaces along the Age and Year dimensions. Our modelling work is done with the PyTorch libraries in Python and provides an in-depth investigation of employing GA to aid in compositional kernel search for GP surrogates.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Large Scale Probabilistic Simulation of Renewables Production
Authors:
Mike Ludkovski,
Glen Swindle,
Eric Grannan
Abstract:
We develop a probabilistic framework for joint simulation of short-term electricity generation from renewable assets. In this paper we describe a method for producing hourly day-ahead scenarios of generated power at grid-scale across hundreds of assets. These scenarios are conditional on specified forecasts and yield a full uncertainty quantification both at the marginal asset-level and across ass…
▽ More
We develop a probabilistic framework for joint simulation of short-term electricity generation from renewable assets. In this paper we describe a method for producing hourly day-ahead scenarios of generated power at grid-scale across hundreds of assets. These scenarios are conditional on specified forecasts and yield a full uncertainty quantification both at the marginal asset-level and across asset collections. Our simulation pipeline first applies asset calibration to normalize hourly, daily and seasonal generation profiles, and to Gaussianize the forecast--actuals distribution. We then develop a novel clustering approach to stably estimate the covariance matrix across assets; clustering is done hierarchically to achieve scalability. An extended case study using an ERCOT-like system with nearly 500 solar and wind farms is used for illustration.
△ Less
Submitted 10 May, 2022;
originally announced May 2022.
-
On Parametric Optimal Execution and Machine Learning Surrogates
Authors:
Tao Chen,
Mike Ludkovski,
Moritz Voß
Abstract:
We investigate optimal order execution problems in discrete time with instantaneous price impact and stochastic resilience. First, in the setting of linear transient price impact we derive a closed-form recursion for the optimal strategy, extending the deterministic results from Obizhaeva and Wang (J Financial Markets, 2013). Second, we develop a numerical algorithm based on dynamic programming an…
▽ More
We investigate optimal order execution problems in discrete time with instantaneous price impact and stochastic resilience. First, in the setting of linear transient price impact we derive a closed-form recursion for the optimal strategy, extending the deterministic results from Obizhaeva and Wang (J Financial Markets, 2013). Second, we develop a numerical algorithm based on dynamic programming and deep learning for the case of nonlinear transient price impact as proposed by Bouchaud et al. (Quant. Finance, 2004). Specifically, we utilize an actor-critic framework that constructs two neural-network (NN) surrogates for the value function and the feedback control. The flexible scalability of NN functional approximators enables parametric learning, i.e., incorporating several model or market parameters as part of the input space. Precise calibration of price impact, resilience, etc., is known to be extremely challenging and hence it is critical to understand sensitivity of the execution policy to these parameters. Our NN learner organically scales across multiple input dimensions and is shown to accurately approximate optimal strategies across a wide range of parameter configurations. We provide a fully reproducible Jupyter Notebook with our NN implementation, which is of independent pedagogical interest, demonstrating the ease of use of NN surrogates in (parametric) stochastic control problems.
△ Less
Submitted 29 October, 2023; v1 submitted 18 April, 2022;
originally announced April 2022.
-
Regression Monte Carlo for Impulse Control
Authors:
Mike Ludkovski
Abstract:
I develop a numerical algorithm for stochastic impulse control in the spirit of Regression Monte Carlo for optimal stopping. The approach consists in generating statistical surrogates (aka functional approximators) for the continuation function. The surrogates are recursively trained by empirical regression over simulated state trajectories. In parallel, the same surrogates are used to learn the i…
▽ More
I develop a numerical algorithm for stochastic impulse control in the spirit of Regression Monte Carlo for optimal stopping. The approach consists in generating statistical surrogates (aka functional approximators) for the continuation function. The surrogates are recursively trained by empirical regression over simulated state trajectories. In parallel, the same surrogates are used to learn the intervention function characterizing the optimal impulse amounts. I discuss appropriate surrogate types for this task, as well as the choice of training sets. Case studies from forest rotation and irreversible investment illustrate the numerical scheme and highlight its flexibility and extensibility. Implementation in \texttt{R} is provided as a publicly available package posted on GitHub.
△ Less
Submitted 12 March, 2022;
originally announced March 2022.
-
Joint Models for Cause-of-Death Mortality in Multiple Populations
Authors:
Nhan Huynh,
Mike Ludkovski
Abstract:
We investigate jointly modeling Age-specific rates of various causes of death in a multinational setting. We apply Multi-Output Gaussian Processes (MOGP), a spatial machine learning method, to smooth and extrapolate multiple cause-of-death mortality rates across several countries and both genders. To maintain flexibility and scalability, we investigate MOGPs with Kronecker-structured kernels and l…
▽ More
We investigate jointly modeling Age-specific rates of various causes of death in a multinational setting. We apply Multi-Output Gaussian Processes (MOGP), a spatial machine learning method, to smooth and extrapolate multiple cause-of-death mortality rates across several countries and both genders. To maintain flexibility and scalability, we investigate MOGPs with Kronecker-structured kernels and latent factors. In particular, we develop a custom multi-level MOGP that leverages the gridded structure of mortality tables to efficiently capture heterogeneity and dependence across different factor inputs. Results are illustrated with datasets from the Human Cause-of-Death Database (HCD). We discuss a case study involving cancer variations in three European nations, and a US-based study that considers eight top-level causes and includes comparison to all-cause analysis. Our models provide insights into the commonality of cause-specific mortality trends and demonstrate the opportunities for respective data fusion.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
Large-scale local surrogate modeling of stochastic simulation experiments
Authors:
D Austin Cole,
Robert B Gramacy,
Mike Ludkovski
Abstract:
Gaussian process (GP) surrogate modeling for large computer experiments is limited by cubic runtimes, especially with data from stochastic simulations with input-dependent noise. A popular workaround to reduce computational complexity involves local approximation (e.g., LAGP). However, LAGP has only been vetted in deterministic settings. A recent variation utilizing inducing points (LIGP) for addi…
▽ More
Gaussian process (GP) surrogate modeling for large computer experiments is limited by cubic runtimes, especially with data from stochastic simulations with input-dependent noise. A popular workaround to reduce computational complexity involves local approximation (e.g., LAGP). However, LAGP has only been vetted in deterministic settings. A recent variation utilizing inducing points (LIGP) for additional sparsity improves upon LAGP on the speed-vs-accuracy frontier. The authors show that another benefit of LIGP over LAGP is that (local) nugget estimation for stochastic responses is more natural, especially when designs contain substantial replication as is common when attempting to separate signal from noise. Woodbury identities, extended in LIGP from inducing points to replicates, afford efficient computation in terms of unique design locations only. This increases the amount of local data (i.e., the neighborhood size) that may be incorporated without additional flops, thereby enhancing statistical efficiency. Performance of the authors' LIGP upgrades is illustrated on benchmark data and real-world stochastic simulation experiments, including an options pricing control framework. Results indicatethat LIGP provides more accurate prediction and uncertainty quantification for varying data dimension and replication strategies versus modern alternatives.
△ Less
Submitted 30 May, 2022; v1 submitted 11 September, 2021;
originally announced September 2021.
-
mlOSP: Towards a Unified Implementation of Regression Monte Carlo Algorithms
Authors:
Mike Ludkovski
Abstract:
We introduce mlOSP, a computational template for Machine Learning for Optimal Stopping Problems. The template is implemented in the R statistical environment and publicly available via a GitHub repository. mlOSP presents a unified numerical implementation of Regression Monte Carlo (RMC) approaches to optimal stopping, providing a state-of-the-art, open-source, reproducible and transparent platform…
▽ More
We introduce mlOSP, a computational template for Machine Learning for Optimal Stopping Problems. The template is implemented in the R statistical environment and publicly available via a GitHub repository. mlOSP presents a unified numerical implementation of Regression Monte Carlo (RMC) approaches to optimal stopping, providing a state-of-the-art, open-source, reproducible and transparent platform. Highlighting its modular nature, we present multiple novel variants of RMC algorithms, especially in terms of constructing simulation designs for training the regressors, as well as in terms of machine learning regression modules. Furthermore, mlOSP nests most of the existing RMC schemes, allowing for a consistent and verifiable benchmarking of extant algorithms. The article contains extensive R code snippets and figures, and serves as a vignette to the underlying software package.
△ Less
Submitted 2 October, 2022; v1 submitted 1 December, 2020;
originally announced December 2020.
-
KrigHedge: Gaussian Process Surrogates for Delta Hedging
Authors:
Mike Ludkovski,
Yuri Saporito
Abstract:
We investigate a machine learning approach to option Greeks approximation based on Gaussian process (GP) surrogates. The method takes in noisily observed option prices, fits a nonparametric input-output map and then analytically differentiates the latter to obtain the various price sensitivities. Our motivation is to compute Greeks in cases where direct computation is expensive, such as in local v…
▽ More
We investigate a machine learning approach to option Greeks approximation based on Gaussian process (GP) surrogates. The method takes in noisily observed option prices, fits a nonparametric input-output map and then analytically differentiates the latter to obtain the various price sensitivities. Our motivation is to compute Greeks in cases where direct computation is expensive, such as in local volatility models, or can only ever be done approximately. We provide a detailed analysis of numerous aspects of GP surrogates, including choice of kernel family, simulation design, choice of trend function and impact of noise.
We further discuss the application to Delta hedging, including a new Lemma that relates quality of the Delta approximation to discrete-time hedging loss. Results are illustrated with two extensive case studies that consider estimation of Delta, Theta and Gamma and benchmark approximation quality and uncertainty quantification using a variety of statistical metrics. Among our key take-aways are the recommendation to use Matern kernels, the benefit of including virtual training points to capture boundary conditions, and the significant loss of fidelity when training on stock-path-based datasets.
△ Less
Submitted 14 January, 2022; v1 submitted 16 October, 2020;
originally announced October 2020.
-
An Impulse-Regime Switching Game Model of Vertical Competition
Authors:
René Aïd,
Luciano Campi,
Liangchen Li,
Mike Ludkovski
Abstract:
We study a new kind of non-zero-sum stochastic differential game with mixed impulse/switching controls, motivated by strategic competition in commodity markets. A representative upstream firm produces a commodity that is used by a representative downstream firm to produce a final consumption good. Both firms can influence the price of the commodity. By shutting down or increasing generation capaci…
▽ More
We study a new kind of non-zero-sum stochastic differential game with mixed impulse/switching controls, motivated by strategic competition in commodity markets. A representative upstream firm produces a commodity that is used by a representative downstream firm to produce a final consumption good. Both firms can influence the price of the commodity. By shutting down or increasing generation capacities, the upstream firm influences the price with impulses. By switching (or not) to a substitute, the downstream firm influences the drift of the commodity price process. We study the resulting impulse--regime switching game between the two firms, focusing on explicit threshold-type equilibria. Remarkably, this class of games naturally gives rise to multiple Nash equilibria, which we obtain via a verification based approach. We exhibit three types of equilibria depending on the ultimate number of switches by the downstream firm (zero, one or an infinite number of switches). We illustrate the diversification effect provided by vertical integration in the specific case of the crude oil market. Our analysis shows that the diversification gains strongly depend on the pass-through from the crude price to the gasoline price.
△ Less
Submitted 8 June, 2020;
originally announced June 2020.
-
Adaptive Batching for Gaussian Process Surrogates with Application in Noisy Level Set Estimation
Authors:
Xiong Lyu,
Mike Ludkovski
Abstract:
We develop adaptive replicated designs for Gaussian process metamodels of stochastic experiments. Adaptive batching is a natural extension of sequential design heuristics with the benefit of replication growing as response features are learned, inputs concentrate, and the metamodeling overhead rises. Motivated by the problem of learning the level set of the mean simulator response we develop four…
▽ More
We develop adaptive replicated designs for Gaussian process metamodels of stochastic experiments. Adaptive batching is a natural extension of sequential design heuristics with the benefit of replication growing as response features are learned, inputs concentrate, and the metamodeling overhead rises. Motivated by the problem of learning the level set of the mean simulator response we develop four novel schemes: Multi-Level Batching (MLB), Ratchet Batching (RB), Adaptive Batched Stepwise Uncertainty Reduction (ABSUR), Adaptive Design with Stepwise Allocation (ADSA) and Deterministic Design with Stepwise Allocation (DDSA). Our algorithms simultaneously (MLB, RB and ABSUR) or sequentially (ADSA and DDSA) determine the sequential design inputs and the respective number of replicates. Illustrations using synthetic examples and an application in quantitative finance (Bermudan option pricing via Regression Monte Carlo) show that adaptive batching brings significant computational speed-ups with minimal loss of modeling fidelity.
△ Less
Submitted 13 July, 2021; v1 submitted 19 March, 2020;
originally announced March 2020.
-
Multi-Output Gaussian Processes for Multi-Population Longevity Modeling
Authors:
Nhan Huynh,
Mike Ludkovski
Abstract:
We investigate joint modeling of longevity trends using the spatial statistical framework of Gaussian Process regression. Our analysis is motivated by the Human Mortality Database (HMD) that provides unified raw mortality tables for nearly 40 countries. Yet few stochastic models exist for handling more than two populations at a time. To bridge this gap, we leverage a spatial covariance framework f…
▽ More
We investigate joint modeling of longevity trends using the spatial statistical framework of Gaussian Process regression. Our analysis is motivated by the Human Mortality Database (HMD) that provides unified raw mortality tables for nearly 40 countries. Yet few stochastic models exist for handling more than two populations at a time. To bridge this gap, we leverage a spatial covariance framework from machine learning that treats populations as distinct levels of a factor covariate, explicitly capturing the cross-population dependence. The proposed multi-output Gaussian Process models straightforwardly scale up to a dozen populations and moreover intrinsically generate coherent joint longevity scenarios. In our numerous case studies we investigate predictive gains from aggregating mortality experience across nations and genders, including by borrowing the most recently available "foreign" data. We show that in our approach, information fusion leads to more precise (and statistically more credible) forecasts. We implement our models in \texttt{R}, as well as a Bayesian version in \texttt{Stan} that provides further uncertainty quantification regarding the estimated mortality covariance structure. All examples utilize public HMD datasets.
△ Less
Submitted 5 March, 2020;
originally announced March 2020.
-
A Machine Learning Approach to Adaptive Robust Utility Maximization and Hedging
Authors:
Tao Chen,
Michael Ludkovski
Abstract:
We investigate the adaptive robust control framework for portfolio optimization and loss-based hedging under drift and volatility uncertainty. Adaptive robust problems offer many advantages but require handling a double optimization problem (infimum over market measures, supremum over the control) at each instance. Moreover, the underlying Bellman equations are intrinsically multi-dimensional. We…
▽ More
We investigate the adaptive robust control framework for portfolio optimization and loss-based hedging under drift and volatility uncertainty. Adaptive robust problems offer many advantages but require handling a double optimization problem (infimum over market measures, supremum over the control) at each instance. Moreover, the underlying Bellman equations are intrinsically multi-dimensional. We propose a novel machine learning approach that solves for the local saddle-point at a chosen set of inputs and then uses a nonparametric (Gaussian process) regression to obtain a functional representation of the value function. Our algorithm resembles control randomization and regression Monte Carlo techniques but also brings multiple innovations, including adaptive experimental design, separate surrogates for optimal control and the local worst-case measure, and computational speed-ups for the sup-inf optimization. Thanks to the new scheme we are able to consider settings that have been previously computationally intractable and provide several new financial insights about learning and optimal trading under unknown market parameters. In particular, we demonstrate the financial advantages of adaptive robust framework compared to adaptive and static robust alternatives.
△ Less
Submitted 4 May, 2020; v1 submitted 30 November, 2019;
originally announced December 2019.
-
Statistical Learning for Probability-Constrained Stochastic Optimal Control
Authors:
Alessandro Balata,
Michael Ludkovski,
Aditya Maheshwari,
Jan Palczewski
Abstract:
We investigate Monte Carlo based algorithms for solving stochastic control problems with probabilistic constraints. Our motivation comes from microgrid management, where the controller tries to optimally dispatch a diesel generator while maintaining low probability of blackouts. The key question we investigate are empirical simulation procedures for learning the admissible control set that is spec…
▽ More
We investigate Monte Carlo based algorithms for solving stochastic control problems with probabilistic constraints. Our motivation comes from microgrid management, where the controller tries to optimally dispatch a diesel generator while maintaining low probability of blackouts. The key question we investigate are empirical simulation procedures for learning the admissible control set that is specified implicitly through a probability constraint on the system state. We propose a variety of relevant statistical tools including logistic regression, Gaussian process regression, quantile regression and support vector machines, which we then incorporate into an overall Regression Monte Carlo (RMC) framework for approximate dynamic programming. Our results indicate that using logistic or Gaussian process regression to estimate the admissibility probability outperforms the other options. Our algorithms offer an efficient and reliable extension of RMC to probability-constrained control. We illustrate our findings with two case studies for the microgrid problem.
△ Less
Submitted 23 August, 2020; v1 submitted 30 April, 2019;
originally announced May 2019.
-
Dynamic Contagion in a Banking System with Births and Defaults
Authors:
Tomoyuki Ichiba,
Michael Ludkovski,
Andrey Sarantsev
Abstract:
We consider a dynamic model of interconnected banks. New banks can emerge, and existing banks can default, creating a birth-and-death setup. Microscopically, banks evolve as independent geometric Brownian motions. Systemic effects are captured through default contagion: as one bank defaults, reserves of other banks are reduced by a random proportion. After examining the long-term stability of this…
▽ More
We consider a dynamic model of interconnected banks. New banks can emerge, and existing banks can default, creating a birth-and-death setup. Microscopically, banks evolve as independent geometric Brownian motions. Systemic effects are captured through default contagion: as one bank defaults, reserves of other banks are reduced by a random proportion. After examining the long-term stability of this system, we investigate mean-field limits as the number of banks tends to infinity. Our main results concern the measure-valued scaling limit which is governed by a McKean-Vlasov jump-diffusion. The default impact creates a mean-field drift, while the births and defaults introduce jump terms tied to the current distribution of the process. Individual dynamics in the limit is described by the propagation of chaos phenomenon. In certain cases, we explicitly characterize the limiting average reserves.
△ Less
Submitted 27 May, 2019; v1 submitted 25 July, 2018;
originally announced July 2018.
-
Evaluating Gaussian Process Metamodels and Sequential Designs for Noisy Level Set Estimation
Authors:
Xiong Lyu,
Mickael Binois,
Michael Ludkovski
Abstract:
We consider the problem of learning the level set for which a noisy black-box function exceeds a given threshold. To efficiently reconstruct the level set, we investigate Gaussian process (GP) metamodels. Our focus is on strongly stochastic samplers, in particular with heavy-tailed simulation noise and low signal-to-noise ratio. To guard against noise misspecification, we assess the performance of…
▽ More
We consider the problem of learning the level set for which a noisy black-box function exceeds a given threshold. To efficiently reconstruct the level set, we investigate Gaussian process (GP) metamodels. Our focus is on strongly stochastic samplers, in particular with heavy-tailed simulation noise and low signal-to-noise ratio. To guard against noise misspecification, we assess the performance of three variants: (i) GPs with Student-$t$ observations; (ii) Student-$t$ processes (TPs); and (iii) classification GPs modeling the sign of the response. In conjunction with these metamodels, we analyze several acquisition functions for guiding the sequential experimental designs, extending existing stepwise uncertainty reduction criteria to the stochastic contour-finding context. This also motivates our development of (approximate) updating formulas to efficiently compute such acquisition functions. Our schemes are benchmarked by using a variety of synthetic experiments in 1--6 dimensions. We also consider an application of level set estimation for determining the optimal exercise policy of Bermudan options in finance.
△ Less
Submitted 1 March, 2020; v1 submitted 17 July, 2018;
originally announced July 2018.
-
Stochastic Switching Games
Authors:
Liangchen Li,
Michael Ludkovski
Abstract:
We study nonzero-sum stochastic switching games. Two players compete for market dominance through controlling (via timing options) the discrete-state market regime $M$. Switching decisions are driven by a continuous stochastic factor $X$ that modulates instantaneous revenue rates and switching costs. This generates a competitive feedback between the short-term fluctuations due to $X$ and the mediu…
▽ More
We study nonzero-sum stochastic switching games. Two players compete for market dominance through controlling (via timing options) the discrete-state market regime $M$. Switching decisions are driven by a continuous stochastic factor $X$ that modulates instantaneous revenue rates and switching costs. This generates a competitive feedback between the short-term fluctuations due to $X$ and the medium-term advantages based on $M$. We construct threshold-type Feedback Nash Equilibria which characterize stationary strategies describing long-run dynamic equilibrium market organization. Two sequential approximation schemes link the switching equilibrium to (i) constrained optimal switching, (ii) multi-stage timing games. We provide illustrations using an Ornstein-Uhlenbeck $X$ that leads to a recurrent equilibrium $M^\ast$ and a Geometric Brownian Motion $X$ that makes $M^\ast$ eventually "absorbed" as one player eventually gains permanent advantage. Explicit computations and comparative statics regarding the emergent macroscopic market equilibrium are also provided.
△ Less
Submitted 10 July, 2018;
originally announced July 2018.
-
Probabilistic Bisection with Spatial Metamodels
Authors:
Sergio Rodriguez,
Mike Ludkovski
Abstract:
Probabilistic Bisection Algorithm performs root finding based on knowledge acquired from noisy oracle responses. We consider the generalized PBA setting (G-PBA) where the statistical distribution of the oracle is unknown and location-dependent, so that model inference and Bayesian knowledge updating must be performed simultaneously. To this end, we propose to leverage the spatial structure of a ty…
▽ More
Probabilistic Bisection Algorithm performs root finding based on knowledge acquired from noisy oracle responses. We consider the generalized PBA setting (G-PBA) where the statistical distribution of the oracle is unknown and location-dependent, so that model inference and Bayesian knowledge updating must be performed simultaneously. To this end, we propose to leverage the spatial structure of a typical oracle by constructing a statistical surrogate for the underlying logistic regression step. We investigate several non-parametric surrogates, including Binomial Gaussian Processes (B-GP), Polynomial, Kernel, and Spline Logistic Regression. In parallel, we develop sampling policies that adaptively balance learning the oracle distribution and learning the root. One of our proposals mimics active learning with B-GPs and provides a novel look-ahead predictive variance formula. The resulting gains of our Spatial PBA algorithm relative to earlier G-PBA models are illustrated with synthetic examples and a challenging stochastic root finding problem from Bermudan option pricing.
△ Less
Submitted 29 June, 2018;
originally announced July 2018.
-
Simulation Methods for Stochastic Storage Problems: A Statistical Learning Perspective
Authors:
Michael Ludkovski,
Aditya Maheshwari
Abstract:
We consider solution of stochastic storage problems through regression Monte Carlo (RMC) methods. Taking a statistical learning perspective, we develop the dynamic emulation algorithm (DEA) that unifies the different existing approaches in a single modular template. We then investigate the two central aspects of regression architecture and experimental design that constitute DEA. For the regressio…
▽ More
We consider solution of stochastic storage problems through regression Monte Carlo (RMC) methods. Taking a statistical learning perspective, we develop the dynamic emulation algorithm (DEA) that unifies the different existing approaches in a single modular template. We then investigate the two central aspects of regression architecture and experimental design that constitute DEA. For the regression piece, we discuss various non-parametric approaches, in particular introducing the use of Gaussian process regression in the context of stochastic storage. For simulation design, we compare the performance of traditional design (grid discretization), against space-filling, and several adaptive alternatives. The overall DEA template is illustrated with multiple examples drawing from natural gas storage valuation and optimal control of back-up generator in a microgrid.
△ Less
Submitted 29 March, 2018;
originally announced March 2018.
-
Generalized Probabilistic Bisection for Stochastic Root-Finding
Authors:
Sergio Rodriguez,
Michael Ludkovski
Abstract:
We consider numerical schemes for root finding of noisy responses through generalizing the Probabilistic Bisection Algorithm (PBA) to the more practical context where the sampling distribution is unknown and location-dependent. As in standard PBA, we rely on a knowledge state for the approximate posterior of the root location. To implement the corresponding Bayesian updating, we also carry out inf…
▽ More
We consider numerical schemes for root finding of noisy responses through generalizing the Probabilistic Bisection Algorithm (PBA) to the more practical context where the sampling distribution is unknown and location-dependent. As in standard PBA, we rely on a knowledge state for the approximate posterior of the root location. To implement the corresponding Bayesian updating, we also carry out inference of oracle accuracy, namely learning the probability of correct response. To this end we utilize batched querying in combination with a variety of frequentist and Bayesian estimators based on majority vote, as well as the underlying functional responses, if available. For guiding sampling selection we investigate both Information Directed sampling, as well as Quantile sampling. Our numerical experiments show that these strategies perform quite differently; in particular we demonstrate the efficiency of randomized quantile sampling which is reminiscent of Thompson sampling. Our work is motivated by the root-finding sub-routine in pricing of Bermudan financial derivatives, illustrated in the last section of the paper.
△ Less
Submitted 2 November, 2017;
originally announced November 2017.
-
Sequential Design and Spatial Modeling for Portfolio Tail Risk Measurement
Authors:
Michael Ludkovski,
James Risk
Abstract:
We consider calculation of capital requirements when the underlying economic scenarios are determined by simulatable risk factors. In the respective nested simulation framework, the goal is to estimate portfolio tail risk, quantified via VaR or TVaR of a given collection of future economic scenarios representing factor levels at the risk horizon. Traditionally, evaluating portfolio losses of an ou…
▽ More
We consider calculation of capital requirements when the underlying economic scenarios are determined by simulatable risk factors. In the respective nested simulation framework, the goal is to estimate portfolio tail risk, quantified via VaR or TVaR of a given collection of future economic scenarios representing factor levels at the risk horizon. Traditionally, evaluating portfolio losses of an outer scenario is done by computing a conditional expectation via inner-level Monte Carlo and is computationally expensive. We introduce several inter-related machine learning techniques to speed up this computation, in particular by properly accounting for the simulation noise. Our main workhorse is an advanced Gaussian Process (GP) regression approach which uses nonparametric spatial modeling to efficiently learn the relationship between the stochastic factors defining scenarios and corresponding portfolio value. Leveraging this emulator, we develop sequential algorithms that adaptively allocate inner simulation budgets to target the quantile region. The GP framework also yields better uncertainty quantification for the resulting VaR/TVaR estimators that reduces bias and variance compared to existing methods. We illustrate the proposed strategies with two case-studies in two and six dimensions.
△ Less
Submitted 17 May, 2018; v1 submitted 14 October, 2017;
originally announced October 2017.
-
Mean Field Game Approach to Production and Exploration of Exhaustible Commodities
Authors:
Michael Ludkovski,
Xuwei Yang
Abstract:
In a game theoretic framework, we study energy markets with a continuum of homogenous producers who produce energy from an exhaustible resource such as oil. Each producer simultaneously optimizes production rate that drives her revenues, as well as exploration effort to replenish her reserves. This exploration activity is modeled through a controlled point process that leads to stochastic incremen…
▽ More
In a game theoretic framework, we study energy markets with a continuum of homogenous producers who produce energy from an exhaustible resource such as oil. Each producer simultaneously optimizes production rate that drives her revenues, as well as exploration effort to replenish her reserves. This exploration activity is modeled through a controlled point process that leads to stochastic increments to reserves level. The producers interact with each other through the market price that depends on the aggregate production. We employ a mean field game approach to solve for a Markov Nash equilibrium and develop numerical schemes to solve the resulting system of non-local HJB and transport equations with non-local coupling. A time-stationary formulation is also explored, as well as the fluid limit where exploration becomes deterministic.
△ Less
Submitted 14 October, 2017;
originally announced October 2017.
-
Replication or exploration? Sequential design for stochastic simulation experiments
Authors:
Mickael Binois,
Jiangeng Huang,
Robert B Gramacy,
Mike Ludkovski
Abstract:
We investigate the merits of replication, and provide methods for optimal design (including replicates), with the goal of obtaining globally accurate emulation of noisy computer simulation experiments. We first show that replication can be beneficial from both design and computational perspectives, in the context of Gaussian process surrogate modeling. We then develop a lookahead based sequential…
▽ More
We investigate the merits of replication, and provide methods for optimal design (including replicates), with the goal of obtaining globally accurate emulation of noisy computer simulation experiments. We first show that replication can be beneficial from both design and computational perspectives, in the context of Gaussian process surrogate modeling. We then develop a lookahead based sequential design scheme that can determine if a new run should be at an existing input location (i.e., replicate) or at a new one (explore). When paired with a newly developed heteroskedastic Gaussian process model, our dynamic design scheme facilitates learning of signal and noise relationships which can vary throughout the input space. We show that it does so efficiently, on both computational and statistical grounds. In addition to illustrative synthetic examples, we demonstrate performance on two challenging real-data simulation experiments, from inventory management and epidemiology.
△ Less
Submitted 25 January, 2019; v1 submitted 9 October, 2017;
originally announced October 2017.
-
Order Flows and Limit Order Book Resiliency on the Meso-Scale
Authors:
Kyle Bechler,
Michael Ludkovski
Abstract:
We investigate the behavior of limit order books on the meso-scale motivated by order execution scheduling algorithms. To do so we carry out empirical analysis of the order flows from market and limit order submissions, aggregated from tick-by-tick data via volume-based bucketing, as well as various LOB depth and shape metrics. We document a nonlinear relationship between trade imbalance and price…
▽ More
We investigate the behavior of limit order books on the meso-scale motivated by order execution scheduling algorithms. To do so we carry out empirical analysis of the order flows from market and limit order submissions, aggregated from tick-by-tick data via volume-based bucketing, as well as various LOB depth and shape metrics. We document a nonlinear relationship between trade imbalance and price change, which however can be converted into a linear link by considering a weighted average of market and limit order flows. We also document a hockey-stick dependence between trade imbalance and one-sided limit order flows, highlighting numerous asymmetric effects between the active and passive sides of the LOB. To address the phenomenological features of price formation, book resilience, and scarce liquidity we apply a variety of statistical models to test for predictive power of different predictors. We show that on the meso-scale the limit order flows (as well as the relative addition/cancellation rates) carry the most predictive power. Another finding is that the deeper LOB shape, rather than just the book imbalance, is more relevant on this timescale. The empirical results are based on analysis of six large-tick assets from Nasdaq.
△ Less
Submitted 9 August, 2017;
originally announced August 2017.
-
Practical heteroskedastic Gaussian process modeling for large simulation experiments
Authors:
Mickael Binois,
Robert B. Gramacy,
Michael Ludkovski
Abstract:
We present a unified view of likelihood based Gaussian progress regression for simulation experiments exhibiting input-dependent noise. Replication plays an important role in that context, however previous methods leveraging replicates have either ignored the computational savings that come from such design, or have short-cut full likelihood-based inference to remain tractable. Starting with homos…
▽ More
We present a unified view of likelihood based Gaussian progress regression for simulation experiments exhibiting input-dependent noise. Replication plays an important role in that context, however previous methods leveraging replicates have either ignored the computational savings that come from such design, or have short-cut full likelihood-based inference to remain tractable. Starting with homoskedastic processes, we show how multiple applications of a well-known Woodbury identity facilitate inference for all parameters under the likelihood (without approximation), bypassing the typical full-data sized calculations. We then borrow a latent-variable idea from machine learning to address heteroskedasticity, adapting it to work within the same thrifty inferential framework, thereby simultaneously leveraging the computational and statistical efficiency of designs with replication. The result is an inferential scheme that can be characterized as single objective function, complete with closed form derivatives, for rapid library-based optimization. Illustrations are provided, including real-world simulation experiments from manufacturing and the management of epidemics.
△ Less
Submitted 13 November, 2017; v1 submitted 17 November, 2016;
originally announced November 2016.
-
Gaussian Process Models for Mortality Rates and Improvement Factors
Authors:
Mike Ludkovski,
Jimmy Risk,
Howard Zail
Abstract:
We develop a Gaussian process ("GP") framework for modeling mortality rates and mortality improvement factors. GP regression is a nonparametric, data-driven approach for determining the spatial dependence in mortality rates and jointly smoothing raw rates across dimensions, such as calendar year and age. The GP model quantifies uncertainty associated with smoothed historical experience and generat…
▽ More
We develop a Gaussian process ("GP") framework for modeling mortality rates and mortality improvement factors. GP regression is a nonparametric, data-driven approach for determining the spatial dependence in mortality rates and jointly smoothing raw rates across dimensions, such as calendar year and age. The GP model quantifies uncertainty associated with smoothed historical experience and generates full stochastic trajectories for out-of-sample forecasts. Our framework is well suited for updating projections when newly available data arrives, and for dealing with "edge" issues where credibility is lower. We present a detailed analysis of Gaussian process model performance for US mortality experience based on the CDC datasets. We investigate the interaction between mean and residual modeling, Bayesian and non-Bayesian GP methodologies, accuracy of in-sample and out-of-sample forecasting, and stability of model parameters. We also document the general decline, along with strong age-dependency, in mortality improvement factors over the past few years, contrasting our findings with the Society of Actuaries ("SOA") MP-2014 and -2015 models that do not fully reflect these recent trends.
△ Less
Submitted 11 April, 2018; v1 submitted 29 August, 2016;
originally announced August 2016.
-
Bayesian Epidemic Detection in Multiple Populations
Authors:
Michael Ludkovski,
Katherine Shatskikh
Abstract:
Traditional epidemic detection algorithms make decisions using only local information. We propose a novel approach that explicitly models spatial information fusion from several metapopulations. Our method also takes into account cost-benefit considerations regarding the announcement of epidemic. We utilize a compartmental stochastic model within a Bayesian detection framework which leads to a dyn…
▽ More
Traditional epidemic detection algorithms make decisions using only local information. We propose a novel approach that explicitly models spatial information fusion from several metapopulations. Our method also takes into account cost-benefit considerations regarding the announcement of epidemic. We utilize a compartmental stochastic model within a Bayesian detection framework which leads to a dynamic optimization problem. The resulting adaptive, non-parametric detection strategy optimally balances detection delay vis-a-vis probability of false alarms. Taking advantage of the underlying state-space structure, we represent the stopping rule in terms of a detection map which visualizes the relationship between the multivariate system state and policy making. It also allows us to obtain an efficient simulation-based solution algorithm that is based on the Sequential Regression Monte Carlo (SRMC) approach of Gramacy and Ludkovski (SIFIN, 2015). We illustrate our results on synthetic examples and also quantify the advantages of our adaptive detection relative to conventional threshold-based strategies.
△ Less
Submitted 14 September, 2015;
originally announced September 2015.
-
Kriging Metamodels and Experimental Design for Bermudan Option Pricing
Authors:
Michael Ludkovski
Abstract:
We investigate two new strategies for the numerical solution of optimal stopping problems within the Regression Monte Carlo (RMC) framework of Longstaff and Schwartz. First, we propose the use of stochastic kriging (Gaussian process) meta-models for fitting the continuation value. Kriging offers a flexible, nonparametric regression approach that quantifies approximation quality. Second, we connect…
▽ More
We investigate two new strategies for the numerical solution of optimal stopping problems within the Regression Monte Carlo (RMC) framework of Longstaff and Schwartz. First, we propose the use of stochastic kriging (Gaussian process) meta-models for fitting the continuation value. Kriging offers a flexible, nonparametric regression approach that quantifies approximation quality. Second, we connect the choice of stochastic grids used in RMC to the Design of Experiments paradigm. We examine space-filling and adaptive experimental designs; we also investigate the use of batching with replicated simulations at design sites to improve the signal-to-noise ratio. Numerical case studies for valuing Bermudan Puts and Max-Calls under a variety of asset dynamics illustrate that our methods offer significant reduction in simulation budgets over existing approaches.
△ Less
Submitted 26 October, 2016; v1 submitted 7 September, 2015;
originally announced September 2015.
-
Sequential Design for Ranking Response Surfaces
Authors:
Ruimeng Hu,
Mike Ludkovski
Abstract:
We propose and analyze sequential design methods for the problem of ranking several response surfaces. Namely, given $L \ge 2$ response surfaces over a continuous input space $\cal X$, the aim is to efficiently find the index of the minimal response across the entire $\cal X$. The response surfaces are not known and have to be noisily sampled one-at-a-time. This setting is motivated by stochastic…
▽ More
We propose and analyze sequential design methods for the problem of ranking several response surfaces. Namely, given $L \ge 2$ response surfaces over a continuous input space $\cal X$, the aim is to efficiently find the index of the minimal response across the entire $\cal X$. The response surfaces are not known and have to be noisily sampled one-at-a-time. This setting is motivated by stochastic control applications and requires joint experimental design both in space and response-index dimensions. To generate sequential design heuristics we investigate stepwise uncertainty reduction approaches, as well as sampling based on posterior classification complexity. We also make connections between our continuous-input formulation and the discrete framework of pure regret in multi-armed bandits. To model the response surfaces we utilize kriging surrogates. Several numerical examples using both synthetic data and an epidemics control problem are provided to illustrate our approach and the efficacy of respective adaptive designs.
△ Less
Submitted 12 July, 2016; v1 submitted 3 September, 2015;
originally announced September 2015.
-
Statistical Emulators for Pricing and Hedging Longevity Risk Products
Authors:
James Risk,
Michael Ludkovski
Abstract:
We propose the use of statistical emulators for the purpose of valuing mortality-linked contracts in stochastic mortality models. Such models typically require (nested) evaluation of expected values of nonlinear functionals of multi-dimensional stochastic processes. Except in the simplest cases, no closed-form expressions are available, necessitating numerical approximation. Rather than building a…
▽ More
We propose the use of statistical emulators for the purpose of valuing mortality-linked contracts in stochastic mortality models. Such models typically require (nested) evaluation of expected values of nonlinear functionals of multi-dimensional stochastic processes. Except in the simplest cases, no closed-form expressions are available, necessitating numerical approximation. Rather than building ad hoc analytic approximations, we advocate the use of modern statistical tools from machine learning to generate a flexible, non-parametric surrogate for the true mappings. This method allows performance guarantees regarding approximation accuracy and removes the need for nested simulation. We illustrate our approach with case studies involving (i) a Lee-Carter model with mortality shocks, (ii) index-based static hedging with longevity basis risk; (iii) a Cairns-Blake-Dowd stochastic survival probability model.
△ Less
Submitted 14 September, 2015; v1 submitted 3 August, 2015;
originally announced August 2015.
-
Optimal Execution with Dynamic Order Flow Imbalance
Authors:
Kyle Bechler,
Mike Ludkovski
Abstract:
We examine optimal execution models that take into account both market microstructure impact and informational costs. Informational footprint is related to order flow and is represented by the trader's influence on the flow imbalance process, while microstructure influence is captured by instantaneous price impact. We propose a continuous-time stochastic control problem that balances between these…
▽ More
We examine optimal execution models that take into account both market microstructure impact and informational costs. Informational footprint is related to order flow and is represented by the trader's influence on the flow imbalance process, while microstructure influence is captured by instantaneous price impact. We propose a continuous-time stochastic control problem that balances between these two costs. Incorporating order flow imbalance leads to the consideration of the current market state and specifically whether one's orders lean with or against the prevailing order flow, key components often ignored by execution models in the literature. In particular, to react to changing order flow, we endogenize the trading horizon $T$. After developing the general indefinite-horizon formulation, we investigate several tractable approximations that sequentially optimize over price impact and over $T$. These approximations, especially a dynamic version based on receding horizon control, are shown to be very accurate and connect to the prevailing Almgren-Chriss framework. We also discuss features of empirical order flow and links between our model and "Optimal Execution Horizon" by Easley et al (Mathematical Finance, 2013).
△ Less
Submitted 18 October, 2014; v1 submitted 9 September, 2014;
originally announced September 2014.
-
Sequential Design for Optimal Stopping Problems
Authors:
Robert B. Gramacy,
Mike Ludkovski
Abstract:
We propose a new approach to solve optimal stopping problems via simulation. Working within the backward dynamic programming/Snell envelope framework, we augment the methodology of Longstaff-Schwartz that focuses on approximating the stopping strategy. Namely, we introduce adaptive generation of the stochastic grids anchoring the simulated sample paths of the underlying state process. This allows…
▽ More
We propose a new approach to solve optimal stopping problems via simulation. Working within the backward dynamic programming/Snell envelope framework, we augment the methodology of Longstaff-Schwartz that focuses on approximating the stopping strategy. Namely, we introduce adaptive generation of the stochastic grids anchoring the simulated sample paths of the underlying state process. This allows for active learning of the classifiers partitioning the state space into the continuation and stopping regions. To this end, we examine sequential design schemes that adaptively place new design points close to the stopping boundaries. We then discuss dynamic regression algorithms that can implement such recursive estimation and local refinement of the classifiers. The new algorithm is illustrated with a variety of numerical experiments, showing that an order of magnitude savings in terms of design size can be achieved. We also compare with existing benchmarks in the context of pricing multi-dimensional Bermudan options.
△ Less
Submitted 29 July, 2014; v1 submitted 16 September, 2013;
originally announced September 2013.
-
Sequential Bayesian Inference in Hidden Markov Stochastic Kinetic Models with Application to Detection and Response to Seasonal Epidemics
Authors:
Junjing Lin,
Michael Ludkovski
Abstract:
We study sequential Bayesian inference in stochastic kinetic models with latent factors. Assuming continuous observation of all the reactions, our focus is on joint inference of the unknown reaction rates and the dynamic latent states, modeled as a hidden Markov factor. Using insights from nonlinear filtering of continuous-time jump Markov processes we develop a novel sequential Monte Carlo algori…
▽ More
We study sequential Bayesian inference in stochastic kinetic models with latent factors. Assuming continuous observation of all the reactions, our focus is on joint inference of the unknown reaction rates and the dynamic latent states, modeled as a hidden Markov factor. Using insights from nonlinear filtering of continuous-time jump Markov processes we develop a novel sequential Monte Carlo algorithm for this purpose. Our approach applies the ideas of particle learning to minimize particle degeneracy and exploit the analytical jump Markov structure. A motivating application of our methods is modeling of seasonal infectious disease outbreaks represented through a compartmental epidemic model. We demonstrate inference in such models with several numerical illustrations and also discuss predictive analysis of epidemic countermeasures using sequential Bayes estimates.
△ Less
Submitted 16 January, 2013;
originally announced January 2013.
-
Inventory Management with Partially Observed Nonstationary Demand
Authors:
Erhan Bayraktar,
Mike Ludkovski
Abstract:
We consider a continuous-time model for inventory management with Markov modulated non-stationary demands. We introduce active learning by assuming that the state of the world is unobserved and must be inferred by the manager. We also assume that demands are observed only when they are completely met. We first derive the explicit filtering equations and pass to an equivalent fully observed impulse…
▽ More
We consider a continuous-time model for inventory management with Markov modulated non-stationary demands. We introduce active learning by assuming that the state of the world is unobserved and must be inferred by the manager. We also assume that demands are observed only when they are completely met. We first derive the explicit filtering equations and pass to an equivalent fully observed impulse control problem in terms of the sufficient statistics, the a posteriori probability process and the current inventory level. We then solve this equivalent formulation and directly characterize an optimal inventory policy. We also describe a computational procedure to calculate the value function and the optimal policy and present two numerical illustrations.
△ Less
Submitted 27 June, 2012;
originally announced June 2012.
-
European Option Pricing with Liquidity Shocks
Authors:
Michael Ludkovski,
Qunying Shen
Abstract:
We study the valuation and hedging problem of European options in a market subject to liquidity shocks. Working within a Markovian regime-switching setting, we model illiquidity as the inability to trade. To isolate the impact of such liquidity constraints, we focus on the case where the market is completely static in the illiquid regime. We then consider derivative pricing using either equivalent…
▽ More
We study the valuation and hedging problem of European options in a market subject to liquidity shocks. Working within a Markovian regime-switching setting, we model illiquidity as the inability to trade. To isolate the impact of such liquidity constraints, we focus on the case where the market is completely static in the illiquid regime. We then consider derivative pricing using either equivalent martingale measures or exponential indifference mechanisms. Our main results concern the analysis of the semi-linear coupled HJB equation satisfied by the indifference price, as well as its asymptotics when the probability of a liquidity shock is small. We then present several numerical studies of the liquidity risk premia obtained in our models leading to practical guidelines on how to adjust for liquidity risk in option valuation and hedging.
△ Less
Submitted 4 May, 2012;
originally announced May 2012.
-
Finite Horizon Decision Timing with Partially Observable Poisson Processes
Authors:
Michael Ludkovski,
Semih Sezer
Abstract:
We study decision timing problems on finite horizon with Poissonian information arrivals. In our model, a decision maker wishes to optimally time her action in order to maximize her expected reward. The reward depends on an unobservable Markovian environment, and information about the environment is collected through a (compound) Poisson observation process. Examples of such systems arise in inves…
▽ More
We study decision timing problems on finite horizon with Poissonian information arrivals. In our model, a decision maker wishes to optimally time her action in order to maximize her expected reward. The reward depends on an unobservable Markovian environment, and information about the environment is collected through a (compound) Poisson observation process. Examples of such systems arise in investment timing, reliability theory, Bayesian regime detection and technology adoption models. We solve the problem by studying an optimal stopping problem for a piecewise-deterministic process which gives the posterior likelihoods of the unobservable environment. Our method lends itself to simple numerical implementation and we present several illustrative numerical examples.
△ Less
Submitted 7 May, 2011;
originally announced May 2011.
-
Liquidation in Limit Order Books with Controlled Intensity
Authors:
Erhan Bayraktar,
Michael Ludkovski
Abstract:
We consider a framework for solving optimal liquidation problems in limit order books. In particular, order arrivals are modeled as a point process whose intensity depends on the liquidation price. We set up a stochastic control problem in which the goal is to maximize the expected revenue from liquidating the entire position held. We solve this optimal liquidation problem for power-law and expone…
▽ More
We consider a framework for solving optimal liquidation problems in limit order books. In particular, order arrivals are modeled as a point process whose intensity depends on the liquidation price. We set up a stochastic control problem in which the goal is to maximize the expected revenue from liquidating the entire position held. We solve this optimal liquidation problem for power-law and exponential-decay order book models and discuss several extensions. We also consider the continuous selling (or fluid) limit when the trading units are ever smaller and the intensity is ever larger. This limit provides an analytical approximation to the value function and the optimal solution. Using techniques from viscosity solutions we show that the discrete state problem and its optimal solution converge to the corresponding quantities in the continuous selling limit uniformly on compacts.
△ Less
Submitted 26 January, 2012; v1 submitted 2 May, 2011;
originally announced May 2011.
-
Optimal Timing to Purchase Options
Authors:
Tim Leung,
Michael Ludkovski
Abstract:
We study the optimal timing of derivative purchases in incomplete markets. In our model, an investor attempts to maximize the spread between her model price and the offered market price through optimally timing her purchase. Both the investor and the market value the options by risk-neutral expectations but under different equivalent martingale measures representing different market views. The str…
▽ More
We study the optimal timing of derivative purchases in incomplete markets. In our model, an investor attempts to maximize the spread between her model price and the offered market price through optimally timing her purchase. Both the investor and the market value the options by risk-neutral expectations but under different equivalent martingale measures representing different market views. The structure of the resulting optimal stopping problem depends on the interaction between the respective market price of risk and the option payoff. In particular, a crucial role is played by the delayed purchase premium that is related to the stochastic bracket between the market price and the buyer's risk premia. Explicit characterization of the purchase timing is given for two representative classes of Markovian models: (i) defaultable equity models with local intensity; (ii) diffusion stochastic volatility models. Several numerical examples are presented to illustrate the results. Our model is also applicable to the optimal rolling of long-dated options and sequential buying and selling of options.
△ Less
Submitted 3 April, 2011; v1 submitted 21 August, 2010;
originally announced August 2010.
-
Illiquidity Effects in Optimal Consumption-Investment Problems
Authors:
Michael Ludkovski,
Hyekyung Min
Abstract:
We study the effect of liquidity freezes on an economic agent optimizing her utility of consumption in a perturbed Black-Scholes-Merton model. The single risky asset follows a geometric Brownian motion but is subject to liquidity shocks, during which no trading is possible and stock dynamics are modified. The liquidity regime is governed by a two-state Markov chain. We derive the asymptotic effect…
▽ More
We study the effect of liquidity freezes on an economic agent optimizing her utility of consumption in a perturbed Black-Scholes-Merton model. The single risky asset follows a geometric Brownian motion but is subject to liquidity shocks, during which no trading is possible and stock dynamics are modified. The liquidity regime is governed by a two-state Markov chain. We derive the asymptotic effect of such freezes on optimal consumption and investment schedules in the two cases of (i) small probability of liquidity shock; (ii) fast-scale liquidity regime switching. Explicit formulas are obtained for logarithmic and hyperbolic utility maximizers on infinite horizon. We also derive the corresponding loss in utility and compare with a recent related finite-horizon model of Diesinger, Kraft and Seifried (2009).
△ Less
Submitted 29 September, 2010; v1 submitted 9 April, 2010;
originally announced April 2010.
-
Stochastic Switching Games and Duopolistic Competition in Emissions Markets
Authors:
Michael Ludkovski
Abstract:
We study optimal behavior of energy producers under a CO_2 emission abatement program. We focus on a two-player discrete-time model where each producer is sequentially optimizing her emission and production schedules. The game-theoretic aspect is captured through a reduced-form price-impact model for the CO_2 allowance price. Such duopolistic competition results in a new type of a non-zero-sum sto…
▽ More
We study optimal behavior of energy producers under a CO_2 emission abatement program. We focus on a two-player discrete-time model where each producer is sequentially optimizing her emission and production schedules. The game-theoretic aspect is captured through a reduced-form price-impact model for the CO_2 allowance price. Such duopolistic competition results in a new type of a non-zero-sum stochastic switching game on finite horizon. Existence of game Nash equilibria is established through generalization to randomized switching strategies. No uniqueness is possible and we therefore consider a variety of correlated equilibrium mechanisms. We prove existence of correlated equilibrium points in switching games and give a recursive description of equilibrium game values. A simulation-based algorithm to solve for the game values is constructed and a numerical example is presented.
△ Less
Submitted 21 August, 2010; v1 submitted 19 January, 2010;
originally announced January 2010.
-
A Simulation Approach to Optimal Stopping Under Partial Information
Authors:
Mike Ludkovski
Abstract:
We study the numerical solution of nonlinear partially observed optimal stopping problems. The system state is taken to be a multi-dimensional diffusion and drives the drift of the observation process, which is another multi-dimensional diffusion with correlated noise. Such models where the controller is not fully aware of her environment are of interest in applied probability and financial math…
▽ More
We study the numerical solution of nonlinear partially observed optimal stopping problems. The system state is taken to be a multi-dimensional diffusion and drives the drift of the observation process, which is another multi-dimensional diffusion with correlated noise. Such models where the controller is not fully aware of her environment are of interest in applied probability and financial mathematics. We propose a new approximate numerical algorithm based on the particle filtering and regression Monte Carlo methods. The algorithm maintains a continuous state-space and yields an integrated approach to the filtering and control sub-problems. Our approach is entirely simulation-based and therefore allows for a robust implementation with respect to model specification. We carry out the error analysis of our scheme and illustrate with several computational examples. An extension to discretely observed stochastic volatility models is also considered.
△ Less
Submitted 14 February, 2009;
originally announced February 2009.
-
Optimal Trade Execution in Illiquid Markets
Authors:
Erhan Bayraktar,
Mike Ludkovski
Abstract:
We study optimal trade execution strategies in financial markets with discrete order flow. The agent has a finite liquidation horizon and must minimize price impact given a random number of incoming trade counterparties. Assuming that the order flow $N$ is given by a Poisson process, we give a full analysis of the properties and computation of the optimal dynamic execution strategy. Extensions,…
▽ More
We study optimal trade execution strategies in financial markets with discrete order flow. The agent has a finite liquidation horizon and must minimize price impact given a random number of incoming trade counterparties. Assuming that the order flow $N$ is given by a Poisson process, we give a full analysis of the properties and computation of the optimal dynamic execution strategy. Extensions, whereby (a) $N$ is a fully-observed regime-switching Poisson process; and (b) $N$ is a Markov-modulated compound Poisson process driven by a hidden Markov chain, are also considered.
We derive and compare the properties of the three cases and illustrate our results with computational examples.
△ Less
Submitted 14 February, 2009;
originally announced February 2009.
-
Optimal Risk Sharing under Distorted Probabilities
Authors:
M. Ludkovski,
V. R. Young
Abstract:
We study optimal risk sharing among $n$ agents endowed with distortion risk measures. Our model includes market frictions that can either represent linear transaction costs or risk premia charged by a clearing house for the agents. Risk sharing under third-party constraints is also considered. We obtain an explicit formula for Pareto optimal allocations. In particular, we find that a stop-loss o…
▽ More
We study optimal risk sharing among $n$ agents endowed with distortion risk measures. Our model includes market frictions that can either represent linear transaction costs or risk premia charged by a clearing house for the agents. Risk sharing under third-party constraints is also considered. We obtain an explicit formula for Pareto optimal allocations. In particular, we find that a stop-loss or deductible risk sharing is optimal in the case of two agents and several common distortion functions. This extends recent result of Jouini et al. (2006) to the problem with unbounded risks and market frictions.
△ Less
Submitted 22 September, 2008;
originally announced September 2008.