Search | arXiv e-print repository

Partial-Information Q-Learning for General Two-Player Stochastic Games

Authors: Negash Medhin, Andrew Papanicolaou, Marwen Zrida

Abstract: In this article we analyze a partial-information Nash Q-learning algorithm for a general 2-player stochastic game. Partial information refers to the setting where a player does not know the strategy or the actions taken by the opposing player. We prove convergence of this partially informed algorithm for general 2-player games with finitely many states and actions, and we confirm that the limiting… ▽ More In this article we analyze a partial-information Nash Q-learning algorithm for a general 2-player stochastic game. Partial information refers to the setting where a player does not know the strategy or the actions taken by the opposing player. We prove convergence of this partially informed algorithm for general 2-player games with finitely many states and actions, and we confirm that the limiting strategy is in fact a full-information Nash equilibrium. In implementation, partial information offers simplicity because it avoids computation of Nash equilibria at every time step. In contrast, full-information Q-learning uses the Lemke-Howson algorithm to compute Nash equilibria at every time step, which can be an effective approach but requires several assumptions to prove convergence and may have runtime error if Lemke-Howson encounters degeneracy. In simulations, the partial information results we obtain are comparable to those for full-information Q-learning and fictitious play. △ Less

Submitted 21 February, 2023; originally announced February 2023.

arXiv:1908.02164 [pdf]

Statistical Arbitrage for Multiple Co-Integrated Stocks

Authors: T. N. Li, A. Papanicolaou

Abstract: In this article, we analyse optimal statistical arbitrage strategies from stochastic control and optimisation problems for multiple co-integrated stocks with eigenportfolios being factors. Optimal portfolio weights are found by solving a Hamilton-Jacobi-Bellman (HJB) partial differential equation, which we solve for both an unconstrained portfolio and a portfolio constrained to be market neutral.… ▽ More In this article, we analyse optimal statistical arbitrage strategies from stochastic control and optimisation problems for multiple co-integrated stocks with eigenportfolios being factors. Optimal portfolio weights are found by solving a Hamilton-Jacobi-Bellman (HJB) partial differential equation, which we solve for both an unconstrained portfolio and a portfolio constrained to be market neutral. Our analyses demonstrate sufficient conditions on the model parameters to ensure long-term stability of the HJB solutions and stable growth rates for the optimal portfolios. To gauge how these optimal portfolios behave in practice, we perform backtests on historical stock prices of the S&P 500 constituents from year 2000 through year 2021. These backtests suggest three key conclusions: that the proposed co-integrated model with eigenportfolios being factors can generate a large number of co-integrated stocks over a long time horizon, that the optimal portfolios are sensitive to parameter estimation, and that the statistical arbitrage strategies are more profitable in periods when overall market volatilities are high. △ Less

Submitted 8 February, 2022; v1 submitted 6 August, 2019; originally announced August 2019.

MSC Class: 62P05; 91B28; 93E20

arXiv:1711.05360 [pdf, other]

The Dispersion Bias

Authors: Lisa Goldberg, Alex Papanicolaou, Alex Shkolnik

Abstract: Estimation error has plagued quantitative finance since Harry Markowitz launched modern portfolio theory in 1952. Using random matrix theory, we characterize a source of bias in the sample eigenvectors of financial covariance matrices. Unchecked, the bias distorts weights of minimum variance portfolios and leads to risk forecasts that are severely biased downward. To address these issues, we devel… ▽ More Estimation error has plagued quantitative finance since Harry Markowitz launched modern portfolio theory in 1952. Using random matrix theory, we characterize a source of bias in the sample eigenvectors of financial covariance matrices. Unchecked, the bias distorts weights of minimum variance portfolios and leads to risk forecasts that are severely biased downward. To address these issues, we develop an eigenvector bias correction. Our approach is distinct from the regularization and eigenvalue shrinkage methods found in the literature. We provide theoretical guarantees on the improvement our correction provides as well as estimation methods for computing the optimal correction from data. △ Less

Submitted 15 February, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

MSC Class: 91G10; 62H25; 62H12; 40C05; 62J07; 65F15; 65C60

arXiv:1607.06158 [pdf, other]

Dimension Reduction in Statistical Estimation of Partially Observed Multiscale Processes

Authors: Andrew Papanicolaou, Konstantinos Spiliopoulos

Abstract: We consider partially observed multiscale diffusion models that are specified up to an unknown vector parameter. We establish for a very general class of test functions that the filter of the original model converges to a filter of reduced dimension. Then, this result is used to justify statistical estimation for the unknown parameters of interest based on the model of reduced dimension but using… ▽ More We consider partially observed multiscale diffusion models that are specified up to an unknown vector parameter. We establish for a very general class of test functions that the filter of the original model converges to a filter of reduced dimension. Then, this result is used to justify statistical estimation for the unknown parameters of interest based on the model of reduced dimension but using the original available data. This allows to learn the unknown parameters of interest while working in lower dimensions, as opposed to working with the original high dimensional system. Simulation studies support and illustrate the theoretical results. △ Less

Submitted 26 November, 2017; v1 submitted 20 July, 2016; originally announced July 2016.

Comments: SIAM Journal of Uncertainty Quantification, 2017

MSC Class: 93E10; 93E11; 93C70; 62M07; 62M86

arXiv:1504.05309 [pdf, other]

Introduction to Stochastic Differential Equations (SDEs) for Finance

Authors: Andrew Papanicolaou

Abstract: These are course notes on the application of SDEs to options pricing. The author was partially supported by NSF grant DMS-0739195. These are course notes on the application of SDEs to options pricing. The author was partially supported by NSF grant DMS-0739195. △ Less

Submitted 2 January, 2019; v1 submitted 21 April, 2015; originally announced April 2015.

Comments: These are an evolving set of course notes. Eventually I hope to make them a book. They are posted on the arXiv so that others may see my approach to the topic

arXiv:1406.1936 [pdf, other]

Stochastic Analysis Seminar on Filtering Theory

Authors: Andrew Papanicolaou

Abstract: These notes were originally written for the Stochastic Analysis Seminar in the Department of Operations Research and Financial Engineering at Princeton University, in February of 2011. The seminar was attended and supported by members of the Research Training Group, with the author being partially supported by NSF grant DMS-0739195. These notes were originally written for the Stochastic Analysis Seminar in the Department of Operations Research and Financial Engineering at Princeton University, in February of 2011. The seminar was attended and supported by members of the Research Training Group, with the author being partially supported by NSF grant DMS-0739195. △ Less

Submitted 1 October, 2016; v1 submitted 7 June, 2014; originally announced June 2014.

Comments: 94 pages

arXiv:1305.1918 [pdf, other]

doi 10.1137/140952648

Filtering the Maximum Likelihood for Multiscale Problems

Authors: Andrew Papanicolaou, Konstantinos Spiliopoulos

Abstract: Filtering and parameter estimation under partial information for multiscale problems is studied in this paper. After proving mean square convergence of the nonlinear filter to a filter of reduced dimension, we establish that the conditional (on the observations) log-likelihood process has a correction term given by a type of central limit theorem. To achieve this we assume that the operator of the… ▽ More Filtering and parameter estimation under partial information for multiscale problems is studied in this paper. After proving mean square convergence of the nonlinear filter to a filter of reduced dimension, we establish that the conditional (on the observations) log-likelihood process has a correction term given by a type of central limit theorem. To achieve this we assume that the operator of the (hidden) fast process has a discrete spectrum and an orthonormal basis of eigenfunctions. Based on these results, we then propose to estimate the unknown parameters of the model based on the limiting log-likelihood, which is an easier function to optimize because it of reduced dimension. We also establish consistency and asymptotic normality of the maximum likelihood estimator based on the reduced log-likelihood. Simulation results illustrate our theoretical findings. △ Less

Submitted 29 May, 2014; v1 submitted 8 May, 2013; originally announced May 2013.

Comments: Keywords: Ergodic filtering, fast mean reversion, homogenization, Zakai equation, maximum likelihood estimation, central limit theory

Journal ref: SIAM Journal on Multiscale Modeling and Simulation 12(3) (2014) 1193-1229

arXiv:1203.6626 [pdf, other]

doi 10.1137/110819937

Nonlinear Filters for Hidden Markov Models of Regime Change with Fast Mean-Reverting States

Authors: Andrew Papanicolaou

Abstract: We consider filtering for a hidden Markov model that evolves with multiple time scales in the hidden states. In particular, we consider the case where one of the states is a scaled Ornstein-Uhlenbeck process with fast reversion to a shifting-mean that is controlled by a continuous time Markov chain modeling regime change. We show that the nonlinear filter for such a process can be approximated by… ▽ More We consider filtering for a hidden Markov model that evolves with multiple time scales in the hidden states. In particular, we consider the case where one of the states is a scaled Ornstein-Uhlenbeck process with fast reversion to a shifting-mean that is controlled by a continuous time Markov chain modeling regime change. We show that the nonlinear filter for such a process can be approximated by an averaged filter that asymptotically coincides with the true nonlinear filter of the regime-changing Markov chain as the rate of mean reversion approaches infinity. The asymptotics exploit weak converge of the state variables to an invariant distribution, which is significantly different from the strong convergence used to obtain asymptotic results in "Filtering for Fast Mean-Reverting Processes" (19). △ Less

Submitted 13 May, 2012; v1 submitted 29 March, 2012; originally announced March 2012.

Journal ref: SIAM Multiscale Modeling and Simulation, (2012) Vol. 10, No. 3, pp. 906-935

Showing 1–8 of 8 results for author: Papanicolaou, A