-
The Exploratory Multi-Asset Mean-Variance Portfolio Selection using Reinforcement Learning
Authors:
Yu Li,
Yuhan Wu,
Shuhua Zhang
Abstract:
In this paper, we study the continuous-time multi-asset mean-variance (MV) portfolio selection using a reinforcement learning (RL) algorithm, specifically the soft actor-critic (SAC) algorithm, in the time-varying financial market. A family of Gaussian portfolio selections is derived, and a policy iteration process is crafted to learn the optimal exploratory portfolio selection. We prove the convergence of the policy iteration process theoretically, based on which the SAC algorithm is developed. To improve the algorithm's stability and the learning accuracy in the multi-asset scenario, we divide the model parameters that influence the optimal portfolio selection into three parts, and learn each part progressively. Numerical studies in the simulated and real financial markets confirm the superior performance of the proposed SAC algorithm under various criteria.
Submitted 12 May, 2025;
originally announced May 2025.
-
Density Approximation of Affine Jump Diffusions via Closed-Form Moment Matching
Authors:
Yan-Feng Wu,
Jian-Qiang Hu
Abstract:
We develop a recursive approach for deriving closed-form solutions to both conditional and unconditional moments of affine jump diffusions with state-independent jump intensities. Using these moment solutions, we construct closed-form density approximations (up to a normalization constant) via moment matching for both conditional and unconditional distributions. Our framework enables important financial applications, including efficient option pricing and exact simulation for affine jump diffusions. Numerical experiments demonstrate the method's superior computational efficiency compared to existing simulation techniques, while preserving numerical precision.
Submitted 9 April, 2025;
originally announced April 2025.
-
Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Authors:
You Wu,
Mengfang Sun,
Hongye Zheng,
Jinxin Hu,
Yingbin Liang,
Zhenghao Lin
Abstract:
This document presents an in-depth examination of stock market sentiment through the integration of Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU), enabling precise risk alerts. The robust feature extraction capability of CNN is utilized to preprocess and analyze extensive network text data, identifying local features and patterns. The extracted feature sequences are then input into the GRU model to understand the progression of emotional states over time and their potential impact on future market sentiment and risk. This approach addresses the order dependence and long-term dependencies inherent in time series data, resulting in a detailed analysis of stock market sentiment and effective early warnings of future risks.
Submitted 13 December, 2024;
originally announced December 2024.
-
Understanding the Excess Bond Premium
Authors:
Kevin Benson,
Ing-Haw Cheng,
John Hull,
Charles Martineau,
Yoshio Nozawa,
Vasily Strela,
Yuntao Wu,
Jun Yuan
Abstract:
We study the drivers of the Gilchrist and Zakrajšek (2012) excess bond premium (EBP) through the lens of the news. The monthly attention the news pays to 180 topics (Bybee et al., 2024) captures up to 80% of the variation in the EBP, and this component of variation forecasts macroeconomic movements. Greater news attention to financial intermediaries and crises tends to drive up the EBP and portend macroeconomic downturns, while greater news attention to politics and science tends to drive down the EBP. Attention-based estimates of EBP largely drive out the forecast power of direct sentiment measures for macroeconomic fluctuations and predict the business cycle going back to the early 1900's. Overall, we attribute predictive variation about the EBP for macroeconomic movements to variation in news attention to financial intermediaries, crises, and politics.
Submitted 5 December, 2024;
originally announced December 2024.
-
Advanced Risk Prediction and Stability Assessment of Banks Using Time Series Transformer Models
Authors:
Wenying Sun,
Zhen Xu,
Wenqing Zhang,
Kunyuan Ma,
You Wu,
Mengfang Sun
Abstract:
This paper aims to study the prediction of the bank stability index based on the Time Series Transformer model. The bank stability index is an important indicator to measure the health status and risk resistance of financial institutions. Traditional prediction methods are difficult to adapt to complex market changes because they rely on single-dimensional macroeconomic data. This paper proposes a prediction framework based on the Time Series Transformer, which uses the self-attention mechanism of the model to capture the complex temporal dependencies and nonlinear relationships in financial data. Through experiments, we compare the model with LSTM, GRU, CNN, TCN and RNN-Transformer models. The experimental results show that the Time Series Transformer model outperforms other models in both mean square error (MSE) and mean absolute error (MAE) evaluation indicators, showing strong prediction ability. This shows that the Time Series Transformer model can better handle multidimensional time series data in bank stability prediction, providing new technical approaches and solutions for financial risk management.
Submitted 4 December, 2024;
originally announced December 2024.
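The abstract above ranks models by mean square error (MSE) and mean absolute error (MAE). As a minimal sketch of how these two metrics are computed (the function names and sample values are illustrative assumptions, not taken from the paper):

```python
def mse(y_true, y_pred):
    """Mean square error: average of squared differences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    """Mean absolute error: average of absolute differences."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical stability-index values and forecasts, for illustration only.
actual = [1.0, 2.0, 3.0]
forecast = [1.0, 2.0, 5.0]
print(mse(actual, forecast), mae(actual, forecast))
```

MSE penalizes large misses more heavily than MAE, which is why the two are often reported together.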
-
A Risk Sensitive Contract-unified Reinforcement Learning Approach for Option Hedging
Authors:
Xianhua Peng,
Xiang Zhou,
Bo Xiao,
Yi Wu
Abstract:
We propose a new risk sensitive reinforcement learning approach for the dynamic hedging of options. The approach focuses on the minimization of the tail risk of the final P&L of the seller of an option. Different from most existing reinforcement learning approaches that require a parametric model of the underlying asset, our approach can learn the optimal hedging strategy directly from the historical market data without specifying a parametric model; in addition, the learned optimal hedging strategy is contract-unified, i.e., it applies to different options contracts with different initial underlying prices, strike prices, and maturities. Our approach extends existing reinforcement learning methods by learning the tail risk measures of the final hedging P&L and the optimal hedging strategy at the same time. We carry out a comprehensive empirical study to show that, in the out-of-sample tests, the proposed reinforcement learning hedging strategy can obtain statistically significantly lower tail risk and higher mean of the final P&L than delta hedging methods.
Submitted 14 November, 2024;
originally announced November 2024.
-
FinRobot: AI Agent for Equity Research and Valuation with Large Language Models
Authors:
Tianyu Zhou,
Pinqiao Wang,
Yilin Wu,
Hongyang Yang
Abstract:
As financial markets grow increasingly complex, there is a rising need for automated tools that can effectively assist human analysts in equity research, particularly within sell-side research. While Generative AI (GenAI) has attracted significant attention in this field, existing AI solutions often fall short due to their narrow focus on technical factors and limited capacity for discretionary judgment. These limitations hinder their ability to adapt to new data in real-time and accurately assess risks, which diminishes their practical value for investors.
This paper presents FinRobot, the first AI agent framework specifically designed for equity research. FinRobot employs a multi-agent Chain of Thought (CoT) system, integrating both quantitative and qualitative analyses to emulate the comprehensive reasoning of a human analyst. The system is structured around three specialized agents: the Data-CoT Agent, which aggregates diverse data sources for robust financial integration; the Concept-CoT Agent, which mimics an analyst's reasoning to generate actionable insights; and the Thesis-CoT Agent, which synthesizes these insights into a coherent investment thesis and report. FinRobot provides thorough company analysis supported by precise numerical data, industry-appropriate valuation metrics, and realistic risk assessments. Its dynamically updatable data pipeline ensures that research remains timely and relevant, adapting seamlessly to new financial information. Unlike existing automated research tools, such as CapitalCube and Wright Reports, FinRobot delivers insights comparable to those produced by major brokerage firms and fundamental research vendors. We open-source FinRobot at \url{https://github.com/AI4Finance-Foundation/FinRobot}.
Submitted 13 November, 2024;
originally announced November 2024.
-
ajdmom: A Python Package for Deriving Moment Formulas of Affine Jump Diffusion Processes
Authors:
Yan-Feng Wu,
Jian-Qiang Hu
Abstract:
We introduce ajdmom, a Python package designed for automatically deriving moment formulae for the well-established affine jump diffusion processes with state-independent jump intensities. ajdmom can produce explicit closed-form expressions for conditional and unconditional moments of any order, significantly enhancing the usability of these models. Additionally, ajdmom can compute partial derivatives of these moments with respect to the model parameters, offering a valuable tool for sensitivity analysis. The package's modular architecture makes it easy for adaptation and extension by researchers. ajdmom is open-source and readily available for installation from GitHub or the Python package index (PyPI).
Submitted 6 April, 2025; v1 submitted 10 November, 2024;
originally announced November 2024.
-
Deep-MacroFin: Informed Equilibrium Neural Network for Continuous Time Economic Models
Authors:
Yuntao Wu,
Jiayuan Guo,
Goutham Gopalakrishna,
Zissis Poulos
Abstract:
In this paper, we present Deep-MacroFin, a comprehensive framework designed to solve partial differential equations, with a particular focus on models in continuous time economics. This framework leverages deep learning methodologies, including Multi-Layer Perceptrons and the newly developed Kolmogorov-Arnold Networks. It is optimized using economic information encapsulated by Hamilton-Jacobi-Bellman (HJB) equations and coupled algebraic equations. The application of neural networks holds the promise of accurately resolving high-dimensional problems with fewer computational demands and limitations compared to other numerical methods. This framework can be readily adapted for systems of partial differential equations in high dimensions. Importantly, it offers a more efficient (5$\times$ less CUDA memory and 40$\times$ fewer FLOPs in 100D problems) and user-friendly implementation than existing libraries. We also incorporate a time-stepping scheme to enhance training stability for nonlinear HJB equations, enabling the solution of 50D economic models.
Submitted 13 May, 2025; v1 submitted 19 August, 2024;
originally announced August 2024.
-
Method of Moments Estimation for Affine Stochastic Volatility Models
Authors:
Yan-Feng Wu,
Xiangyu Yang,
Jian-Qiang Hu
Abstract:
We develop moment estimators for the parameters of affine stochastic volatility models. We first address the challenge of calculating moments for the models by introducing a recursive equation for deriving closed-form expressions for moments of any order. Consequently, we propose our moment estimators. We then establish a central limit theorem for our estimators and derive the explicit formulas for the asymptotic covariance matrix. Finally, we provide numerical results to validate our method.
Submitted 17 August, 2024;
originally announced August 2024.
-
The mean-variance portfolio selection based on the average and current profitability of the risky asset
Authors:
Yu Li,
Yuhan Wu,
Shuhua Zhang
Abstract:
We study the continuous-time pre-commitment mean-variance portfolio selection in a time-varying financial market. By introducing two indexes which respectively express the average profitability of the risky asset (AP) and the current profitability of the risky asset (CP), the optimal portfolio selection is represented by AP and CP. Furthermore, instead of the traditional maximum likelihood estimation (MLE) of return rate and volatility of the risky asset, we estimate AP and CP with the second-order variation of an auxiliary wealth process. We prove that the estimates of AP and CP in this paper are more accurate than those obtained by MLE. The portfolio selection is implemented in various simulated and real financial markets. Numerical studies confirm the superior performance of our portfolio selection with the estimation of AP and CP under various evaluation criteria.
Submitted 15 August, 2024;
originally announced August 2024.
-
Advanced Financial Fraud Detection Using GNN-CL Model
Authors:
Yu Cheng,
Junjie Guo,
Shiqing Long,
You Wu,
Mengfang Sun,
Rong Zhang
Abstract:
The innovative GNN-CL model proposed in this paper marks a breakthrough in the field of financial fraud detection by synergistically combining the advantages of graph neural networks (GNN), convolutional neural networks (CNN) and long short-term memory (LSTM) networks. This convergence enables multifaceted analysis of complex transaction patterns, improving detection accuracy and resilience against complex fraudulent activities. A key novelty of this paper is the use of multilayer perceptrons (MLPs) to estimate node similarity, effectively filtering out neighborhood noise that can lead to false positives. This intelligent purification mechanism ensures that only the most relevant information is considered, thereby improving the model's understanding of the network structure. Feature weakening often plagues graph-based models due to the dilution of key signals. In order to further address the challenge of feature weakening, GNN-CL adopts reinforcement learning strategies. By dynamically adjusting the weights assigned to central nodes, it reinforces the importance of these influential entities to retain important clues of fraud even in less informative data. Experimental evaluations on Yelp datasets highlight the superior performance of GNN-CL compared to existing methods.
Submitted 8 July, 2024;
originally announced July 2024.
-
Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance
Authors:
Xi-Ning Zhuang,
Zhao-Yun Chen,
Cheng Xue,
Xiao-Fan Xu,
Chao Wang,
Huan-Yu Liu,
Tai-Ping Sun,
Yun-Jie Wang,
Yu-Chun Wu,
Guo-Ping Guo
Abstract:
Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-informed parameterized quantum circuit (SI-PQC) for the efficient preparation and training of quantum computational statistical models, including arbitrary distributions and their weighted mixtures. The SI-PQC features a static structure with trainable parameters, enabling in-depth optimized circuit compilation, exponential reductions in resource and time consumption, and improved trainability and interpretability for learning quantum states and classical model parameters simultaneously. As an efficient subroutine for preparing and learning in various quantum algorithms, the SI-PQC addresses the input bottleneck and facilitates the injection of prior knowledge.
Submitted 18 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Using CPI in Loss Given Default Forecasting Models for Commercial Real Estate Portfolio
Authors:
Ying Wu,
Garvit Arora,
Xuan Mei
Abstract:
Forecasting the loss given default (LGD) for defaulted Commercial Real Estate (CRE) loans poses a significant challenge due to the extended resolution and workout time associated with such defaults, particularly in the CCAR and CECL frameworks, where the utilization of post-default information, including macroeconomic variables (MEVs) such as unemployment (UER) and various rates, is restricted. The current environment of persistent inflation and resultant elevated rates further compounds the uncertainty surrounding predictive LGD models. In this paper, we leverage both internal and public data sources, including observations from the COVID-19 period, to present a list of evidence indicating that the growth rates of the Consumer Price Index (CPI), such as Year-over-Year (YoY) growth and logarithmic growth, are good leading indicators for various CRE-related rates and indices. These include the Federal Funds Effective Rate and CRE market sales price indices in key locations such as Los Angeles, New York, and nationwide, encompassing both apartment and office segments. Furthermore, with CRE LGD data we demonstrate how incorporating CPI at the time of default can improve the accuracy of predicting CRE workout LGD. This is particularly helpful in addressing the common issue of early downturn underestimation encountered in CRE LGD models.
Submitted 23 February, 2024;
originally announced February 2024.
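The abstract above names Year-over-Year (YoY) growth and logarithmic growth of the CPI as candidate leading indicators. A minimal sketch of both transformations follows, assuming a monthly index series and a 12-month lag; the function names, the lag choice for the logarithmic growth, and the sample values are illustrative assumptions rather than the paper's specification:

```python
import math


def yoy_growth(series, lag=12):
    """Simple growth over `lag` periods: x_t / x_{t-lag} - 1."""
    return [series[i] / series[i - lag] - 1.0 for i in range(lag, len(series))]


def log_growth(series, lag=12):
    """Logarithmic growth over `lag` periods: ln(x_t / x_{t-lag})."""
    return [math.log(series[i] / series[i - lag]) for i in range(lag, len(series))]


# Hypothetical monthly index levels (not real CPI data), 14 observations.
cpi = [100.0 + 0.5 * k for k in range(14)]
print(yoy_growth(cpi))   # two values: months 13 and 14 vs. months 1 and 2
print(log_growth(cpi))
```

For small changes the two measures are close, since ln(1 + g) is approximately g; they diverge as growth rates get larger.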
-
An Exploration to the Correlation Structure and Clustering of Macroeconomic Variables
Authors:
Garvit Arora,
Shubhangi Tiwari,
Ying Wu,
Xuan Mei
Abstract:
As a quantitative characterization of the complicated economy, Macroeconomic Variables (MEVs), including GDP, inflation, unemployment, income, spending, interest rate, etc., are playing a crucial role in banks' portfolio management and stress testing exercises. In recent years, especially during the COVID-19 period and the current high inflation environment, people are frequently talking about the changing "correlation structure" of MEVs. In this paper, we use a principal component based algorithm to perform unsupervised clustering on MEVs so we can quantify and better understand MEVs' correlation structure in any given period. We also demonstrate how this method can be used to visualize historical MEV pattern changes between 2000 and 2022. Further, we use this method to compare different hypothetical and/or historical macroeconomic scenarios and present our key findings. One of these interesting observations is that, for a list of 132 transformations derived from 44 targeted MEVs that cover 5 different aspects of the U.S. economy (which takes as a subset the 10+ key MEVs published by FRB), compared to benign years where there are typically 20-25 clusters, during the great financial crisis (GFC), i.e., 2007-2010, they exhibited a more synchronized and less diversified pattern of movement, forming roughly 15 clusters. We also see this contrast in the hypothetical CCAR2023 FRB scenarios where the Severely Adverse scenario has 15 clusters and the Baseline scenario has 21 clusters. We provide our interpretation of this observation and hope this research can inspire and benefit researchers from different domains all over the world.
Submitted 20 May, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data
Authors:
Yancheng Liang,
Jiajie Zhang,
Hui Li,
Xiaochen Liu,
Yi Hu,
Yong Wu,
Jinyao Zhang,
Yongyan Liu,
Yi Wu
Abstract:
Despite the tremendous advances achieved over the past years by deep learning techniques, the latest risk prediction models for industrial applications still rely on highly hand-tuned stage-wise statistical learning tools, such as gradient boosting and random forest methods. Different from images or languages, real-world financial data are high-dimensional, sparse, noisy and extremely imbalanced, which makes deep neural network models particularly challenging to train and fragile in practice. In this work, we propose DeRisk, an effective deep learning risk prediction framework for credit risk prediction on real-world financial data. DeRisk is the first deep risk prediction model that outperforms statistical learning approaches deployed in our company's production system. We also perform extensive ablation studies on our method to present the most critical factors for the empirical success of DeRisk.
Submitted 7 August, 2023;
originally announced August 2023.
-
Quantum Encoding and Analysis on Continuous Time Stochastic Process with Financial Applications
Authors:
Xi-Ning Zhuang,
Zhao-Yun Chen,
Cheng Xue,
Yu-Chun Wu,
Guo-Ping Guo
Abstract:
The continuous time stochastic process is a mainstream mathematical instrument modeling the random world with a wide range of applications involving finance, statistics, physics, and time series analysis, while the simulation and analysis of the continuous time stochastic process is a challenging problem for classical computers. In this work, a general framework is established to prepare the path of a continuous time stochastic process in a quantum computer efficiently. The storage and computation resource is exponentially reduced on the key parameter of holding time, as the qubit number and the circuit depth are both optimized via our compressed state preparation method. The desired information, including the path-dependent and history-sensitive information that is essential for financial problems, can be extracted efficiently from the compressed sampling path, and admits a further quadratic speed-up. Moreover, this extraction method is more sensitive to those discontinuous jumps capturing extreme market events. Two applications of option pricing in Merton jump diffusion model and ruin probability computing in the collective risk model are given.
Submitted 27 September, 2023; v1 submitted 3 August, 2022;
originally announced August 2022.
-
The China Trade Shock and the ESG Performances of US firms
Authors:
Hui Xu,
Yue Wu
Abstract:
How does import competition from China affect engagement on ESG initiatives by US corporates? On the one hand, reduced profitability due to import competition and lagging ESG performance of Chinese exporters can disincentivize US firms from putting more resources into ESG initiatives. On the other hand, the shift from labor-intensive production to capital/technology-intensive production along with offshoring may improve the US company's ESG performance. Moreover, US companies have incentives to actively pursue more ESG engagement to differentiate from Chinese imports. Exploiting a trade policy in which the US Congress granted China Permanent Normal Trade Relations and the resulting change in expected tariff rates on Chinese imports, we find that greater import competition from China leads to an increase in the US company's ESG performance. The improvement primarily stems from "doing more positives" and from more involvement in environmental initiatives. Indirect and direct evidence shows that the improvement is not driven by the change in production process or offshoring, but is consistent with product differentiation. Our results suggest that the trade shock from China has a significant impact on the US company's ESG performance.
Submitted 28 January, 2022;
originally announced January 2022.
-
Towards Robust Representation of Limit Orders Books for Deep Learning Models
Authors:
Yufei Wu,
Mahmoud Mahfouz,
Daniele Magazzeni,
Manuela Veloso
Abstract:
The success of deep learning-based limit order book forecasting models is highly dependent on the quality and the robustness of the input data representation. A significant body of the quantitative finance literature focuses on utilising different deep learning architectures without taking into consideration the key assumptions these models make with respect to the input data representation. In this paper, we highlight the issues associated with the commonly-used representations of limit order book data from both theoretical and practical perspectives. We also show the fragility of the representations under adversarial perturbations and propose two simple modifications to the existing representations that match the theoretical assumptions of deep learning models. Finally, we show experimentally how our proposed representations lead to state-of-the-art performance in both accuracy and robustness utilising very simple neural network architectures.
Submitted 7 December, 2022; v1 submitted 10 October, 2021;
originally announced October 2021.
-
How Robust are Limit Order Book Representations under Data Perturbation?
Authors:
Yufei Wu,
Mahmoud Mahfouz,
Daniele Magazzeni,
Manuela Veloso
Abstract:
The success of machine learning models in the financial domain is highly reliant on the quality of the data representation. In this paper, we focus on the representation of limit order book data and discuss the opportunities and challenges for learning representations of such data. We also experimentally analyse the issues associated with existing representations and present a guideline for future research in this area.
Submitted 10 October, 2021;
originally announced October 2021.
-
Quantum Quantitative Trading: High-Frequency Statistical Arbitrage Algorithm
Authors:
Xi-Ning Zhuang,
Zhao-Yun Chen,
Yu-Chun Wu,
Guo-Ping Guo
Abstract:
Quantitative trading is an integral part of financial markets with high calculation speed requirements, while no quantum algorithms have been introduced into this field yet. We propose quantum algorithms for high-frequency statistical arbitrage trading in this work by utilizing variable time condition number estimation and quantum linear regression. The algorithm complexity has been reduced from the classical benchmark $O(N^2 d)$ to $O(\sqrt{d}\,\kappa^2 (\log(1/\epsilon))^2)$, showing quantum advantage, where $N$ is the length of the trading data, $d$ is the number of stocks, $\kappa$ is the condition number, and $\epsilon$ is the desired precision. Moreover, two tool algorithms for condition number estimation and cointegration test are developed.
Submitted 29 April, 2021;
originally announced April 2021.
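To get a feel for the two complexity expressions quoted in the abstract above, N^2 d for the classical benchmark and sqrt(d) kappa^2 (log(1/epsilon))^2 for the quantum algorithm, the sketch below evaluates both leading terms for sample inputs. The function names, the natural-log base, and the sample values are assumptions; big-O notation hides constant factors, so the numbers indicate scaling only, not actual runtimes.

```python
import math


def classical_term(n, d):
    """Leading term of the classical benchmark, N^2 * d."""
    return n ** 2 * d


def quantum_term(d, kappa, eps):
    """Leading term sqrt(d) * kappa^2 * (log(1/eps))^2 (natural log assumed)."""
    return math.sqrt(d) * kappa ** 2 * math.log(1.0 / eps) ** 2


# Hypothetical problem sizes, for illustration only.
n, d, kappa, eps = 10_000, 100, 10.0, 1e-3
print(classical_term(n, d))
print(quantum_term(d, kappa, eps))
```

Note that the quantum expression does not depend on N but grows quadratically with the condition number, so a badly conditioned problem can erode the gap.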
-
Stock Market Trend Analysis Using Hidden Markov Model and Long Short Term Memory
Authors:
Mingwen Liu,
Junbang Huo,
Yulin Wu,
Jinge Wu
Abstract:
This paper applies the Hidden Markov Model to the stock market to make predictions. Four improved methods, namely GMM-HMM, XGB-HMM, GMM-HMM+LSTM and XGB-HMM+LSTM, are discussed together with their respective experimental results. We then analyze the pros and cons of the different models. Finally, one of the best models is applied to the stock market as a timing strategy.
Submitted 19 April, 2021;
originally announced April 2021.