-
Predictive Power of LLMs in Financial Markets
Authors:
Jerick Shi,
Burton Hollifield
Abstract:
Predicting the movement of the stock market and other assets has been valuable over the past few decades. Knowing how the value of a certain sector market may move in the future provides much information for investors, as they use that information to develop strategies to maximize profit or minimize risk. However, market data are quite noisy, and it is challenging to choose the right data or the right model to create such predictions. With the rise of large language models, there are ways to analyze certain data much more efficiently than before.
Our goal is to determine whether the GPT model provides more useful information than traditional transformer models such as BERT. We use data from the Federal Reserve Beige Book, which summarizes economic conditions in different districts of the US. Using these data, we employ the LLMs to predict correlations between assets. Using these correlations, we then compare the results with well-known strategies and determine whether knowing the economic conditions improves investment decisions. We conclude that the Beige Book does contain information about correlations among different assets, yet the GPT model has too much look-ahead bias, and traditional models still prevail.
Submitted 25 November, 2024;
originally announced November 2024.
-
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
Authors:
Hao Shi,
Weili Song,
Xinting Zhang,
Jiahe Shi,
Cuicui Luo,
Xiang Ao,
Hamid Arian,
Luis Seco
Abstract:
The complexity of financial data, characterized by its variability and low signal-to-noise ratio, necessitates advanced methods in quantitative investment that prioritize both performance and interpretability. Transitioning from early manual extraction to genetic programming, the most advanced approach in the alpha factor mining domain currently employs reinforcement learning to mine a set of combination factors with fixed weights. However, the performance of resultant alpha factors exhibits inconsistency, and the inflexibility of fixed factor weights proves insufficient in adapting to the dynamic nature of financial markets. To address this issue, this paper proposes a two-stage formulaic alpha generating framework AlphaForge, for alpha factor mining and factor combination. This framework employs a generative-predictive neural network to generate factors, leveraging the robust spatial exploration capabilities inherent in deep learning while concurrently preserving diversity. The combination model within the framework incorporates the temporal performance of factors for selection and dynamically adjusts the weights assigned to each component alpha factor. Experiments conducted on real-world datasets demonstrate that our proposed model outperforms contemporary benchmarks in formulaic alpha factor mining. Furthermore, our model exhibits a notable enhancement in portfolio returns within the realm of quantitative investment and real money investment.
Submitted 12 December, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
TransCORALNet: A Two-Stream Transformer CORAL Networks for Supply Chain Credit Assessment Cold Start
Authors:
Jie Shi,
Arno P. J. M. Siebes,
Siamak Mehrkanoon
Abstract:
This paper proposes an interpretable two-stream transformer CORAL network (TransCORALNet) for supply chain credit assessment under the segment industry and cold start problem. The model aims to provide accurate credit assessment prediction for new supply chain borrowers with limited historical data. Here, the two-stream domain adaptation architecture with correlation alignment (CORAL) loss is used as the core model and is equipped with a transformer, which provides insights about the learned features and allows efficient parallelization during training. Thanks to the domain adaptation capability of the proposed model, the domain shift between the source and target domains is minimized. Therefore, the model exhibits good generalization where the source and target do not follow the same distribution and only a limited number of labeled target instances exist. Furthermore, we employ Local Interpretable Model-agnostic Explanations (LIME) to provide more insight into the model prediction and identify the key features contributing to supply chain credit assessment decisions. The proposed model addresses four significant supply chain credit assessment challenges: domain shift, cold start, class imbalance, and interpretability. Experimental results on a real-world data set demonstrate the superiority of TransCORALNet over a number of state-of-the-art baselines in terms of accuracy. The code is available on GitHub https://github.com/JieJieNiu/TransCORALN .
Submitted 30 November, 2023;
originally announced November 2023.
-
Stock Market Prediction via Deep Learning Techniques: A Survey
Authors:
Jinan Zou,
Qingying Zhao,
Yang Jiao,
Haiyao Cao,
Yanxi Liu,
Qingsen Yan,
Ehsan Abbasnejad,
Lingqiao Liu,
Javen Qinfeng Shi
Abstract:
Existing surveys on stock market prediction often focus on traditional machine learning methods instead of deep learning methods. This motivates us to provide a structured and comprehensive overview of the research on stock market prediction. We present four elaborated subtasks of stock market prediction and propose a novel taxonomy to summarize the state-of-the-art models based on deep neural networks. In addition, we also provide detailed statistics on the datasets and evaluation metrics commonly used in the stock market. Finally, we point out several future directions by sharing some new perspectives on stock market prediction.
Submitted 9 February, 2023; v1 submitted 24 December, 2022;
originally announced December 2022.
-
Optimal control of multiple Markov switching stochastic system with application to portfolio decision
Authors:
Jianmin Shi
Abstract:
In this paper we set up an optimal control framework for a hybrid stochastic system with dual or multiple Markov switching diffusion processes, while Markov chains governing these switching diffusions are not identical as assumed by the existing literature. As an application and illustration of this model, we solve a portfolio choice problem for an investor facing financial and labor markets that are both regime switching. In continuous time context we combine two separate Markov chains into one synthetic Markov chain and derive its corresponding generator matrix, then state the HJB equations for the optimal control problem with the newly synthesized Markov switching diffusion. Furthermore, we derive explicit solutions and value functions under some reasonable specifications.
Submitted 30 October, 2020;
originally announced October 2020.
-
Optimal Reinsurance and Investment Strategies under Mean-Variance Criteria: Partial and Full Information
Authors:
Shihao Zhu,
Jingtao Shi
Abstract:
This paper is concerned with an optimal reinsurance and investment problem for an insurance firm under the criterion of mean-variance. The driving Brownian motion and the rate in return of the risky asset price dynamic equation cannot be directly observed. And the short-selling of stocks is prohibited. The problem is formulated as a stochastic linear-quadratic (LQ) optimal control problem where the control variables are constrained. Based on the separation principle and stochastic filtering theory, the partial information problem is solved. Efficient strategies and efficient frontier are presented in closed forms via solutions to two extended stochastic Riccati equations. As a comparison, the efficient strategies and efficient frontier are given by the viscosity solution for the Hamilton-Jacobi-Bellman (HJB) equation in the full information case. Some numerical illustrations are also provided.
Submitted 2 June, 2020; v1 submitted 19 June, 2019;
originally announced June 2019.
-
Exploiting Investors Social Network for Stock Prediction in China's Market
Authors:
Xi Zhang,
Jiawei Shi,
Di Wang,
Binxing Fang
Abstract:
Recent works have shown that social media platforms are able to influence the trends of stock price movements. However, existing works have mainly focused on the U.S. stock market and paid little attention to certain emerging countries such as China, where retail investors dominate the market. In this regard, as retail investors are prone to be influenced by news or other social media, psychological and behavioral features extracted from social media platforms are thought to predict stock price movements well in China's market. Recent advances in the investor social network in China enable the extraction of such features from web-scale data. In this paper, on the basis of tweets from Xueqiu, a popular Chinese Twitter-like social platform specialized for investors, we analyze features with regard to collective sentiment and perception on stock relatedness and predict stock price movements by employing nonlinear models. The features of interest prove to be effective in our experiments.
Submitted 2 January, 2018;
originally announced January 2018.
-
Benford's law first significant digit and distribution distances for testing the reliability of financial reports in developing countries
Authors:
Jing Shi,
Marcel Ausloos,
Tingting Zhu
Abstract:
We discuss a common suspicion about reported financial data in 10 industrial sectors of the 6 so-called "main developing countries" over the time interval [2000-2014]. These data are examined through Benford's law for the first significant digit and through distribution distance tests. It is shown that several visually anomalous data points have to be removed a priori. Thereafter, the distributions follow the first significant digit law much better, indicating the usefulness of a Benford's law test from the very start of the research. The same holds true for the distance tests. A few outliers are pointed out.
Submitted 30 November, 2017;
originally announced December 2017.
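For readers unfamiliar with the first significant digit test mentioned in the abstract above, the following is a minimal sketch, not the authors' procedure. Benford's law predicts that the leading digit d occurs with probability log10(1 + 1/d). The function names and the mean absolute deviation used as a "distance" are assumptions chosen for illustration; the paper's actual distance measures are not specified in the abstract.

```python
import math
from collections import Counter


def benford_expected():
    """Benford's law: P(d) = log10(1 + 1/d) for leading digit d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_significant_digit(x):
    """Return the first non-zero digit of a non-zero number."""
    if x == 0:
        raise ValueError("zero has no significant digit")
    # Scientific notation puts the first significant digit in position 0.
    return int(f"{abs(x):.16e}"[0])


def first_digit_frequencies(values):
    """Observed relative frequency of each leading digit (zeros are skipped)."""
    digits = [first_significant_digit(v) for v in values if v != 0]
    counts = Counter(digits)
    n = len(digits)
    return {d: counts.get(d, 0) / n for d in range(1, 10)}


def mean_absolute_deviation(values):
    """Illustrative distance (an assumption): mean |observed - expected| over digits."""
    observed = first_digit_frequencies(values)
    expected = benford_expected()
    return sum(abs(observed[d] - expected[d]) for d in range(1, 10)) / 9


# Hypothetical data: powers of two are a classic sequence that follows Benford's law.
sample = [2 ** k for k in range(1, 200)]
print(first_digit_frequencies(sample))
print(mean_absolute_deviation(sample))
```

A small deviation suggests the leading digits are close to the Benford distribution; what counts as "small" depends on the sample size and the test chosen.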