-
Regression and Forecasting of U.S. Stock Returns Based on LSTM
Authors:
Shicheng Zhou,
Zizhou Zhang,
Rong Zhang,
Yuchen Yin,
Chia Hong Chang,
Qinyan Shen
Abstract:
This paper analyses the investment returns of three stock sectors, Manuf, Hitec, and Other, in the U.S. stock market using the Fama-French three-factor model, the Carhart four-factor model, and the Fama-French five-factor model, in order to test the validity of each model for these three sectors. An LSTM model is also used to explore additional factors affecting stock returns. The empirical results show that the Fama-French five-factor model has better validity for the three sectors under study, and that the LSTM model can capture factors affecting the returns of certain industries and can better regress and predict the stock returns of those industries. Keywords: Fama-French model; Carhart model; factor model; LSTM model.
Submitted 28 May, 2025; v1 submitted 3 February, 2025;
originally announced February 2025.
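
As a rough illustration of the factor regressions this abstract refers to, the sketch below fits excess returns on an intercept plus three factor series by ordinary least squares. It is not the authors' code: the data are synthetic, and the names `fit_factor_model`, `solve_linear`, and `true_betas` are assumptions made for the example. The three columns merely stand in for the market, size, and value factors.

```python
import random


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_factor_model(excess_returns, factor_rows):
    """OLS of excess returns on an intercept plus factor returns.

    Returns (alpha, betas), obtained from the normal equations X'X b = X'y.
    """
    rows = [[1.0] + list(f) for f in factor_rows]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, excess_returns)) for i in range(k)]
    coef = solve_linear(xtx, xty)
    return coef[0], coef[1:]


# Synthetic data (assumption): three factor series and a sector return built from them.
random.seed(0)
true_betas = [1.1, 0.3, -0.2]
factors = [[random.gauss(0.0, 0.04) for _ in range(3)] for _ in range(600)]
excess = [
    0.001 + sum(b * f for b, f in zip(true_betas, row)) + random.gauss(0.0, 0.01)
    for row in factors
]

alpha, betas = fit_factor_model(excess, factors)
print("alpha:", round(alpha, 4), "betas:", [round(b, 3) for b in betas])
```

Extending the sketch to the four-factor or five-factor case would only mean passing more columns per row; an intercept close to zero is what a well-specified factor model would be expected to produce.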
-
Tail Risk Alert Based on Conditional Autoregressive VaR by Regression Quantiles and Machine Learning Algorithms
Authors:
Zong Ke,
Yuchen Yin
Abstract:
With the increasing application of AI in finance, this paper leverages AI algorithms to examine tail risk and develops a model that provides tail risk alerts, with the aim of promoting the stability of US financial markets and enhancing the resilience of the US economy. Specifically, the paper constructs a multivariate multilevel CAViaR model, optimized by gradient descent and a genetic algorithm, to study the tail risk spillover between the US stock market, foreign exchange market, and credit market. The model is used to provide early warning of related risks in US stocks, US credit bonds, etc. Analysis of the direction, magnitude, and pseudo-impulse response of the risk spillover shows that the credit market's spillover effect on the stock market, and its duration, are both greater than the spillover effects that the stock market and the other markets have on the credit market, placing the credit market in a central position for warning of extreme risks. Its historical information on extreme risks can serve as a predictor of the VaR of other markets.
Submitted 8 December, 2024;
originally announced December 2024.
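
To make the CAViaR idea concrete, the sketch below shows a univariate conditional-quantile recursion in the symmetric absolute value form, together with the regression-quantile (pinball) loss that such models minimize. This is a deliberately simplified stand-in, not the multivariate multilevel model of the paper: the parameter values, the return series, and the function names are assumptions, and no optimizer (gradient descent or genetic algorithm) is included.

```python
def caviar_sav_path(returns, b0, b1, b2, q0):
    """Quantile path q_t = b0 + b1 * q_{t-1} + b2 * |r_{t-1}|, starting from q0."""
    path = [q0]
    for r_prev in returns[:-1]:
        path.append(b0 + b1 * path[-1] + b2 * abs(r_prev))
    return path


def pinball_loss(returns, quantiles, tau):
    """Average regression-quantile (check) loss at level tau."""
    total = 0.0
    for r, q in zip(returns, quantiles):
        u = r - q
        total += u * (tau - (1.0 if u < 0 else 0.0))
    return total / len(returns)


# Illustrative inputs (assumptions): a short return series and hand-picked parameters.
returns = [0.01, -0.02, 0.005, -0.03, 0.015]
path = caviar_sav_path(returns, b0=-0.001, b1=0.8, b2=-0.3, q0=-0.02)
loss = pinball_loss(returns, path, tau=0.05)
print([round(q, 4) for q in path], round(loss, 6))
```

In an estimation setting, the parameters `b0`, `b1`, and `b2` would be chosen to minimize `pinball_loss`; a negative `b2` makes the lower-tail quantile move further out after large absolute returns.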
-
$\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning
Authors:
Feng Xu,
Yan Yin,
Xinyu Zhang,
Tianyuan Liu,
Shengyi Jiang,
Zongzhang Zhang
Abstract:
Alphas are pivotal in providing signals for quantitative trading. The industry highly values the discovery of formulaic alphas for their interpretability and ease of analysis, compared with the expressive yet overfitting-prone black-box alphas. In this work, we focus on discovering formulaic alphas. Prior studies on automatically generating a collection of formulaic alphas were mostly based on genetic programming (GP), which is known to be sensitive to the initial population, to converge to local optima, and to compute slowly. Recent efforts employing deep reinforcement learning (DRL) for alpha discovery have not fully addressed key practical considerations such as alpha correlations and validity, which are crucial for their effectiveness. In this work, we propose a novel framework for alpha discovery using DRL by formulating the alpha discovery process as program construction. Our agent, $\text{Alpha}^2$, assembles an alpha program optimized for an evaluation metric. A search algorithm guided by DRL navigates through the search space based on value estimates for potential alpha outcomes. The evaluation metric encourages both the performance and the diversity of alphas for a better final trading strategy. Our formulation of searching alphas also brings the advantage of pre-calculation dimensional analysis, ensuring the logical soundness of alphas and pruning the vast search space to a large extent. Empirical experiments on real-world stock markets demonstrate $\text{Alpha}^2$'s capability to identify a diverse set of logical and effective alphas, which significantly improves the performance of the final trading strategy. The code of our method is available at https://github.com/x35f/alpha2.
Submitted 26 June, 2024; v1 submitted 24 June, 2024;
originally announced June 2024.
-
FinPT: Financial Risk Prediction with Profile Tuning on Pretrained Foundation Models
Authors:
Yuwei Yin,
Yazheng Yang,
Jian Yang,
Qi Liu
Abstract:
Financial risk prediction plays a crucial role in the financial sector. Machine learning methods have been widely applied to automatically detect potential risks and thus save labor costs. However, development in this field has lagged in recent years for two reasons: 1) the algorithms used are somewhat outdated, especially in the context of the fast advance of generative AI and large language models (LLMs); 2) the lack of a unified and open-sourced financial benchmark has impeded the related research for years. To tackle these issues, we propose FinPT and FinBench: the former is a novel approach for financial risk prediction that conducts Profile Tuning on large pretrained foundation models, and the latter is a set of high-quality datasets on financial risks such as default, fraud, and churn. In FinPT, we fill the financial tabular data into a pre-defined instruction template, obtain natural-language customer profiles by prompting LLMs, and fine-tune large foundation models with the profile text to make predictions. We demonstrate the effectiveness of the proposed FinPT by experimenting with a range of representative strong baselines on FinBench. The analytical studies further deepen the understanding of LLMs for financial risk prediction.
Submitted 22 July, 2023;
originally announced August 2023.
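
The first step described in this abstract, filling a tabular record into an instruction template, can be pictured with the minimal sketch below. The template wording, the column names, and the function `build_profile_prompt` are hypothetical and are not taken from FinPT; the later steps (prompting an LLM for the profile and fine-tuning a foundation model on the resulting text) are not shown.

```python
def build_profile_prompt(row):
    """Fill one tabular record into an instruction template (hypothetical wording)."""
    facts = "; ".join(
        f"{name.replace('_', ' ')}: {value}" for name, value in row.items()
    )
    return (
        "Construct a concise customer profile description from the following "
        f"attributes. {facts}."
    )


# Hypothetical record; the column names are illustrative only.
row = {"age": 42, "annual_income": 58000, "num_late_payments": 3}
prompt = build_profile_prompt(row)
print(prompt)
```

The resulting string is what would be sent to an LLM to obtain a natural-language profile, which in turn becomes the training text for the downstream risk classifier.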
-
Consistency of MLE for partially observed diffusions, with application in market microstructure modeling
Authors:
Sergey Nadtochiy,
Yuan Yin
Abstract:
This paper presents a tractable sufficient condition for the consistency of maximum likelihood estimators (MLEs) in partially observed diffusion models, stated in terms of the stationary distribution of the associated fully observed diffusion, under the assumption that the set of unknown parameter values is finite. This sufficient condition is then verified in the context of a latent price model of market microstructure, yielding consistency of the maximum likelihood estimators of the unknown parameters in this model. Finally, we compute these estimators using historical financial data taken from the NASDAQ exchange.
Submitted 7 December, 2024; v1 submitted 19 January, 2022;
originally announced January 2022.
-
Temporal-Relational Hypergraph Tri-Attention Networks for Stock Trend Prediction
Authors:
Chaoran Cui,
Xiaojie Li,
Juan Du,
Chunyun Zhang,
Xiushan Nie,
Meng Wang,
Yilong Yin
Abstract:
Predicting the future price trends of stocks is a challenging yet intriguing problem given its critical role in helping investors make profitable decisions. In this paper, we present a collaborative temporal-relational modeling framework for end-to-end stock trend prediction. The temporal dynamics of stocks are first captured with an attention-based recurrent neural network. Then, unlike existing studies that rely on pairwise correlations between stocks, we argue that stocks are naturally connected as a collective group, and introduce hypergraph structures to jointly characterize the group-wise relationships of industry-belonging and fund-holding among stocks. A novel hypergraph tri-attention network (HGTAN) is proposed to augment hypergraph convolutional networks with a hierarchical organization of intra-hyperedge, inter-hyperedge, and inter-hypergraph attention modules. In this manner, HGTAN adaptively determines the importance of nodes, hyperedges, and hypergraphs during the information propagation among stocks, so that the potential synergies between stock movements can be fully exploited. Extensive experiments on real-world data demonstrate the effectiveness of our approach. Also, the results of investment simulation show that our approach can achieve a more desirable risk-adjusted return. The data and code of our work have been released at https://github.com/lixiaojieff/HGTAN.
Submitted 4 March, 2022; v1 submitted 21 July, 2021;
originally announced July 2021.
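
The group-wise view of stocks described in this abstract can be illustrated with a toy hypergraph, where each hyperedge collects all stocks sharing an industry. The sketch below builds such hyperedges and passes information from nodes to hyperedges and back using plain averaging. It is not HGTAN: the attention modules are replaced by unweighted means, and the tickers, labels, and function names are assumptions made for the example.

```python
def build_hyperedges(labels):
    """Group node ids that share a label into one hyperedge per label."""
    edges = {}
    for node, label in labels.items():
        edges.setdefault(label, []).append(node)
    return edges


def propagate_mean(features, hyperedges):
    """Average features within each hyperedge, then average those summaries per node.

    Plain means are used here; an attention-based model would learn the weights.
    """
    edge_mean = {
        e: sum(features[n] for n in nodes) / len(nodes)
        for e, nodes in hyperedges.items()
    }
    out = {}
    for node in features:
        member = [edge_mean[e] for e, nodes in hyperedges.items() if node in nodes]
        out[node] = sum(member) / len(member) if member else features[node]
    return out


# Hypothetical tickers and industry labels.
industry = {"AAA": "tech", "BBB": "tech", "CCC": "energy"}
hyperedges = build_hyperedges(industry)
features = {"AAA": 1.0, "BBB": 3.0, "CCC": 5.0}
smoothed = propagate_mean(features, hyperedges)
print(hyperedges, smoothed)
```

A second hypergraph, for example one built from fund holdings, could be constructed with the same `build_hyperedges` helper from a different label mapping.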