-
A Distillation-based Future-aware Graph Neural Network for Stock Trend Prediction
Authors:
Zhipeng Liu,
Peibo Duan,
Mingyang Geng,
Bin Zhang
Abstract:
Stock trend prediction involves forecasting the future price movements by analyzing historical data and various market indicators. With the advancement of machine learning, graph neural networks (GNNs) have been extensively employed in stock prediction due to their powerful capability to capture spatiotemporal dependencies of stocks. However, despite the efforts of various GNN stock predictors to…
▽ More
Stock trend prediction involves forecasting the future price movements by analyzing historical data and various market indicators. With the advancement of machine learning, graph neural networks (GNNs) have been extensively employed in stock prediction due to their powerful capability to capture spatiotemporal dependencies of stocks. However, despite the efforts of various GNN stock predictors to enhance predictive performance, the improvements remain limited, as they focus solely on analyzing historical spatiotemporal dependencies, overlooking the correlation between historical and future patterns. In this study, we propose a novel distillation-based future-aware GNN framework (DishFT-GNN) for stock trend prediction. Specifically, DishFT-GNN trains a teacher model and a student model, iteratively. The teacher model learns to capture the correlation between distribution shifts of historical and future data, which is then utilized as intermediate supervision to guide the student model to learn future-aware spatiotemporal embeddings for accurate prediction. Through extensive experiments on two real-world datasets, we verify the state-of-the-art performance of DishFT-GNN.
△ Less
Submitted 15 February, 2025;
originally announced February 2025.
-
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
Authors:
Jimin Huang,
Mengxi Xiao,
Dong Li,
Zihao Jiang,
Yuzhe Yang,
Yifei Zhang,
Lingfei Qian,
Yan Wang,
Xueqing Peng,
Yang Ren,
Ruoyu Xiang,
Zhengyu Chen,
Xiao Zhang,
Yueru He,
Weiguang Han,
Shunian Chen,
Lihang Shen,
Daniel Kim,
Yangyang Yu,
Yupeng Cao,
Zhiyang Deng,
Haohang Li,
Duanyu Feng,
Yongfu Dai,
VijayaSai Somasundaram
, et al. (19 additional authors not shown)
Abstract:
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t…
▽ More
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses.
△ Less
Submitted 6 June, 2025; v1 submitted 20 August, 2024;
originally announced August 2024.
-
MDGNN: Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction
Authors:
Hao Qian,
Hongting Zhou,
Qian Zhao,
Hao Chen,
Hongxiang Yao,
Jingwei Wang,
Ziqi Liu,
Fei Yu,
Zhiqiang Zhang,
Jun Zhou
Abstract:
The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such as economic indicators, financial reports, global news, and investor sentiment. Traditional sequential methods and graph-based models have been applied in stock movement prediction, but they have lim…
▽ More
The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such as economic indicators, financial reports, global news, and investor sentiment. Traditional sequential methods and graph-based models have been applied in stock movement prediction, but they have limitations in capturing the multifaceted and temporal influences in stock price movements. To address these challenges, the Multi-relational Dynamic Graph Neural Network (MDGNN) framework is proposed, which utilizes a discrete dynamic graph to comprehensively capture multifaceted relations among stocks and their evolution over time. The representation generated from the graph offers a complete perspective on the interrelationships among stocks and associated entities. Additionally, the power of the Transformer structure is leveraged to encode the temporal evolution of multiplex relations, providing a dynamic and effective approach to predicting stock investment. Further, our proposed MDGNN framework achieves the best performance in public datasets compared with state-of-the-art (SOTA) stock investment methods.
△ Less
Submitted 18 January, 2024;
originally announced February 2024.
-
Temporal Data Meets LLM -- Explainable Financial Time Series Forecasting
Authors:
Xinli Yu,
Zheng Chen,
Yuan Ling,
Shujing Dong,
Zongyi Liu,
Yanbin Lu
Abstract:
This paper presents a novel study on harnessing Large Language Models' (LLMs) outstanding knowledge and reasoning abilities for explainable financial time series forecasting. The application of machine learning models to financial time series comes with several challenges, including the difficulty in cross-sequence reasoning and inference, the hurdle of incorporating multi-modal signals from histo…
▽ More
This paper presents a novel study on harnessing Large Language Models' (LLMs) outstanding knowledge and reasoning abilities for explainable financial time series forecasting. The application of machine learning models to financial time series comes with several challenges, including the difficulty in cross-sequence reasoning and inference, the hurdle of incorporating multi-modal signals from historical news, financial knowledge graphs, etc., and the issue of interpreting and explaining the model results. In this paper, we focus on NASDAQ-100 stocks, making use of publicly accessible historical stock price data, company metadata, and historical economic/financial news. We conduct experiments to illustrate the potential of LLMs in offering a unified solution to the aforementioned challenges. Our experiments include trying zero-shot/few-shot inference with GPT-4 and instruction-based fine-tuning with a public LLM model Open LLaMA. We demonstrate our approach outperforms a few baselines, including the widely applied classic ARMA-GARCH model and a gradient-boosting tree model. Through the performance comparison results and a few examples, we find LLMs can make a well-thought decision by reasoning over information from both textual news and price time series and extracting insights, leveraging cross-sequence information, and utilizing the inherent knowledge embedded within the LLM. Additionally, we show that a publicly available LLM such as Open-LLaMA, after fine-tuning, can comprehend the instruction to generate explainable forecasts and achieve reasonable performance, albeit relatively inferior in comparison to GPT-4.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
Who is Gambling? Finding Cryptocurrency Gamblers Using Multi-modal Retrieval Methods
Authors:
Zhengjie Huang,
Zhenguang Liu,
Jianhai Chen,
Qinming He,
Shuang Wu,
Lei Zhu,
Meng Wang
Abstract:
With the popularity of cryptocurrencies and the remarkable development of blockchain technology, decentralized applications emerged as a revolutionary force for the Internet. Meanwhile, decentralized applications have also attracted intense attention from the online gambling community, with more and more decentralized gambling platforms created through the help of smart contracts. Compared with co…
▽ More
With the popularity of cryptocurrencies and the remarkable development of blockchain technology, decentralized applications emerged as a revolutionary force for the Internet. Meanwhile, decentralized applications have also attracted intense attention from the online gambling community, with more and more decentralized gambling platforms created through the help of smart contracts. Compared with conventional gambling platforms, decentralized gambling have transparent rules and a low participation threshold, attracting a substantial number of gamblers. In order to discover gambling behaviors and identify the contracts and addresses involved in gambling, we propose a tool termed ETHGamDet. The tool is able to automatically detect the smart contracts and addresses involved in gambling by scrutinizing the smart contract code and address transaction records. Interestingly, we present a novel LightGBM model with memory components, which possesses the ability to learn from its own misclassifications. As a side contribution, we construct and release a large-scale gambling dataset at https://github.com/AwesomeHuang/Bitcoin-Gambling-Dataset to facilitate future research in this field. Empirically, ETHGamDet achieves a F1-score of 0.72 and 0.89 in address classification and contract classification respectively, and offers novel and interesting insights.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Risk Assessment with Generic Energy Storage under Exogenous and Endogenous Uncertainty
Authors:
Ning Qi,
Lin Cheng,
Yuxiang Wan,
Yingrui Zhuang,
Zeyu Liu
Abstract:
Current risk assessment ignores the stochastic nature of energy storage availability itself and thus lead to potential risk during operation. This paper proposes the redefinition of generic energy storage (GES) that is allowed to offer probabilistic reserve. A data-driven unified model with exogenous and endogenous uncertainty (EXU & EDU) description is presented for four typical types of GES. Mor…
▽ More
Current risk assessment ignores the stochastic nature of energy storage availability itself and thus lead to potential risk during operation. This paper proposes the redefinition of generic energy storage (GES) that is allowed to offer probabilistic reserve. A data-driven unified model with exogenous and endogenous uncertainty (EXU & EDU) description is presented for four typical types of GES. Moreover, risk indices are proposed to assess the impact of overlooking (EXU & EDU) of GES. Comparative results between EXU & EDU are illustrated in distribution system with day-ahead chance-constrained optimization (CCO) and more severe risks are observed for the latter, which indicate that system operator (SO) should adopt novel strategies for EDU uncertainty.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.
-
Every Corporation Owns Its Image: Corporate Credit Ratings via Convolutional Neural Networks
Authors:
Bojing Feng,
Wenfang Xue,
Bindang Xue,
Zeyu Liu
Abstract:
Credit rating is an analysis of the credit risks associated with a corporation, which reflect the level of the riskiness and reliability in investing. There have emerged many studies that implement machine learning techniques to deal with corporate credit rating. However, the ability of these models is limited by enormous amounts of data from financial statement reports. In this work, we analyze t…
▽ More
Credit rating is an analysis of the credit risks associated with a corporation, which reflect the level of the riskiness and reliability in investing. There have emerged many studies that implement machine learning techniques to deal with corporate credit rating. However, the ability of these models is limited by enormous amounts of data from financial statement reports. In this work, we analyze the performance of traditional machine learning models in predicting corporate credit rating. For utilizing the powerful convolutional neural networks and enormous financial data, we propose a novel end-to-end method, Corporate Credit Ratings via Convolutional Neural Networks, CCR-CNN for brevity. In the proposed model, each corporation is transformed into an image. Based on this image, CNN can capture complex feature interactions of data, which are difficult to be revealed by previous machine learning models. Extensive experiments conducted on the Chinese public-listed corporate rating dataset which we build, prove that CCR-CNN outperforms the state-of-the-art methods consistently.
△ Less
Submitted 2 December, 2020;
originally announced December 2020.
-
Preventing crash in stock market: The role of economic policy uncertainty during COVID-19
Authors:
Peng-Fei Dai,
Xiong Xiong,
Zhifeng Liu,
Toan Luu Duc Huynh,
Jianjun Sun
Abstract:
This paper investigates the impact of economic policy uncertainty (EPU) on the crash risk of US stock market during the COVID-19 pandemic. To this end, we use the GARCH-S (GARCH with skewness) model to estimate daily skewness as a proxy for the stock market crash risk. The empirical results show the significantly negative correlation between EPU and stock market crash risk, indicating the aggravat…
▽ More
This paper investigates the impact of economic policy uncertainty (EPU) on the crash risk of US stock market during the COVID-19 pandemic. To this end, we use the GARCH-S (GARCH with skewness) model to estimate daily skewness as a proxy for the stock market crash risk. The empirical results show the significantly negative correlation between EPU and stock market crash risk, indicating the aggravation of EPU increase the crash risk. Moreover, the negative correlation gets stronger after the global COVID-19 outbreak, which shows the crash risk of the US stock market will be more affected by EPU during the pandemic.
△ Less
Submitted 17 August, 2021; v1 submitted 2 October, 2020;
originally announced October 2020.
-
The impact of COVID-19 on the stock market crash risk in China
Authors:
Zhifeng Liu,
Toan Luu Duc Huynh,
Peng-Fei Dai
Abstract:
This study investigates the impact of the COVID-19 pandemic on the stock market crash risk in China. For this purpose, we first estimated the conditional skewness of the return distribution from a GARCH with skewness (GARCH-S) model as the proxy for the equity market crash risk of the Shanghai Stock Exchange. We then constructed a fear index for COVID-19 using data from the Baidu Index. Based on t…
▽ More
This study investigates the impact of the COVID-19 pandemic on the stock market crash risk in China. For this purpose, we first estimated the conditional skewness of the return distribution from a GARCH with skewness (GARCH-S) model as the proxy for the equity market crash risk of the Shanghai Stock Exchange. We then constructed a fear index for COVID-19 using data from the Baidu Index. Based on the findings, conditional skewness reacts negatively to daily growth in total confirmed cases, indicating that the pandemic increases stock market crash risk. Moreover, the fear sentiment exacerbates such risk, especially with regard to the impact of COVID-19. In other words, when the fear sentiment is high, the stock market crash risk is more strongly affected by the pandemic. Our evidence is robust for the number of daily deaths and global cases.
△ Less
Submitted 17 August, 2021; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing
Authors:
Qiong Wu,
Christopher G. Brinton,
Zheng Zhang,
Andrea Pizzoferrato,
Zhenming Liu,
Mihai Cucuringu
Abstract:
Pricing assets has attracted significant attention from the financial technology community. We observe that the existing solutions overlook the cross-sectional effects and not fully leveraged the heterogeneous data sets, leading to sub-optimal performance.
To this end, we propose an end-to-end deep learning framework to price the assets. Our framework possesses two main properties: 1) We propose…
▽ More
Pricing assets has attracted significant attention from the financial technology community. We observe that the existing solutions overlook the cross-sectional effects and not fully leveraged the heterogeneous data sets, leading to sub-optimal performance.
To this end, we propose an end-to-end deep learning framework to price the assets. Our framework possesses two main properties: 1) We propose Equity2Vec, a graph-based component that effectively captures both long-term and evolving cross-sectional interactions. 2) The framework simultaneously leverages all the available heterogeneous alpha sources including technical indicators, financial news signals, and cross-sectional signals. Experimental results on datasets from the real-world stock market show that our approach outperforms the existing state-of-the-art approaches. Furthermore, market trading simulations demonstrate that our framework monetizes the signals effectively.
△ Less
Submitted 26 October, 2021; v1 submitted 7 September, 2019;
originally announced September 2019.
-
Distributions of Historic Market Data -- Relaxation and Correlations
Authors:
M. Dashti Moghaddam,
Zhiyuan Liu,
R. A. Serota
Abstract:
We investigate relaxation and correlations in a class of mean-reverting models for stochastic variances. We derive closed-form expressions for the correlation functions and leverage for a general form of the stochastic term. We also discuss correlation functions and leverage for three specific models -- multiplicative, Heston (Cox-Ingersoll-Ross) and combined multiplicative-Heston -- whose steady-…
▽ More
We investigate relaxation and correlations in a class of mean-reverting models for stochastic variances. We derive closed-form expressions for the correlation functions and leverage for a general form of the stochastic term. We also discuss correlation functions and leverage for three specific models -- multiplicative, Heston (Cox-Ingersoll-Ross) and combined multiplicative-Heston -- whose steady-state probability density functions are Gamma, Inverse Gamma and Beta Prime respectively, the latter two exhibiting "fat" tails. For the Heston model, we apply the eigenvalue analysis of the Fokker-Planck equation to derive the correlation function -- in agreement with the general analysis -- and to identify a series of time scales, which are observable in relaxation of cumulants on approach to the steady state. We test our findings on a very large set of historic financial markets data.
△ Less
Submitted 24 February, 2020; v1 submitted 11 July, 2019;
originally announced July 2019.
-
Distributions of Historic Market Data -- Implied and Realized Volatility
Authors:
M. Dashti Moghaddam,
Zhiyuan Liu,
R. A. Serota
Abstract:
We undertake a systematic comparison between implied volatility, as represented by VIX (new methodology) and VXO (old methodology), and realized volatility. We compare visually and statistically distributions of realized and implied variance (volatility squared) and study the distribution of their ratio. We find that the ratio is best fitted by heavy-tailed -- lognormal and fat-tailed (power-law)…
▽ More
We undertake a systematic comparison between implied volatility, as represented by VIX (new methodology) and VXO (old methodology), and realized volatility. We compare visually and statistically distributions of realized and implied variance (volatility squared) and study the distribution of their ratio. We find that the ratio is best fitted by heavy-tailed -- lognormal and fat-tailed (power-law) -- distributions, depending on whether preceding or concurrent month of realized variance is used. We do not find substantial difference in accuracy between VIX and VXO. Additionally, we study the variance of theoretical realized variance for Heston and multiplicative models of stochastic volatility and compare those with realized variance obtained from historic market data.
△ Less
Submitted 14 April, 2018;
originally announced April 2018.
-
Distributions of Historic Market Data - Stock Returns
Authors:
Zhiyuan Liu,
M. Dashti Moghaddam,
R. A. Serota
Abstract:
We show that the moments of the distribution of historic stock returns are in excellent agreement with the Heston model and not with the multiplicative model, which predicts power-law tails of volatility and stock returns. We also show that the mean realized variance of returns is a linear function of the number of days over which the returns are calculated. The slope is determined by the mean val…
▽ More
We show that the moments of the distribution of historic stock returns are in excellent agreement with the Heston model and not with the multiplicative model, which predicts power-law tails of volatility and stock returns. We also show that the mean realized variance of returns is a linear function of the number of days over which the returns are calculated. The slope is determined by the mean value of the variance (squared volatility) in the mean-reverting stochastic volatility models, such as Heston and multiplicative, independent of stochasticity. The distribution function of stock returns, which rescales with the increase of the number of days of return, is obtained from the steady-state variance distribution function using the product distribution with the normal distribution.
△ Less
Submitted 5 December, 2017; v1 submitted 29 November, 2017;
originally announced November 2017.
-
On absence of steady state in the Bouchaud-Mézard network model
Authors:
Zhiyuan Liu,
R. A. Serota
Abstract:
In the limit of infinite number of nodes (agents), the Itô-reduced Bouchaud-Mézard network model of economic exchange has a time-independent mean and a steady-state inverse gamma distribution. We show that for a finite number of nodes the mean is actually distributed as a time-dependent lognormal and inverse gamma is quasi-stationary, with the time-dependent scale parameter.
In the limit of infinite number of nodes (agents), the Itô-reduced Bouchaud-Mézard network model of economic exchange has a time-independent mean and a steady-state inverse gamma distribution. We show that for a finite number of nodes the mean is actually distributed as a time-dependent lognormal and inverse gamma is quasi-stationary, with the time-dependent scale parameter.
△ Less
Submitted 7 April, 2017;
originally announced April 2017.
-
Stock Market Prediction from WSJ: Text Mining via Sparse Matrix Factorization
Authors:
Felix Ming Fai Wong,
Zhenming Liu,
Mung Chiang
Abstract:
We revisit the problem of predicting directional movements of stock prices based on news articles: here our algorithm uses daily articles from The Wall Street Journal to predict the closing stock prices on the same day. We propose a unified latent space model to characterize the "co-movements" between stock prices and news articles. Unlike many existing approaches, our new model is able to simulta…
▽ More
We revisit the problem of predicting directional movements of stock prices based on news articles: here our algorithm uses daily articles from The Wall Street Journal to predict the closing stock prices on the same day. We propose a unified latent space model to characterize the "co-movements" between stock prices and news articles. Unlike many existing approaches, our new model is able to simultaneously leverage the correlations: (a) among stock prices, (b) among news articles, and (c) between stock prices and news articles. Thus, our model is able to make daily predictions on more than 500 stocks (most of which are not even mentioned in any news article) while having low complexity. We carry out extensive backtesting on trading strategies based on our algorithm. The result shows that our model has substantially better accuracy rate (55.7%) compared to many widely used algorithms. The return (56%) and Sharpe ratio due to a trading strategy based on our model are also much higher than baseline indices.
△ Less
Submitted 27 June, 2014;
originally announced June 2014.
-
From Black-Scholes to Online Learning: Dynamic Hedging under Adversarial Environments
Authors:
Henry Lam,
Zhenming Liu
Abstract:
We consider a non-stochastic online learning approach to price financial options by modeling the market dynamic as a repeated game between the nature (adversary) and the investor. We demonstrate that such framework yields analogous structure as the Black-Scholes model, the widely popular option pricing model in stochastic finance, for both European and American options with convex payoffs. In the…
▽ More
We consider a non-stochastic online learning approach to price financial options by modeling the market dynamic as a repeated game between the nature (adversary) and the investor. We demonstrate that such framework yields analogous structure as the Black-Scholes model, the widely popular option pricing model in stochastic finance, for both European and American options with convex payoffs. In the case of non-convex options, we construct approximate pricing algorithms, and demonstrate that their efficiency can be analyzed through the introduction of an artificial probability measure, in parallel to the so-called risk-neutral measure in the finance literature, even though our framework is completely adversarial. Continuous-time convergence results and extensions to incorporate price jumps are also presented.
△ Less
Submitted 23 June, 2014;
originally announced June 2014.
-
Forecasting Value-at-Risk with Time-Varying Variance, Skewness and Kurtosis in an Exponential Weighted Moving Average Framework
Authors:
A. Gabrielsen,
P. Zagaglia,
A. Kirchner,
Z. Liu
Abstract:
This paper provides an insight to the time-varying dynamics of the shape of the distribution of financial return series by proposing an exponential weighted moving average model that jointly estimates volatility, skewness and kurtosis over time using a modified form of the Gram-Charlier density in which skewness and kurtosis appear directly in the functional form of this density. In this setting V…
▽ More
This paper provides an insight to the time-varying dynamics of the shape of the distribution of financial return series by proposing an exponential weighted moving average model that jointly estimates volatility, skewness and kurtosis over time using a modified form of the Gram-Charlier density in which skewness and kurtosis appear directly in the functional form of this density. In this setting VaR can be described as a function of the time-varying higher moments by applying the Cornish-Fisher expansion series of the first four moments. An evaluation of the predictive performance of the proposed model in the estimation of 1-day and 10-day VaR forecasts is performed in comparison with the historical simulation, filtered historical simulation and GARCH model. The adequacy of the VaR forecasts is evaluated under the unconditional, independence and conditional likelihood ratio tests as well as Basel II regulatory tests. The results presented have significant implications for risk management, trading and hedging activities as well as in the pricing of equity derivatives.
△ Less
Submitted 6 June, 2012;
originally announced June 2012.