Search | arXiv e-print repository

Towards Competent AI for Fundamental Analysis in Finance: A Benchmark Dataset and Evaluation

Authors: Zonghan Wu, Junlin Wang, Congyuan Zou, Chenhan Wang, Yilei Shao

Abstract: Generative AI, particularly large language models (LLMs), is beginning to transform the financial industry by automating tasks and helping to make sense of complex financial information. One especially promising use case is the automatic creation of fundamental analysis reports, which are essential for making informed investment decisions, evaluating credit risks, guiding corporate mergers, etc. W… ▽ More Generative AI, particularly large language models (LLMs), is beginning to transform the financial industry by automating tasks and helping to make sense of complex financial information. One especially promising use case is the automatic creation of fundamental analysis reports, which are essential for making informed investment decisions, evaluating credit risks, guiding corporate mergers, etc. While LLMs attempt to generate these reports from a single prompt, the risks of inaccuracy are significant. Poor analysis can lead to misguided investments, regulatory issues, and loss of trust. Existing financial benchmarks mainly evaluate how well LLMs answer financial questions but do not reflect performance in real-world tasks like generating financial analysis reports. In this paper, we propose FinAR-Bench, a solid benchmark dataset focusing on financial statement analysis, a core competence of fundamental analysis. To make the evaluation more precise and reliable, we break this task into three measurable steps: extracting key information, calculating financial indicators, and applying logical reasoning. This structured approach allows us to objectively assess how well LLMs perform each step of the process. Our findings offer a clear understanding of LLMs current strengths and limitations in fundamental analysis and provide a more practical way to benchmark their performance in real-world financial settings. △ Less

Submitted 22 May, 2025; originally announced June 2025.

arXiv:2506.02796 [pdf, ps, other]

Deep Learning Enhanced Multivariate GARCH

Authors: Haoyuan Wang, Chen Liu, Minh-Ngoc Tran, Chao Wang

Abstract: This paper introduces a novel multivariate volatility modeling framework, named Long Short-Term Memory enhanced BEKK (LSTM-BEKK), that integrates deep learning into multivariate GARCH processes. By combining the flexibility of recurrent neural networks with the econometric structure of BEKK models, our approach is designed to better capture nonlinear, dynamic, and high-dimensional dependence struc… ▽ More This paper introduces a novel multivariate volatility modeling framework, named Long Short-Term Memory enhanced BEKK (LSTM-BEKK), that integrates deep learning into multivariate GARCH processes. By combining the flexibility of recurrent neural networks with the econometric structure of BEKK models, our approach is designed to better capture nonlinear, dynamic, and high-dimensional dependence structures in financial return data. The proposed model addresses key limitations of traditional multivariate GARCH-based methods, particularly in capturing persistent volatility clustering and asymmetric co-movement across assets. Leveraging the data-driven nature of LSTMs, the framework adapts effectively to time-varying market conditions, offering improved robustness and forecasting performance. Empirical results across multiple equity markets confirm that the LSTM-BEKK model achieves superior performance in terms of out-of-sample portfolio risk forecast, while maintaining the interpretability from the BEKK models. These findings highlight the potential of hybrid econometric-deep learning models in advancing financial risk management and multivariate volatility forecasting. △ Less

Submitted 3 June, 2025; originally announced June 2025.

arXiv:2506.01423 [pdf, ps, other]

FinRobot: Generative Business Process AI Agents for Enterprise Resource Planning in Finance

Authors: Hongyang Yang, Likun Lin, Yang She, Xinyu Liao, Jiaoyang Wang, Runjia Zhang, Yuquan Mo, Christina Dan Wang

Abstract: Enterprise Resource Planning (ERP) systems serve as the digital backbone of modern financial institutions, yet they continue to rely on static, rule-based workflows that limit adaptability, scalability, and intelligence. As business operations grow more complex and data-rich, conventional ERP platforms struggle to integrate structured and unstructured data in real time and to accommodate dynamic,… ▽ More Enterprise Resource Planning (ERP) systems serve as the digital backbone of modern financial institutions, yet they continue to rely on static, rule-based workflows that limit adaptability, scalability, and intelligence. As business operations grow more complex and data-rich, conventional ERP platforms struggle to integrate structured and unstructured data in real time and to accommodate dynamic, cross-functional workflows. In this paper, we present the first AI-native, agent-based framework for ERP systems, introducing a novel architecture of Generative Business Process AI Agents (GBPAs) that bring autonomy, reasoning, and dynamic optimization to enterprise workflows. The proposed system integrates generative AI with business process modeling and multi-agent orchestration, enabling end-to-end automation of complex tasks such as budget planning, financial reporting, and wire transfer processing. Unlike traditional workflow engines, GBPAs interpret user intent, synthesize workflows in real time, and coordinate specialized sub-agents for modular task execution. We validate the framework through case studies in bank wire transfers and employee reimbursements, two representative financial workflows with distinct complexity and data modalities. Results show that GBPAs achieve up to 40% reduction in processing time, 94% drop in error rate, and improved regulatory compliance by enabling parallelism, risk control insertion, and semantic reasoning. These findings highlight the potential of GBPAs to bridge the gap between generative AI capabilities and enterprise-grade automation, laying the groundwork for the next generation of intelligent ERP systems. △ Less

Submitted 2 June, 2025; originally announced June 2025.

arXiv:2505.08654 [pdf, ps, other]

An Efficient Multi-scale Leverage Effect Estimator under Dependent Microstructure Noise

Authors: Ziyang Xiong, Zhao Chen, Christina Dan Wang

Abstract: Estimating the leverage effect from high-frequency data is vital but challenged by complex, dependent microstructure noise, often exhibiting non-Gaussian higher-order moments. This paper introduces a novel multi-scale framework for efficient and robust leverage effect estimation under such flexible noise structures. We develop two new estimators, the Subsampling-and-Averaging Leverage Effect (SALE… ▽ More Estimating the leverage effect from high-frequency data is vital but challenged by complex, dependent microstructure noise, often exhibiting non-Gaussian higher-order moments. This paper introduces a novel multi-scale framework for efficient and robust leverage effect estimation under such flexible noise structures. We develop two new estimators, the Subsampling-and-Averaging Leverage Effect (SALE) and the Multi-Scale Leverage Effect (MSLE), which adapt subsampling and multi-scale approaches holistically using a unique shifted window technique. This design simplifies the multi-scale estimation procedure and enhances noise robustness without requiring the pre-averaging approach. We establish central limit theorems and stable convergence, with MSLE achieving convergence rates of an optimal $n^{-1/4}$ and a near-optimal $n^{-1/9}$ for the noise-free and noisy settings, respectively. A cornerstone of our framework's efficiency is a specifically designed MSLE weighting strategy that leverages covariance structures across scales. This significantly reduces asymptotic variance and, critically, yields substantially smaller finite-sample errors than existing methods under both noise-free and realistic noisy settings. Extensive simulations and empirical analyses confirm the superior efficiency, robustness, and practical advantages of our approach. △ Less

Submitted 13 May, 2025; originally announced May 2025.

arXiv:2505.06864 [pdf, ps, other]

NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks

Authors: Shunyao Wang, Ming Cheng, Christina Dan Wang

Abstract: Stochastic Discount Factor (SDF) models provide a unified framework for asset pricing and risk assessment, yet traditional formulations struggle to incorporate unstructured textual information. We introduce NewsNet-SDF, a novel deep learning framework that seamlessly integrates pretrained language model embeddings with financial time series through adversarial networks. Our multimodal architecture… ▽ More Stochastic Discount Factor (SDF) models provide a unified framework for asset pricing and risk assessment, yet traditional formulations struggle to incorporate unstructured textual information. We introduce NewsNet-SDF, a novel deep learning framework that seamlessly integrates pretrained language model embeddings with financial time series through adversarial networks. Our multimodal architecture processes financial news using GTE-multilingual models, extracts temporal patterns from macroeconomic data via LSTM networks, and normalizes firm characteristics, fusing these heterogeneous information sources through an innovative adversarial training mechanism. Our dataset encompasses approximately 2.5 million news articles and 10,000 unique securities, addressing the computational challenges of processing and aligning text data with financial time series. Empirical evaluations on U.S. equity data (1980-2022) demonstrate NewsNet-SDF substantially outperforms alternatives with a Sharpe ratio of 2.80. The model shows a 471% improvement over CAPM, over 200% improvement versus traditional SDF implementations, and a 74% reduction in pricing errors compared to the Fama-French five-factor model. In comprehensive comparisons, our deep learning approach consistently outperforms traditional, modern, and other neural asset pricing models across all key metrics. Ablation studies confirm that text embeddings contribute significantly more to model performance than macroeconomic features, with news-derived principal components ranking among the most influential determinants of SDF dynamics. These results validate the effectiveness of our multimodal deep learning approach in integrating unstructured text with traditional financial data for more accurate asset pricing, providing new insights for digital intelligent decision-making in financial technology. △ Less

Submitted 11 May, 2025; originally announced May 2025.

arXiv:2503.20787 [pdf, ps, other]

Advanced simulation paradigm of human behaviour unveils complex financial systemic projection

Authors: Cheng Wang, Chuwen Wang, Shirong Zeng, Jianguo Liu, Changjun Jiang

Abstract: The high-order complexity of human behaviour is likely the root cause of extreme difficulty in financial market projections. We consider that behavioural simulation can unveil systemic dynamics to support analysis. Simulating diverse human groups must account for the behavioural heterogeneity, especially in finance. To address the fidelity of simulated agents, on the basis of agent-based modeling,… ▽ More The high-order complexity of human behaviour is likely the root cause of extreme difficulty in financial market projections. We consider that behavioural simulation can unveil systemic dynamics to support analysis. Simulating diverse human groups must account for the behavioural heterogeneity, especially in finance. To address the fidelity of simulated agents, on the basis of agent-based modeling, we propose a new paradigm of behavioural simulation where each agent is supported and driven by a hierarchical knowledge architecture. This architecture, integrating language and professional models, imitates behavioural processes in specific scenarios. Evaluated on futures markets, our simulator achieves a 13.29% deviation in simulating crisis scenarios whose price increase rate reaches 285.34%. Under normal conditions, our simulator also exhibits lower mean square error in predicting futures price of specific commodities. This technique bridges non-quantitative information with diverse market behaviour, offering a promising platform to simulate investor behaviour and its impact on market dynamics. △ Less

Submitted 31 May, 2025; v1 submitted 18 February, 2025; originally announced March 2025.

arXiv:2503.05185 [pdf, other]

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Authors: Fengbin Zhu, Junfeng Li, Liangming Pan, Wenjie Wang, Fuli Feng, Chao Wang, Huanbo Luan, Tat-Seng Chua

Abstract: Finance decision-making often relies on in-depth data analysis across various data sources, including financial tables, news articles, stock prices, etc. In this work, we introduce FinTMMBench, the first comprehensive benchmark for evaluating temporal-aware multi-modal Retrieval-Augmented Generation (RAG) systems in finance. Built from heterologous data of NASDAQ 100 companies, FinTMMBench offers… ▽ More Finance decision-making often relies on in-depth data analysis across various data sources, including financial tables, news articles, stock prices, etc. In this work, we introduce FinTMMBench, the first comprehensive benchmark for evaluating temporal-aware multi-modal Retrieval-Augmented Generation (RAG) systems in finance. Built from heterologous data of NASDAQ 100 companies, FinTMMBench offers three significant advantages. 1) Multi-modal Corpus: It encompasses a hybrid of financial tables, news articles, daily stock prices, and visual technical charts as the corpus. 2) Temporal-aware Questions: Each question requires the retrieval and interpretation of its relevant data over a specific time period, including daily, weekly, monthly, quarterly, and annual periods. 3) Diverse Financial Analysis Tasks: The questions involve 10 different tasks, including information extraction, trend analysis, sentiment analysis and event detection, etc. We further propose a novel TMMHybridRAG method, which first leverages LLMs to convert data from other modalities (e.g., tabular, visual and time-series data) into textual format and then incorporates temporal information in each node when constructing graphs and dense indexes. Its effectiveness has been validated in extensive experiments, but notable gaps remain, highlighting the challenges presented by our FinTMMBench. △ Less

Submitted 7 March, 2025; originally announced March 2025.

Comments: Under review

arXiv:2502.12957 [pdf, ps, other]

A measure-valued HJB perspective on Bayesian optimal adaptive control

Authors: Alexander M. G. Cox, Sigrid Källblad, Chaorui Wang

Abstract: We consider a Bayesian adaptive optimal stochastic control problem where a hidden static signal has a non-separable influence on the drift of a noisy observation. Being allowed to control the specific form of this dependence, we aim at optimising a cost functional depending on the posterior distribution of the hidden signal. Expressing the dynamics of this posterior distribution in the observation… ▽ More We consider a Bayesian adaptive optimal stochastic control problem where a hidden static signal has a non-separable influence on the drift of a noisy observation. Being allowed to control the specific form of this dependence, we aim at optimising a cost functional depending on the posterior distribution of the hidden signal. Expressing the dynamics of this posterior distribution in the observation filtration, we embed our problem into a genuinely infinite-dimensional stochastic control problem featuring so-called measure-valued martingales. We address this problem by use of viscosity theory and approximation arguments. Specifically, we show equivalence to a corresponding weak formulation, characterise the optimal value of the problem in terms of the unique continuous viscosity solution of an associated HJB equation, and construct a piecewise constant and arbitrarily-close-to-optimal control to our main problem of study. △ Less

Submitted 18 February, 2025; originally announced February 2025.

MSC Class: 49L25; 60G35; 60H10; 62M20; 93E35; 93E10

arXiv:2412.10823 [pdf, ps, other]

FinGPT: Enhancing Sentiment-Based Stock Movement Prediction with Dissemination-Aware and Context-Enriched LLMs

Authors: Yixuan Liang, Yuncong Liu, Neng Wang, Hongyang Yang, Boyu Zhang, Christina Dan Wang

Abstract: Financial sentiment analysis is crucial for understanding the influence of news on stock prices. Recently, large language models (LLMs) have been widely adopted for this purpose due to their advanced text analysis capabilities. However, these models often only consider the news content itself, ignoring its dissemination, which hampers accurate prediction of short-term stock movements. Additionally… ▽ More Financial sentiment analysis is crucial for understanding the influence of news on stock prices. Recently, large language models (LLMs) have been widely adopted for this purpose due to their advanced text analysis capabilities. However, these models often only consider the news content itself, ignoring its dissemination, which hampers accurate prediction of short-term stock movements. Additionally, current methods often lack sufficient contextual data and explicit instructions in their prompts, limiting LLMs' ability to interpret news. In this paper, we propose a data-driven approach that enhances LLM-powered sentiment-based stock movement predictions by incorporating news dissemination breadth, contextual data, and explicit instructions. We cluster recent company-related news to assess its reach and influence, enriching prompts with more specific data and precise instructions. This data is used to construct an instruction tuning dataset to fine-tune an LLM for predicting short-term stock price movements. Our experimental results show that our approach improves prediction accuracy by 8\% compared to existing methods. △ Less

Submitted 22 June, 2025; v1 submitted 14 December, 2024; originally announced December 2024.

Comments: 1st Workshop on Preparing Good Data for Generative AI: Challenges and Approaches@ AAAI 2025, ai4finance.org

arXiv:2412.00062 [pdf, other]

Deep Learning-Based Electricity Price Forecast for Virtual Bidding in Wholesale Electricity Market

Authors: Xuesong Wang, Sharaf K. Magableh, Oraib Dawaghreh, Caisheng Wang, Jiaxuan Gong, Zhongyang Zhao, Michael H. Liao

Abstract: Virtual bidding plays an important role in two-settlement electric power markets, as it can reduce discrepancies between day-ahead and real-time markets. Renewable energy penetration increases volatility in electricity prices, making accurate forecasting critical for virtual bidders, reducing uncertainty and maximizing profits. This study presents a Transformer-based deep learning model to forecas… ▽ More Virtual bidding plays an important role in two-settlement electric power markets, as it can reduce discrepancies between day-ahead and real-time markets. Renewable energy penetration increases volatility in electricity prices, making accurate forecasting critical for virtual bidders, reducing uncertainty and maximizing profits. This study presents a Transformer-based deep learning model to forecast the price spread between real-time and day-ahead electricity prices in the ERCOT (Electric Reliability Council of Texas) market. The proposed model leverages various time-series features, including load forecasts, solar and wind generation forecasts, and temporal attributes. The model is trained under realistic constraints and validated using a walk-forward approach by updating the model every week. Based on the price spread prediction results, several trading strategies are proposed and the most effective strategy for maximizing cumulative profit under realistic market conditions is identified through backtesting. The results show that the strategy of trading only at the peak hour with a precision score of over 50% produces nearly consistent profit over the test period. The proposed method underscores the importance of an accurate electricity price forecasting model and introduces a new method of evaluating the price forecast model from a virtual bidder's perspective, providing valuable insights for future research. △ Less

Submitted 25 November, 2024; originally announced December 2024.

Comments: Submitted to 2025 IEEE PES General Meeting

arXiv:2411.17136 [pdf, other]

Autoencoder Enhanced Realised GARCH on Volatility Forecasting

Authors: Qianli Zhao, Chao Wang, Richard Gerlach, Giuseppe Storti, Lingxiang Zhang

Abstract: Realised volatility has become increasingly prominent in volatility forecasting due to its ability to capture intraday price fluctuations. With a growing variety of realised volatility estimators, each with unique advantages and limitations, selecting an optimal estimator may introduce challenges. In this thesis, aiming to synthesise the impact of various realised volatility measures on volatility… ▽ More Realised volatility has become increasingly prominent in volatility forecasting due to its ability to capture intraday price fluctuations. With a growing variety of realised volatility estimators, each with unique advantages and limitations, selecting an optimal estimator may introduce challenges. In this thesis, aiming to synthesise the impact of various realised volatility measures on volatility forecasting, we propose an extension of the Realised GARCH model that incorporates an autoencoder-generated synthetic realised measure, combining the information from multiple realised measures in a nonlinear manner. Our proposed model extends existing linear methods, such as Principal Component Analysis and Independent Component Analysis, to reduce the dimensionality of realised measures. The empirical evaluation, conducted across four major stock markets from January 2000 to June 2022 and including the period of COVID-19, demonstrates both the feasibility of applying an autoencoder to synthesise volatility measures and the superior effectiveness of the proposed model in one-step-ahead rolling volatility forecasting. The model exhibits enhanced flexibility in parameter estimations across each rolling window, outperforming traditional linear approaches. These findings indicate that nonlinear dimension reduction offers further adaptability and flexibility in improving the synthetic realised measure, with promising implications for future volatility forecasting applications. △ Less

Submitted 26 November, 2024; originally announced November 2024.

Comments: 48 pages, 6 figures

arXiv:2410.22706 [pdf, other]

Graph Signal Processing for Global Stock Market Realized Volatility Forecasting

Authors: Zhengyang Chi, Junbin Gao, Chao Wang

Abstract: This paper introduces an innovative realized volatility (RV) forecasting framework that extends the conventional Heterogeneous Auto-Regressive (HAR) model via integrating the Graph Signal Processing (GSP) technique. The volatility spillover effect is embedded and modeled in the proposed framework, which employs the graph Fourier transformation method to effectively analyze the global stock market… ▽ More This paper introduces an innovative realized volatility (RV) forecasting framework that extends the conventional Heterogeneous Auto-Regressive (HAR) model via integrating the Graph Signal Processing (GSP) technique. The volatility spillover effect is embedded and modeled in the proposed framework, which employs the graph Fourier transformation method to effectively analyze the global stock market dynamics in the spectral domain. In addition, convolution filters with learnable weights are applied to capture the historical mid-term and long-term volatility patterns. The empirical study is conducted with RV data of $24$ global stock market indices with around $3500$ common trading days from May 2002 to June 2022. The proposed model's short-term, middle-term and long-term RV forecasting performance is compared with various HAR type models and the graph neural network based HAR model. The results show that the proposed model consistently outperforms all other models considered in the study, demonstrating the effectiveness of integrating the GSP technique into the HAR model for RV forecasting. △ Less

Submitted 26 February, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

arXiv:2409.15320 [pdf, other]

Global Stock Market Volatility Forecasting Incorporating Dynamic Graphs and All Trading Days

Authors: Zhengyang Chi, Junbin Gao, Chao Wang

Abstract: This study introduces a global stock market volatility forecasting model that enhances forecasting accuracy and practical utility in real-world financial decision-making by integrating dynamic graph structures and encompassing the union of active trading days of different stock markets. The model employs a spatial-temporal graph neural network (GNN) architecture to capture the volatility spillover… ▽ More This study introduces a global stock market volatility forecasting model that enhances forecasting accuracy and practical utility in real-world financial decision-making by integrating dynamic graph structures and encompassing the union of active trading days of different stock markets. The model employs a spatial-temporal graph neural network (GNN) architecture to capture the volatility spillover effect, where shocks in one market spread to others through the interconnective global economy. By calculating the volatility spillover index to depict the volatility network as graphs, the model effectively mirrors the volatility dynamics for the chosen stock market indices. In the empirical analysis, the proposed model surpasses the benchmark model in all forecasting scenarios and is shown to be sensitive to the underlying volatility interrelationships. △ Less

Submitted 30 September, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

arXiv:2408.13588 [pdf, ps, other]

Loss-based Bayesian Sequential Prediction of Value at Risk with a Long-Memory and Non-linear Realized Volatility Model

Authors: Rangika Peiris, Minh-Ngoc Tran, Chao Wang, Richard Gerlach

Abstract: A long memory and non-linear realized volatility model class is proposed for direct Value at Risk (VaR) forecasting. This model, referred to as RNN-HAR, extends the heterogeneous autoregressive (HAR) model, a framework known for efficiently capturing long memory in realized measures, by integrating a Recurrent Neural Network (RNN) to handle non-linear dynamics. Loss-based generalized Bayesian infe… ▽ More A long memory and non-linear realized volatility model class is proposed for direct Value at Risk (VaR) forecasting. This model, referred to as RNN-HAR, extends the heterogeneous autoregressive (HAR) model, a framework known for efficiently capturing long memory in realized measures, by integrating a Recurrent Neural Network (RNN) to handle non-linear dynamics. Loss-based generalized Bayesian inference with Sequential Monte Carlo is employed for model estimation and sequential prediction in RNN HAR. The empirical analysis is conducted using daily closing prices and realized measures from 2000 to 2022 across 31 market indices. The proposed models one step ahead VaR forecasting performance is compared against a basic HAR model and its extensions. The results demonstrate that the proposed RNN-HAR model consistently outperforms all other models considered in the study. △ Less

Submitted 24 August, 2024; originally announced August 2024.

arXiv:2407.16566 [pdf, ps, other]

doi 10.1109/TCSS.2025.3574236

Alleviating Non-identifiability: a High-fidelity Calibration Objective for Financial Market Simulation with Multivariate Time Series Data

Authors: Chenkai Wang, Junji Ren, Peng Yang

Abstract: The non-identifiability issue has been frequently reported in social simulation works, where different parameters of an agent-based simulation model yield indistinguishable simulated time series data under certain discrepancy metrics. This issue largely undermines the simulation fidelity yet lacks dedicated investigations. This paper theoretically demonstrates that incorporating multiple time seri… ▽ More The non-identifiability issue has been frequently reported in social simulation works, where different parameters of an agent-based simulation model yield indistinguishable simulated time series data under certain discrepancy metrics. This issue largely undermines the simulation fidelity yet lacks dedicated investigations. This paper theoretically demonstrates that incorporating multiple time series data features during the model calibration phase can exponentially alleviate non-identifiability as the number of features increases. To implement this theoretical finding, a maximization-based aggregation function is proposed based on existing discrepancy metrics to form a new calibration objective function. For verification, the task of calibrating the Financial Market Simulation (FMS), a typical yet complex social simulation, is considered. Empirical studies confirm the significant improvements in alleviating the non-identifiability of calibration tasks. Furthermore, as a model-agnostic method, it achieves much higher simulation fidelity of the chosen FMS model on both synthetic and real market data. Moreover, it is both theoretically and empirically analyzed that as long as the features are selected and not linearly correlated, they can contribute to alleviation, which demonstrates the robustness of the proposed objective. Hence, this work is expected to provide not only a rigorous understanding of non-identifiability in social simulation but also an off-the-shelf high-fidelity calibration objective function for FMS. △ Less

Submitted 21 June, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

Comments: 12 pages, 11 figures

arXiv:2406.14537 [pdf, other]

MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

Authors: Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An

Abstract: High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making probl… ▽ More High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making problems, \emph{e.g.,} hierarchical reinforcement learning (HRL) has shown its promising performance on second-level HFT by training a router to select only one sub-agent from the agent pool to execute the current transaction. However, existing RL methods for HFT still have some defects: 1) standard RL-based trading agents suffer from the overfitting issue, preventing them from making effective policy adjustments based on financial context; 2) due to the rapid changes in market conditions, investment decisions made by an individual agent are usually one-sided and highly biased, which might lead to significant loss in extreme markets. To tackle these problems, we propose a novel Memory Augmented Context-aware Reinforcement learning method On HFT, \emph{a.k.a.} MacroHFT, which consists of two training phases: 1) we first train multiple types of sub-agents with the market data decomposed according to various financial indicators, specifically market trend and volatility, where each agent owns a conditional adapter to adjust its trading policy according to market conditions; 2) then we train a hyper-agent to mix the decisions from these sub-agents and output a consistently profitable meta-policy to handle rapid market fluctuations, equipped with a memory mechanism to enhance the capability of decision-making. Extensive experiments on various cryptocurrency markets demonstrate that MacroHFT can achieve state-of-the-art performance on minute-level trading tasks. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: Accepted to KDD 2024

arXiv:2406.01335 [pdf, other]

Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance

Authors: Xi-Ning Zhuang, Zhao-Yun Chen, Cheng Xue, Xiao-Fan Xu, Chao Wang, Huan-Yu Liu, Tai-Ping Sun, Yun-Jie Wang, Yu-Chun Wu, Guo-Ping Guo

Abstract: Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-i… ▽ More Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-informed parameterized quantum circuit (SI-PQC) for efficiently preparing and training of quantum computational statistical models, including arbitrary distributions and their weighted mixtures. The SI-PQC features a static structure with trainable parameters, enabling in-depth optimized circuit compilation, exponential reductions in resource and time consumption, and improved trainability and interpretability for learning quantum states and classical model parameters simultaneously. As an efficient subroutine for preparing and learning in various quantum algorithms, the SI-PQC addresses the input bottleneck and facilitates the injection of prior knowledge. △ Less

Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 19 pages, 5 figures

arXiv:2405.14767 [pdf, other]

FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

Authors: Hongyang Yang, Boyu Zhang, Neng Wang, Cheng Guo, Xiaoli Zhang, Likun Lin, Junlin Wang, Tianyu Zhou, Mao Guan, Runjia Zhang, Christina Dan Wang

Abstract: As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim… ▽ More As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim to devise financial-specialized LLM-based toolchains and democratize access to them through open-source initiatives, promoting wider AI adoption in financial decision-making. In this paper, we introduce FinRobot, a novel open-source AI agent platform supporting multiple financially specialized AI agents, each powered by LLM. Specifically, the platform consists of four major layers: 1) the Financial AI Agents layer that formulates Financial Chain-of-Thought (CoT) by breaking sophisticated financial problems down into logical sequences; 2) the Financial LLM Algorithms layer dynamically configures appropriate model application strategies for specific tasks; 3) the LLMOps and DataOps layer produces accurate models by applying training/fine-tuning techniques and using task-relevant data; 4) the Multi-source LLM Foundation Models layer that integrates various LLMs and enables the above layers to access them directly. Finally, FinRobot provides hands-on for both professional-grade analysts and laypersons to utilize powerful AI techniques for advanced financial analysis. We open-source FinRobot at \url{https://github.com/AI4Finance-Foundation/FinRobot}. △ Less

Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: FinRobot Whitepaper V1.0

arXiv:2402.09985 [pdf, ps, other]

Semi-parametric financial risk forecasting incorporating multiple realized measures

Authors: Rangika Peiris, Chao Wang, Richard Gerlach, Minh-Ngoc Tran

Abstract: A semi-parametric joint Value-at-Risk (VaR) and Expected Shortfall (ES) forecasting framework employing multiple realized measures is developed. The proposed framework extends the realized exponential GARCH model to be semi-parametrically estimated, via a joint loss function, whilst extending existing quantile time series models to incorporate multiple realized measures. A quasi-likelihood is buil… ▽ More A semi-parametric joint Value-at-Risk (VaR) and Expected Shortfall (ES) forecasting framework employing multiple realized measures is developed. The proposed framework extends the realized exponential GARCH model to be semi-parametrically estimated, via a joint loss function, whilst extending existing quantile time series models to incorporate multiple realized measures. A quasi-likelihood is built, employing the asymmetric Laplace distribution that is directly linked to a joint loss function, which enables Bayesian inference for the proposed model. An adaptive Markov Chain Monte Carlo method is used for the model estimation. The empirical section evaluates the performance of the proposed framework with six stock markets from January 2000 to June 2022, covering the period of COVID-19. Three realized measures, including 5- minute realized variance, bi-power variation, and realized kernel, are incorporated and evaluated in the proposed framework. One-step-ahead VaR and ES forecasting results of the proposed model are compared to a range of parametric and semi-parametric models, lending support to the effectiveness of the proposed framework. △ Less

Submitted 5 December, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

arXiv:2401.08093 [pdf, ps, other]

A Two-Step Longstaff Schwartz Monte Carlo Approach to Game Option Pricing

Authors: Ce Wang

Abstract: We proposed a two-step Longstaff Schwartz Monte Carlo (LSMC) method with two regression models fitted at each time step to price game options. Although the original LSMC can be used to price game options with an enlarged range of path in regression and a modified cashflow updating rule, we identified a drawback of such approach, which motivated us to propose our approach. We implemented numerical… ▽ More We proposed a two-step Longstaff Schwartz Monte Carlo (LSMC) method with two regression models fitted at each time step to price game options. Although the original LSMC can be used to price game options with an enlarged range of path in regression and a modified cashflow updating rule, we identified a drawback of such approach, which motivated us to propose our approach. We implemented numerical examples with benchmarks using binomial tree and numerical PDE, and it showed that our method produces more reliable results comparing to the original LSMC. △ Less

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2312.10388 [pdf, other]

The Causal Impact of Credit Lines on Spending Distributions

Authors: Yijun Li, Cheuk Hang Leung, Xiangqian Sun, Chaoqun Wang, Yiyan Huang, Xing Yan, Qi Wu, Dongdong Wang, Zhixiang Huang

Abstract: Consumer credit services offered by e-commerce platforms provide customers with convenient loan access during shopping and have the potential to stimulate sales. To understand the causal impact of credit lines on spending, previous studies have employed causal estimators, based on direct regression (DR), inverse propensity weighting (IPW), and double machine learning (DML) to estimate the treatmen… ▽ More Consumer credit services offered by e-commerce platforms provide customers with convenient loan access during shopping and have the potential to stimulate sales. To understand the causal impact of credit lines on spending, previous studies have employed causal estimators, based on direct regression (DR), inverse propensity weighting (IPW), and double machine learning (DML) to estimate the treatment effect. However, these estimators do not consider the notion that an individual's spending can be understood and represented as a distribution, which captures the range and pattern of amounts spent across different orders. By disregarding the outcome as a distribution, valuable insights embedded within the outcome distribution might be overlooked. This paper develops a distribution-valued estimator framework that extends existing real-valued DR-, IPW-, and DML-based estimators to distribution-valued estimators within Rubin's causal framework. We establish their consistency and apply them to a real dataset from a large e-commerce platform. Our findings reveal that credit lines positively influence spending across all quantiles; however, as credit lines increase, consumers allocate more to luxuries (higher quantiles) than necessities (lower quantiles). △ Less

Submitted 16 December, 2023; originally announced December 2023.

arXiv:2310.07110 [pdf, other]

Valuation Duration of the Stock Market

Authors: Ye Li, Chen Wang

Abstract: At the peak of the tech bubble, only 0.57% of market valuation comes from dividends in the next year. Taking the ratio of total market value to the value of one-year dividends, we obtain a valuation-based duration of 175 years. In contrast, at the height of the global financial crisis, more than 2.2% of market value is from dividends in the next year, implying a duration of 46 years. What drives v… ▽ More At the peak of the tech bubble, only 0.57% of market valuation comes from dividends in the next year. Taking the ratio of total market value to the value of one-year dividends, we obtain a valuation-based duration of 175 years. In contrast, at the height of the global financial crisis, more than 2.2% of market value is from dividends in the next year, implying a duration of 46 years. What drives valuation duration? We find that market participants have limited information about cash flow beyond one year. Therefore, an increase in valuation duration is due to a decrease in the discount rate rather than good news about long-term growth. Accordingly, valuation duration negatively predicts annual market return with an out-of-sample R2 of 15%, robustly outperforming other predictors in the literature. While the price-dividend ratio reflects the overall valuation level, our valuation-based measure of duration captures the slope of the valuation term structure. We show that valuation duration, as a discount rate proxy, is a critical state variable that augments the price-dividend ratio in spanning the (latent) state space for stock-market dynamics. △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.04793 [pdf, other]

FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets

Authors: Neng Wang, Hongyang Yang, Christina Dan Wang

Abstract: In the swiftly expanding domain of Natural Language Processing (NLP), the potential of GPT-based models for the financial sector is increasingly evident. However, the integration of these models with financial datasets presents challenges, notably in determining their adeptness and relevance. This paper introduces a distinctive approach anchored in the Instruction Tuning paradigm for open-source l… ▽ More In the swiftly expanding domain of Natural Language Processing (NLP), the potential of GPT-based models for the financial sector is increasingly evident. However, the integration of these models with financial datasets presents challenges, notably in determining their adeptness and relevance. This paper introduces a distinctive approach anchored in the Instruction Tuning paradigm for open-source large language models, specifically adapted for financial contexts. Through this methodology, we capitalize on the interoperability of open-source models, ensuring a seamless and transparent integration. We begin by explaining the Instruction Tuning paradigm, highlighting its effectiveness for immediate integration. The paper presents a benchmarking scheme designed for end-to-end training and testing, employing a cost-effective progression. Firstly, we assess basic competencies and fundamental tasks, such as Named Entity Recognition (NER) and sentiment analysis to enhance specialization. Next, we delve into a comprehensive model, executing multi-task operations by amalgamating all instructional tunings to examine versatility. Finally, we explore the zero-shot capabilities by earmarking unseen tasks and incorporating novel datasets to understand adaptability in uncharted terrains. Such a paradigm fortifies the principles of openness and reproducibility, laying a robust foundation for future investigations in open-source financial large language models (FinLLMs). △ Less

Submitted 11 November, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

Comments: Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023

arXiv:2309.02072 [pdf, other]

Global Neural Networks and The Data Scaling Effect in Financial Time Series Forecasting

Authors: Chen Liu, Minh-Ngoc Tran, Chao Wang, Richard Gerlach, Robert Kohn

Abstract: Neural networks have revolutionized many empirical fields, yet their application to financial time series forecasting remains controversial. In this study, we demonstrate that the conventional practice of estimating models locally in data-scarce environments may underlie the mixed empirical performance observed in prior work. By focusing on volatility forecasting, we employ a dataset comprising ov… ▽ More Neural networks have revolutionized many empirical fields, yet their application to financial time series forecasting remains controversial. In this study, we demonstrate that the conventional practice of estimating models locally in data-scarce environments may underlie the mixed empirical performance observed in prior work. By focusing on volatility forecasting, we employ a dataset comprising over 10,000 global stocks and implement a global estimation strategy that pools information across cross-sections. Our econometric analysis reveals that forecasting accuracy improves markedly as the training dataset becomes larger and more heterogeneous. Notably, even with as little as 12 months of data, globally trained networks deliver robust predictions for individual stocks and portfolios that are not even in the training dataset. Furthermore, our interpretation of the model dynamics shows that these networks not only capture key stylized facts of volatility but also exhibit resilience to outliers and rapid adaptation to market regime changes. These findings underscore the importance of leveraging extensive and diverse datasets in financial forecasting and advocate for a shift from traditional local training approaches to integrated global estimation methods. △ Less

Submitted 20 February, 2025; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 25 pages, 5 figures

arXiv:2306.06031 [pdf, other]

FinGPT: Open-Source Financial Large Language Models

Authors: Hongyang Yang, Xiao-Yang Liu, Christina Dan Wang

Abstract: Large language models (LLMs) have shown the potential of revolutionizing natural language processing tasks in diverse domains, sparking great interest in finance. Accessing high-quality financial data is the first challenge for financial LLMs (FinLLMs). While proprietary models like BloombergGPT have taken advantage of their unique data accumulation, such privileged access calls for an open-source… ▽ More Large language models (LLMs) have shown the potential of revolutionizing natural language processing tasks in diverse domains, sparking great interest in finance. Accessing high-quality financial data is the first challenge for financial LLMs (FinLLMs). While proprietary models like BloombergGPT have taken advantage of their unique data accumulation, such privileged access calls for an open-source alternative to democratize Internet-scale financial data. In this paper, we present an open-source large language model, FinGPT, for the finance sector. Unlike proprietary models, FinGPT takes a data-centric approach, providing researchers and practitioners with accessible and transparent resources to develop their FinLLMs. We highlight the importance of an automatic data curation pipeline and the lightweight low-rank adaptation technique in building FinGPT. Furthermore, we showcase several potential applications as stepping stones for users, such as robo-advising, algorithmic trading, and low-code development. Through collaborative efforts within the open-source AI4Finance community, FinGPT aims to stimulate innovation, democratize FinLLMs, and unlock new opportunities in open finance. Two associated code repos are \url{https://github.com/AI4Finance-Foundation/FinGPT} and \url{https://github.com/AI4Finance-Foundation/FinNLP} △ Less

Submitted 9 June, 2023; originally announced June 2023.

arXiv:2302.08002 [pdf, ps, other]

Deep Learning Enhanced Realized GARCH

Authors: Chen Liu, Chao Wang, Minh-Ngoc Tran, Robert Kohn

Abstract: We propose a new approach to volatility modeling by combining deep learning (LSTM) and realized volatility measures. This LSTM-enhanced realized GARCH framework incorporates and distills modeling advances from financial econometrics, high frequency trading data and deep learning. Bayesian inference via the Sequential Monte Carlo method is employed for statistical inference and forecasting. The new… ▽ More We propose a new approach to volatility modeling by combining deep learning (LSTM) and realized volatility measures. This LSTM-enhanced realized GARCH framework incorporates and distills modeling advances from financial econometrics, high frequency trading data and deep learning. Bayesian inference via the Sequential Monte Carlo method is employed for statistical inference and forecasting. The new framework can jointly model the returns and realized volatility measures, has an excellent in-sample fit and superior predictive performance compared to several benchmark models, while being able to adapt well to the stylized facts in volatility. The performance of the new framework is tested using a wide range of metrics, from marginal likelihood, volatility forecasting, to tail risk forecasting and option pricing. We report on a comprehensive empirical study using 31 widely traded stock indices over a time period that includes COVID-19 pandemic. △ Less

Submitted 17 October, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

Comments: 47 pages, 12 tables

arXiv:2211.03107 [pdf, other]

FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning

Authors: Xiao-Yang Liu, Ziyi Xia, Jingyang Rui, Jiechao Gao, Hongyang Yang, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo

Abstract: Finance is a particularly difficult playground for deep reinforcement learning. However, establishing high-quality market environments and benchmarks for financial reinforcement learning is challenging due to three major factors, namely, low signal-to-noise ratio of financial data, survivorship bias of historical data, and model overfitting in the backtesting stage. In this paper, we present an op… ▽ More Finance is a particularly difficult playground for deep reinforcement learning. However, establishing high-quality market environments and benchmarks for financial reinforcement learning is challenging due to three major factors, namely, low signal-to-noise ratio of financial data, survivorship bias of historical data, and model overfitting in the backtesting stage. In this paper, we present an openly accessible FinRL-Meta library that has been actively maintained by the AI4Finance community. First, following a DataOps paradigm, we will provide hundreds of market environments through an automatic pipeline that collects dynamic datasets from real-world markets and processes them into gym-style market environments. Second, we reproduce popular papers as stepping stones for users to design new trading strategies. We also deploy the library on cloud platforms so that users can visualize their own results and assess the relative performance via community-wise competitions. Third, FinRL-Meta provides tens of Jupyter/Python demos organized into a curriculum and a documentation website to serve the rapidly growing community. FinRL-Meta is available at: https://github.com/AI4Finance-Foundation/FinRL-Meta △ Less

Submitted 6 November, 2022; originally announced November 2022.

Comments: NeurIPS 2022 Datasets and Benchmarks. 36th Conference on Neural Information Processing Systems Datasets and Benchmarks Track

arXiv:2209.05559 [pdf, other]

doi 10.48550/arXiv.2209.05559

Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting

Authors: Berend Jelmer Dirk Gort, Xiao-Yang Liu, Xinghang Sun, Jiechao Gao, Shuaiyu Chen, Christina Dan Wang

Abstract: Designing profitable and reliable trading strategies is challenging in the highly volatile cryptocurrency market. Existing works applied deep reinforcement learning methods and optimistically reported increased profits in backtesting, which may suffer from the false positive issue due to overfitting. In this paper, we propose a practical approach to address backtest overfitting for cryptocurrency… ▽ More Designing profitable and reliable trading strategies is challenging in the highly volatile cryptocurrency market. Existing works applied deep reinforcement learning methods and optimistically reported increased profits in backtesting, which may suffer from the false positive issue due to overfitting. In this paper, we propose a practical approach to address backtest overfitting for cryptocurrency trading using deep reinforcement learning. First, we formulate the detection of backtest overfitting as a hypothesis test. Then, we train the DRL agents, estimate the probability of overfitting, and reject the overfitted agents, increasing the chance of good trading performance. Finally, on 10 cryptocurrencies over a testing period from 05/01/2022 to 06/27/2022 (during which the crypto market crashed two times), we show that the less overfitted deep reinforcement learning agents have a higher return than that of more overfitted agents, an equal weight strategy, and the S&P DBM Index (market benchmark), offering confidence in possible deployment to a real market. △ Less

Submitted 31 January, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

MSC Class: 68T07; ACM Class: I.2.6

arXiv:2207.04595 [pdf, other]

A semi-parametric dynamic conditional correlation framework for risk forecasting

Authors: Giuseppe Storti, Chao Wang

Abstract: We develop a novel multivariate semi-parametric framework for joint portfolio Value-at-Risk (VaR) and Expected Shortfall (ES) forecasting. Unlike existing univariate semi-parametric approaches, the proposed framework explicitly models the dependence structure among portfolio asset returns through a dynamic conditional correlation (DCC) parameterization. To estimate the model, a two-step procedure… ▽ More We develop a novel multivariate semi-parametric framework for joint portfolio Value-at-Risk (VaR) and Expected Shortfall (ES) forecasting. Unlike existing univariate semi-parametric approaches, the proposed framework explicitly models the dependence structure among portfolio asset returns through a dynamic conditional correlation (DCC) parameterization. To estimate the model, a two-step procedure based on the minimization of a strictly consistent VaR and ES joint loss function is employed. This procedure allows to simultaneously estimate the DCC parameters and the portfolio risk factors. The performance of the proposed model in risk forecasting on various probability levels is evaluated by means of a forecasting study on the components of the Dow Jones index for an out-of-sample period from December 2016 to September 2021. The empirical results support effectiveness of the proposed framework compared to a variety of existing approaches. △ Less

Submitted 20 December, 2024; v1 submitted 10 July, 2022; originally announced July 2022.

Comments: 43 pages, 6 figures

arXiv:2202.02276 [pdf]

Measuring Systemic Risk: Common Factor Exposures and Tail Dependence Effects

Authors: Wan-Chien Chiu, Juan Ignacio Peña, Chih-Wei Wang

Abstract: We model systemic risk using a common factor that accounts for market-wide shocks and a tail dependence factor that accounts for linkages among extreme stock returns. Specifically, our theoretical model allows for firm-specific impacts of infrequent and extreme events. Using data on the four sectors of the U.S. financial industry from 1996 to 2011, we uncover two key empirical findings. First, dis… ▽ More We model systemic risk using a common factor that accounts for market-wide shocks and a tail dependence factor that accounts for linkages among extreme stock returns. Specifically, our theoretical model allows for firm-specific impacts of infrequent and extreme events. Using data on the four sectors of the U.S. financial industry from 1996 to 2011, we uncover two key empirical findings. First, disregarding the effect of the tail dependence factor leads to a downward bias in the measurement of systemic risk, especially during weak economic times. Second, when these measures serve as leading indicators of the St. Louis Fed Financial Stress Index, measures that include a tail dependence factor offer better forecasting ability than measures based on a common factor only. △ Less

Submitted 4 February, 2022; originally announced February 2022.

arXiv:2202.02263 [pdf]

Industry Characteristics and Financial Risk Spillovers

Authors: Wan-Chien Chiua, Juan Ignacio Peña, Chih-Wei Wang

Abstract: This paper proposes a new measure of tail risk spillover. The empirical application provides evidence of significant volatility and tail risk spillovers from the financial sector to many real economy sectors in the U.S. economy in the period from 2001 to 2011. These spillovers increase in crisis periods. The conditional coexceedance in a given sector is positively related to its amount of debt fin… ▽ More This paper proposes a new measure of tail risk spillover. The empirical application provides evidence of significant volatility and tail risk spillovers from the financial sector to many real economy sectors in the U.S. economy in the period from 2001 to 2011. These spillovers increase in crisis periods. The conditional coexceedance in a given sector is positively related to its amount of debt financing, and negatively related to its relative valuation and investment. Real economy sectors which require substantial external financing, and whose value and investment activity are relatively lower, are prime candidates for depreciation in the wake of crisis in the financial sector. △ Less

Submitted 4 February, 2022; originally announced February 2022.

arXiv:2201.07214 [pdf, other]

doi 10.1073/pnas.2201573119

Opinion Dynamics in Financial Markets via Random Networks

Authors: Mateus F. B. Granha, André L. M. Vilela, Chao Wang, Kenric P. Nelson, H. Eugene Stanley

Abstract: We investigate the financial market dynamics by introducing a heterogeneous agent-based opinion formation model. In this work, we organize the individuals in a financial market by their trading strategy, namely noise traders and fundamentalists. The opinion of a local majority compels the market exchanging behavior of noise traders, whereas the global behavior of the market influences the fundamen… ▽ More We investigate the financial market dynamics by introducing a heterogeneous agent-based opinion formation model. In this work, we organize the individuals in a financial market by their trading strategy, namely noise traders and fundamentalists. The opinion of a local majority compels the market exchanging behavior of noise traders, whereas the global behavior of the market influences the fundamentalist agents' decisions. We introduce a noise parameter $q$ to represent a level of anxiety and perceived uncertainty regarding the market behavior, enabling the possibility for an adrift financial action. We place the individuals as nodes in an Erdös-Rényi random graph, where the links represent their social interaction. At a given time, they assume one of two possible opinion states $\pm 1$ regarding buying or selling an asset. The model exhibits such fundamental qualitative and quantitative real-world market features as the distribution of logarithmic returns with fat-tails, clustered volatility, and long-term correlation of returns. We use Student's t distributions to fit the histograms of logarithmic returns, showing the gradual shift from a leptokurtic to a mesokurtic regime, depending on the fraction of fundamentalist agents. We also compare our results with the distribution of logarithmic returns of several real-world financial indices. △ Less

Submitted 14 January, 2022; originally announced January 2022.

Comments: 23 pages, 12 figures

arXiv:2112.13383 [pdf, ps, other]

Community detection and portfolio optimization

Authors: Longfeng Zhao, Chao Wang, Gang-Jin Wang, H. Eugene Stanley, Lin Chen

Abstract: Community detection methods can be used to explore the structure of complex systems. The well-known modular configurations in complex financial systems indicate the existence of community structures. Here we analyze the community properties of correlation-based networks in worldwide stock markets and use community information to construct portfolios. Portfolios constructed using community detectio… ▽ More Community detection methods can be used to explore the structure of complex systems. The well-known modular configurations in complex financial systems indicate the existence of community structures. Here we analyze the community properties of correlation-based networks in worldwide stock markets and use community information to construct portfolios. Portfolios constructed using community detection methods perform well. Our results can be used as new portfolio optimization and risk management tools. △ Less

Submitted 26 December, 2021; originally announced December 2021.

arXiv:2112.06753 [pdf, other]

FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance

Authors: Xiao-Yang Liu, Jingyang Rui, Jiechao Gao, Liuqing Yang, Hongyang Yang, Zhaoran Wang, Christina Dan Wang, Jian Guo

Abstract: Deep reinforcement learning (DRL) has shown huge potentials in building financial market simulators recently. However, due to the highly complex and dynamic nature of real-world markets, raw historical financial data often involve large noise and may not reflect the future of markets, degrading the fidelity of DRL-based market simulators. Moreover, the accuracy of DRL-based market simulators heavi… ▽ More Deep reinforcement learning (DRL) has shown huge potentials in building financial market simulators recently. However, due to the highly complex and dynamic nature of real-world markets, raw historical financial data often involve large noise and may not reflect the future of markets, degrading the fidelity of DRL-based market simulators. Moreover, the accuracy of DRL-based market simulators heavily relies on numerous and diverse DRL agents, which increases demand for a universe of market environments and imposes a challenge on simulation speed. In this paper, we present a FinRL-Meta framework that builds a universe of market environments for data-driven financial reinforcement learning. First, FinRL-Meta separates financial data processing from the design pipeline of DRL-based strategy and provides open-source data engineering tools for financial big data. Second, FinRL-Meta provides hundreds of market environments for various trading tasks. Third, FinRL-Meta enables multiprocessing simulation and training by exploiting thousands of GPU cores. Our codes are available online at https://github.com/AI4Finance-Foundation/FinRL-Meta. △ Less

Submitted 2 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: Workshop on Data Centric AI, 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

arXiv:2112.02365 [pdf, other]

TransBoost: A Boosting-Tree Kernel Transfer Learning Algorithm for Improving Financial Inclusion

Authors: Yiheng Sun, Tian Lu, Cong Wang, Yuan Li, Huaiyu Fu, Jingran Dong, Yunjie Xu

Abstract: The prosperity of mobile and financial technologies has bred and expanded various kinds of financial products to a broader scope of people, which contributes to advocating financial inclusion. It has non-trivial social benefits of diminishing financial inequality. However, the technical challenges in individual financial risk evaluation caused by the distinct characteristic distribution and limite… ▽ More The prosperity of mobile and financial technologies has bred and expanded various kinds of financial products to a broader scope of people, which contributes to advocating financial inclusion. It has non-trivial social benefits of diminishing financial inequality. However, the technical challenges in individual financial risk evaluation caused by the distinct characteristic distribution and limited credit history of new users, as well as the inexperience of newly-entered companies in handling complex data and obtaining accurate labels, impede further promoting financial inclusion. To tackle these challenges, this paper develops a novel transfer learning algorithm (i.e., TransBoost) that combines the merits of tree-based models and kernel methods. The TransBoost is designed with a parallel tree structure and efficient weights updating mechanism with theoretical guarantee, which enables it to excel in tackling real-world data with high dimensional features and sparsity in $O(n)$ time complexity. We conduct extensive experiments on two public datasets and a unique large-scale dataset from Tencent Mobile Payment. The results show that the TransBoost outperforms other state-of-the-art benchmark transfer learning algorithms in terms of prediction accuracy with superior efficiency, shows stronger robustness to data sparsity, and provides meaningful model interpretation. Besides, given a financial risk level, the TransBoost enables financial service providers to serve the largest number of users including those who would otherwise be excluded by other algorithms. That is, the TransBoost improves financial inclusion. △ Less

Submitted 15 December, 2021; v1 submitted 4 December, 2021; originally announced December 2021.

Comments: Accepted at AAAI-22

arXiv:2111.11286 [pdf]

Portfolio optimization with idiosyncratic and systemic risks for financial networks

Authors: Yajie Yang, Longfeng Zhao, Lin Chen, Chao Wang, Jihui Han

Abstract: In this study, we propose a new multi-objective portfolio optimization with idiosyncratic and systemic risks for financial networks. The two risks are measured by the idiosyncratic variance and the network clustering coefficient derived from the asset correlation networks, respectively. We construct three types of financial networks in which nodes indicate assets and edges are based on three corre… ▽ More In this study, we propose a new multi-objective portfolio optimization with idiosyncratic and systemic risks for financial networks. The two risks are measured by the idiosyncratic variance and the network clustering coefficient derived from the asset correlation networks, respectively. We construct three types of financial networks in which nodes indicate assets and edges are based on three correlation measures. Starting from the multi-objective model, we formulate and solve the asset allocation problem. We find that the optimal portfolios obtained through the multi-objective with networked approach have a significant over-performance in terms of return measures in an out-of-sample framework. This is further supported by the less drawdown during the periods of the stock market fluctuating downward. According to analyzing different datasets, we also show that improvements made to portfolio strategies are robust. △ Less

Submitted 22 November, 2021; originally announced November 2021.

arXiv:2111.09395 [pdf, other]

doi 10.1145/3490354.3494366

FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance

Authors: Xiao-Yang Liu, Hongyang Yang, Jiechao Gao, Christina Dan Wang

Abstract: Deep reinforcement learning (DRL) has been envisioned to have a competitive edge in quantitative finance. However, there is a steep development curve for quantitative traders to obtain an agent that automatically positions to win in the market, namely \textit{to decide where to trade, at what price} and \textit{what quantity}, due to the error-prone programming and arduous debugging. In this paper… ▽ More Deep reinforcement learning (DRL) has been envisioned to have a competitive edge in quantitative finance. However, there is a steep development curve for quantitative traders to obtain an agent that automatically positions to win in the market, namely \textit{to decide where to trade, at what price} and \textit{what quantity}, due to the error-prone programming and arduous debugging. In this paper, we present the first open-source framework \textit{FinRL} as a full pipeline to help quantitative traders overcome the steep learning curve. FinRL is featured with simplicity, applicability and extensibility under the key principles, \textit{full-stack framework, customization, reproducibility} and \textit{hands-on tutoring}. Embodied as a three-layer architecture with modular structures, FinRL implements fine-tuned state-of-the-art DRL algorithms and common reward functions, while alleviating the debugging workloads. Thus, we help users pipeline the strategy design at a high turnover rate. At multiple levels of time granularity, FinRL simulates various markets as training environments using historical data and live trading APIs. Being highly extensible, FinRL reserves a set of user-import interfaces and incorporates trading constraints such as market friction, market liquidity and investor's risk-aversion. Moreover, serving as practitioners' stepping stones, typical trading tasks are provided as step-by-step tutorials, e.g., stock trading, portfolio allocation, cryptocurrency trading, etc. △ Less

Submitted 6 November, 2021; originally announced November 2021.

Comments: ACM International Conference on AI in Finance

Journal ref: ACM International Conference on AI in Finance, 2021

arXiv:2106.00288 [pdf, ps, other]

A Bayesian realized threshold measurement GARCH framework for financial tail risk forecasting

Authors: Chao Wang, Richard Gerlach

Abstract: This paper proposes an innovative threshold measurement equation to be employed in a Realized-GARCH framework. The proposed framework incorporates a nonlinear threshold regression specification to consider the leverage effect and model the contemporaneous dependence between the observed realized measure and hidden volatility. A Bayesian Markov Chain Monte Carlo method is adapted and employed for m… ▽ More This paper proposes an innovative threshold measurement equation to be employed in a Realized-GARCH framework. The proposed framework incorporates a nonlinear threshold regression specification to consider the leverage effect and model the contemporaneous dependence between the observed realized measure and hidden volatility. A Bayesian Markov Chain Monte Carlo method is adapted and employed for model estimation, with its validity assessed via a simulation study. The validity of incorporating the proposed measurement equation in Realized-GARCH type models is evaluated via an empirical study, forecasting the 1% and 2.5% Value-at-Risk and Expected Shortfall on six market indices with two different out-of-sample sizes. The proposed framework is shown to be capable of producing competitive tail risk forecasting results in comparison to the GARCH and Realized-GARCH type models. △ Less

Submitted 30 October, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

Comments: 28 pages, 6 Tables, 4 Figures

arXiv:2104.04918 [pdf, ps, other]

Modelling uncertainty in financial tail risk: a forecast combination and weighted quantile approach

Authors: Giuseppe Storti, Chao Wang

Abstract: A novel forecast combination and weighted quantile based tail-risk forecasting framework is proposed, aiming to reduce the impact of modelling uncertainty in tail-risk forecasting. The proposed approach is based on a two-step estimation procedure. The first step involves the combination of Value-at-Risk (VaR) forecasts at a grid of quantile levels. A range of parametric and semi-parametric models… ▽ More A novel forecast combination and weighted quantile based tail-risk forecasting framework is proposed, aiming to reduce the impact of modelling uncertainty in tail-risk forecasting. The proposed approach is based on a two-step estimation procedure. The first step involves the combination of Value-at-Risk (VaR) forecasts at a grid of quantile levels. A range of parametric and semi-parametric models is selected as the model universe in the forecast combination procedure. The quantile forecast combination weights are estimated by optimizing the quantile loss. In the second step, the Expected Shortfall (ES) is computed as a weighted average of combined quantiles. The quantiles weighting structure for ES forecasting is determined by minimizing a strictly consistent joint VaR and ES loss function of the Fissler-Ziegel class. The proposed framework is applied to six stock market indices and its forecasting performance is compared to each individual model in the universe, a simple average approach and a weighted quantile approach. The forecasting results support the proposed framework. △ Less

Submitted 18 July, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

Comments: 32 pages, 3 figures, 5 tables. arXiv admin note: text overlap with arXiv:2005.04868

arXiv:2011.09607 [pdf, other]

FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance

Authors: Xiao-Yang Liu, Hongyang Yang, Qian Chen, Runjia Zhang, Liuqing Yang, Bowen Xiao, Christina Dan Wang

Abstract: As deep reinforcement learning (DRL) has been recognized as an effective approach in quantitative finance, getting hands-on experiences is attractive to beginners. However, to train a practical DRL trading agent that decides where to trade, at what price, and what quantity involves error-prone and arduous development and debugging. In this paper, we introduce a DRL library FinRL that facilitates b… ▽ More As deep reinforcement learning (DRL) has been recognized as an effective approach in quantitative finance, getting hands-on experiences is attractive to beginners. However, to train a practical DRL trading agent that decides where to trade, at what price, and what quantity involves error-prone and arduous development and debugging. In this paper, we introduce a DRL library FinRL that facilitates beginners to expose themselves to quantitative finance and to develop their own stock trading strategies. Along with easily-reproducible tutorials, FinRL library allows users to streamline their own developments and to compare with existing schemes easily. Within FinRL, virtual environments are configured with stock market datasets, trading agents are trained with neural networks, and extensive backtesting is analyzed via trading performance. Moreover, it incorporates important trading constraints such as transaction cost, market liquidity and the investor's degree of risk-aversion. FinRL is featured with completeness, hands-on tutorial and reproducibility that favors beginners: (i) at multiple levels of time granularity, FinRL simulates trading environments across various stock markets, including NASDAQ-100, DJIA, S&P 500, HSI, SSE 50, and CSI 300; (ii) organized in a layered architecture with modular structure, FinRL provides fine-tuned state-of-the-art DRL algorithms (DQN, DDPG, PPO, SAC, A2C, TD3, etc.), commonly-used reward functions and standard evaluation baselines to alleviate the debugging workloads and promote the reproducibility, and (iii) being highly extendable, FinRL reserves a complete set of user-import interfaces. Furthermore, we incorporated three application demonstrations, namely single stock trading, multiple stock trading, and portfolio allocation. The FinRL library will be available on Github at link https://github.com/AI4Finance-LLC/FinRL-Library. △ Less

Submitted 2 March, 2022; v1 submitted 18 November, 2020; originally announced November 2020.

Comments: Deep Reinforcement Learning Workshop, 34th Conference on Neural Information Processing Systems (NeurIPS2020), Vancouver, Canada

arXiv:2009.01317 [pdf, other]

Towards Earnings Call and Stock Price Movement

Authors: Zhiqiang Ma, Grace Bang, Chong Wang, Xiaomo Liu

Abstract: Earnings calls are hosted by management of public companies to discuss the company's financial performance with analysts and investors. Information disclosed during an earnings call is an essential source of data for analysts and investors to make investment decisions. Thus, we leverage earnings call transcripts to predict future stock price dynamics. We propose to model the language in transcript… ▽ More Earnings calls are hosted by management of public companies to discuss the company's financial performance with analysts and investors. Information disclosed during an earnings call is an essential source of data for analysts and investors to make investment decisions. Thus, we leverage earnings call transcripts to predict future stock price dynamics. We propose to model the language in transcripts using a deep learning framework, where an attention mechanism is applied to encode the text data into vectors for the discriminative network classifier to predict stock price movements. Our empirical experiments show that the proposed model is superior to the traditional machine learning baselines and earnings call information can boost the stock price prediction performance. △ Less

Submitted 23 August, 2020; originally announced September 2020.

Comments: Accepted by KDD 2020 MLF workshop

arXiv:2008.05147 [pdf, other]

Tail risk forecasting using Bayesian realized EGARCH models

Authors: Vica Tendenan, Richard Gerlach, Chao Wang

Abstract: This paper develops a Bayesian framework for the realized exponential generalized autoregressive conditional heteroskedasticity (realized EGARCH) model, which can incorporate multiple realized volatility measures for the modelling of a return series. The realized EGARCH model is extended by adopting a standardized Student-t and a standardized skewed Student-t distribution for the return equation.… ▽ More This paper develops a Bayesian framework for the realized exponential generalized autoregressive conditional heteroskedasticity (realized EGARCH) model, which can incorporate multiple realized volatility measures for the modelling of a return series. The realized EGARCH model is extended by adopting a standardized Student-t and a standardized skewed Student-t distribution for the return equation. Different types of realized measures, such as sub-sampled realized variance, sub-sampled realized range, and realized kernel, are considered in the paper. The Bayesian Markov chain Monte Carlo (MCMC) estimation employs the robust adaptive Metropolis algorithm (RAM) in the burn in period and the standard random walk Metropolis in the sample period. The Bayesian estimators show more favourable results than maximum likelihood estimators in a simulation study. We test the proposed models with several indices to forecast one-step-ahead Value at Risk (VaR) and Expected Shortfall (ES) over a period of 1000 days. Rigorous tail risk forecast evaluations show that the realized EGARCH models employing the standardized skewed Student-t distribution and incorporating sub-sampled realized range are favored, compared to a range of models. △ Less

Submitted 24 August, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

arXiv:2006.05574 [pdf, other]

doi 10.1145/3383455.3422570

Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation

Authors: Michaël Karpe, Jin Fang, Zhongyao Ma, Chen Wang

Abstract: Optimal order execution is widely studied by industry practitioners and academic researchers because it determines the profitability of investment decisions and high-level trading strategies, particularly those involving large volumes of orders. However, complex and unknown market dynamics pose significant challenges for the development and validation of optimal execution strategies. In this paper… ▽ More Optimal order execution is widely studied by industry practitioners and academic researchers because it determines the profitability of investment decisions and high-level trading strategies, particularly those involving large volumes of orders. However, complex and unknown market dynamics pose significant challenges for the development and validation of optimal execution strategies. In this paper, we propose a model-free approach by training Reinforcement Learning (RL) agents in a realistic market simulation environment with multiple agents. First, we configure a multi-agent historical order book simulation environment for execution tasks built on an Agent-Based Interactive Discrete Event Simulation (ABIDES) [arXiv:1904.12066]. Second, we formulate the problem of optimal execution in an RL setting where an intelligent agent can make order execution and placement decisions based on market microstructure trading signals in High Frequency Trading (HFT). Third, we develop and train an RL execution agent using the Double Deep Q-Learning (DDQL) algorithm in the ABIDES environment. In some scenarios, our RL agent converges towards a Time-Weighted Average Price (TWAP) strategy. Finally, we evaluate the simulation with our RL agent by comparing it with a market replay simulation using real market Limit Order Book (LOB) data. △ Less

Submitted 11 September, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

Comments: 7 pages, accepted for inclusion in the 2020 ACM International Conference on AI in Finance (ICAIF 2020)

arXiv:2005.04868 [pdf, other]

Nonparametric Expected Shortfall Forecasting Incorporating Weighted Quantiles

Authors: Giuseppe Storti, Chao Wang

Abstract: A new semi-parametric Expected Shortfall (ES) estimation and forecasting framework is proposed. The proposed approach is based on a two-step estimation procedure. The first step involves the estimation of Value-at-Risk (VaR) at different quantile levels through a set of quantile time series regressions. Then, the ES is computed as a weighted average of the estimated quantiles. The quantiles weight… ▽ More A new semi-parametric Expected Shortfall (ES) estimation and forecasting framework is proposed. The proposed approach is based on a two-step estimation procedure. The first step involves the estimation of Value-at-Risk (VaR) at different quantile levels through a set of quantile time series regressions. Then, the ES is computed as a weighted average of the estimated quantiles. The quantiles weighting structure is parsimoniously parameterized by means of a Beta weight function whose coefficients are optimized by minimizing a joint VaR and ES loss function of the Fissler-Ziegel class. The properties of the proposed approach are first evaluated with an extensive simulation study using two data generating processes. Two forecasting studies with different out-of-sample sizes are then conducted, one of which focuses on the 2008 Global Financial Crisis (GFC) period. The proposed models are applied to 7 stock market indices and their forecasting performances are compared to those of a range of parametric, non-parametric and semi-parametric models, including GARCH, Conditional AutoRegressive Expectile (CARE), joint VaR and ES quantile regression models and simple average of quantiles. The results of the forecasting experiments provide clear evidence in support of proposed models. △ Less

Submitted 15 March, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

Comments: 38 pages, 2 figures and 6 tables

arXiv:2001.08374 [pdf, ps, other]

A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting

Authors: Zhengkun Li, Minh-Ngoc Tran, Chao Wang, Richard Gerlach, Junbin Gao

Abstract: Value-at-Risk (VaR) and Expected Shortfall (ES) are widely used in the financial sector to measure the market risk and manage the extreme market movement. The recent link between the quantile score function and the Asymmetric Laplace density has led to a flexible likelihood-based framework for joint modelling of VaR and ES. It is of high interest in financial applications to be able to capture the… ▽ More Value-at-Risk (VaR) and Expected Shortfall (ES) are widely used in the financial sector to measure the market risk and manage the extreme market movement. The recent link between the quantile score function and the Asymmetric Laplace density has led to a flexible likelihood-based framework for joint modelling of VaR and ES. It is of high interest in financial applications to be able to capture the underlying joint dynamics of these two quantities. We address this problem by developing a hybrid model that is based on the Asymmetric Laplace quasi-likelihood and employs the Long Short-Term Memory (LSTM) time series modelling technique from Machine Learning to capture efficiently the underlying dynamics of VaR and ES. We refer to this model as LSTM-AL. We adopt the adaptive Markov chain Monte Carlo (MCMC) algorithm for Bayesian inference in the LSTM-AL model. Empirical results show that the proposed LSTM-AL model can improve the VaR and ES forecasting accuracy over a range of well-established competing models. △ Less

Submitted 12 May, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

arXiv:1908.08036 [pdf, other]

Deep Reinforcement Learning for Foreign Exchange Trading

Authors: Yun-Cheng Tsai, Chun-Chieh Wang

Abstract: Reinforcement learning can interact with the environment and is suitable for applications in decision control systems. Therefore, we used the reinforcement learning method to establish a foreign exchange transaction, avoiding the long-standing problem of unstable trends in deep learning predictions. In the system design, we optimized the Sure-Fire statistical arbitrage policy, set three different… ▽ More Reinforcement learning can interact with the environment and is suitable for applications in decision control systems. Therefore, we used the reinforcement learning method to establish a foreign exchange transaction, avoiding the long-standing problem of unstable trends in deep learning predictions. In the system design, we optimized the Sure-Fire statistical arbitrage policy, set three different actions, encoded the continuous price over a period of time into a heat-map view of the Gramian Angular Field (GAF) and compared the Deep Q Learning (DQN) and Proximal Policy Optimization (PPO) algorithms. To test feasibility, we analyzed three currency pairs, namely EUR/USD, GBP/USD, and AUD/USD. We trained the data in units of four hours from 1 August 2018 to 30 November 2018 and tested model performance using data between 1 December 2018 and 31 December 2018. The test results of the various models indicated that favorable investment performance was achieved as long as the model was able to handle complex and random processes and the state was able to describe the environment, validating the feasibility of reinforcement learning in the development of trading strategies. △ Less

Submitted 3 June, 2020; v1 submitted 20 August, 2019; originally announced August 2019.

arXiv:1908.01112 [pdf, other]

Risk Management via Anomaly Circumvent: Mnemonic Deep Learning for Midterm Stock Prediction

Authors: Xinyi Li, Yinchuan Li, Xiao-Yang Liu, Christina Dan Wang

Abstract: Midterm stock price prediction is crucial for value investments in the stock market. However, most deep learning models are essentially short-term and applying them to midterm predictions encounters large cumulative errors because they cannot avoid anomalies. In this paper, we propose a novel deep neural network Mid-LSTM for midterm stock prediction, which incorporates the market trend as hidden s… ▽ More Midterm stock price prediction is crucial for value investments in the stock market. However, most deep learning models are essentially short-term and applying them to midterm predictions encounters large cumulative errors because they cannot avoid anomalies. In this paper, we propose a novel deep neural network Mid-LSTM for midterm stock prediction, which incorporates the market trend as hidden states. First, based on the autoregressive moving average model (ARMA), a midterm ARMA is formulated by taking into consideration both hidden states and the capital asset pricing model. Then, a midterm LSTM-based deep neural network is designed, which consists of three components: LSTM, hidden Markov model and linear regression networks. The proposed Mid-LSTM can avoid anomalies to reduce large prediction errors, and has good explanatory effects on the factors affecting stock prices. Extensive experiments on S&P 500 stocks show that (i) the proposed Mid-LSTM achieves 2-4% improvement in prediction accuracy, and (ii) in portfolio allocation investment, we achieve up to 120.16% annual return and 2.99 average Sharpe ratio. △ Less

Submitted 2 August, 2019; originally announced August 2019.

arXiv:1906.09961 [pdf, ps, other]

Semi-parametric Realized Nonlinear Conditional Autoregressive Expectile and Expected Shortfall

Authors: Chao Wang, Richard Gerlach

Abstract: A joint conditional autoregressive expectile and Expected Shortfall framework is proposed. The framework is extended through incorporating a measurement equation which models the contemporaneous dependence between the realized measures and the latent conditional expectile. Nonlinear threshold specification is further incorporated into the proposed framework. A Bayesian Markov Chain Monte Carlo met… ▽ More A joint conditional autoregressive expectile and Expected Shortfall framework is proposed. The framework is extended through incorporating a measurement equation which models the contemporaneous dependence between the realized measures and the latent conditional expectile. Nonlinear threshold specification is further incorporated into the proposed framework. A Bayesian Markov Chain Monte Carlo method is adapted for estimation, whose properties are assessed and compared with maximum likelihood via a simulation study. One-day-ahead VaR and ES forecasting studies, with seven market indices, provide empirical support to the proposed models. △ Less

Submitted 20 June, 2019; originally announced June 2019.

Comments: 41 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:1805.08653, arXiv:1807.02422, arXiv:1612.08488

arXiv:1905.04370 [pdf, ps, other]

A Three-state Opinion Formation Model for Financial Markets

Authors: Bernardo J. Zubillaga, André L. M. Vilela, Chao Wang, Kenric P. Nelson, H. Eugene Stanley

Abstract: We propose a three-state microscopic opinion formation model for the purpose of simulating the dynamics of financial markets. In order to mimic the heterogeneous composition of the mass of investors in a market, the agent-based model considers two different types of traders: noise traders and contrarians. Agents are represented as nodes in a network of interactions and they can assume any of three… ▽ More We propose a three-state microscopic opinion formation model for the purpose of simulating the dynamics of financial markets. In order to mimic the heterogeneous composition of the mass of investors in a market, the agent-based model considers two different types of traders: noise traders and contrarians. Agents are represented as nodes in a network of interactions and they can assume any of three distinct possible states (e.g. buy, sell or remain inactive). The time evolution of the state of an agent is dictated by probabilistic dynamics that include both local and global influences. A noise trader is subject to local interactions, tending to assume the majority state of its nearest neighbors, whilst a contrarian is subject to a global interaction with the behavior of the market as a whole, tending to assume the state of the global minority of the market. The model exhibits the typical qualitative and quantitative features of real financial time series, including distributions of returns with heavy tails, volatility clustering and long-time memory for the absolute values of the returns. The distributions of returns are fitted by means of coupled Gaussian distributions, quantitatively revealing transitions between leptokurtic, mesokurtic and platykurtic regimes in terms of a non-linear statistical coupling which describes the complexity of the system. △ Less

Submitted 10 May, 2019; originally announced May 2019.

Comments: 10 pages, 11 figures, regular paper

MSC Class: 82Cxx; 82-08;

arXiv:1807.02422 [pdf, ps, other]

A Semi-parametric Realized Joint Value-at-Risk and Expected Shortfall Regression Framework

Authors: Chao Wang, Richard Gerlach, Qian Chen

Abstract: A new realized conditional autoregressive Value-at-Risk (VaR) framework is proposed, through incorporating a measurement equation into the original quantile regression model. The framework is further extended by employing various Expected Shortfall (ES) components, to jointly estimate and forecast VaR and ES. The measurement equation models the contemporaneous dependence between the realized measu… ▽ More A new realized conditional autoregressive Value-at-Risk (VaR) framework is proposed, through incorporating a measurement equation into the original quantile regression model. The framework is further extended by employing various Expected Shortfall (ES) components, to jointly estimate and forecast VaR and ES. The measurement equation models the contemporaneous dependence between the realized measure (i.e., Realized Variance and Realized Range) and the latent conditional ES. An adaptive Bayesian Markov Chain Monte Carlo method is employed for estimation and forecasting, the properties of which are assessed and compared with maximum likelihood through a simulation study. In a comprehensive forecasting study on 1% and 2.5 % quantile levels, the proposed models are compared to a range of parametric, non-parametric and semi-parametric models, based on 7 market indices and 7 individual assets. One-day-ahead VaR and ES forecasting results favor the proposed models, especially when incorporating the sub-sampled Realized Variance and the sub-sampled Realized Range in the model. △ Less

Submitted 15 January, 2021; v1 submitted 5 July, 2018; originally announced July 2018.

Comments: 45 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1805.08653, arXiv:1612.08488, arXiv:1707.03715

Showing 1–50 of 56 results for author: Wang, C