-
A Method for Evaluating the Interpretability of Machine Learning Models in Predicting Bond Default Risk Based on LIME and SHAP
Authors:
Yan Zhang,
Lin Chen,
Yixiang Tian
Abstract:
Interpretability analysis methods for artificial intelligence models, such as LIME and SHAP, are widely used, though they primarily serve as post-model for analyzing model outputs. While it is commonly believed that the transparency and interpretability of AI models diminish as their complexity increases, currently there is no standardized method for assessing the inherent interpretability of the…
▽ More
Interpretability analysis methods for artificial intelligence models, such as LIME and SHAP, are widely used, though they primarily serve as post-model for analyzing model outputs. While it is commonly believed that the transparency and interpretability of AI models diminish as their complexity increases, currently there is no standardized method for assessing the inherent interpretability of the models themselves. This paper uses bond market default prediction as a case study, applying commonly used machine learning algorithms within AI models. First, the classification performance of these algorithms in default prediction is evaluated. Then, leveraging LIME and SHAP to assess the contribution of sample features to prediction outcomes, the paper proposes a novel method for evaluating the interpretability of the models themselves. The results of this analysis are consistent with the intuitive understanding and logical expectations regarding the interpretability of these models.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
Risk Management with Feature-Enriched Generative Adversarial Networks (FE-GAN)
Authors:
Ling Chen
Abstract:
This paper investigates the application of Feature-Enriched Generative Adversarial Networks (FE-GAN) in financial risk management, with a focus on improving the estimation of Value at Risk (VaR) and Expected Shortfall (ES). FE-GAN enhances existing GANs architectures by incorporating an additional input sequence derived from preceding data to improve model performance. Two specialized GANs models,…
▽ More
This paper investigates the application of Feature-Enriched Generative Adversarial Networks (FE-GAN) in financial risk management, with a focus on improving the estimation of Value at Risk (VaR) and Expected Shortfall (ES). FE-GAN enhances existing GANs architectures by incorporating an additional input sequence derived from preceding data to improve model performance. Two specialized GANs models, the Wasserstein Generative Adversarial Network (WGAN) and the Tail Generative Adversarial Network (Tail-GAN), were evaluated under the FE-GAN framework. The results demonstrate that FE-GAN significantly outperforms traditional architectures in both VaR and ES estimation. Tail-GAN, leveraging its task-specific loss function, consistently outperforms WGAN in ES estimation, while both models exhibit similar performance in VaR estimation. Despite these promising results, the study acknowledges limitations, including reliance on highly correlated temporal data and restricted applicability to other domains. Future research directions include exploring alternative input generation methods, dynamic forecasting models, and advanced neural network architectures to further enhance GANs-based financial risk estimation.
△ Less
Submitted 23 November, 2024;
originally announced November 2024.
-
Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment
Authors:
Shuoling Liu,
Gaoguo Jia,
Yuhang Jiang,
Liyuan Chen,
Qiang Yang
Abstract:
Large Language Models (LLMs) have achieved remarkable success recently, displaying exceptional capabilities in creating understandable and organized text. These LLMs have been utilized in diverse fields, such as clinical research, where domain-specific models like Med-Palm have achieved human-level performance. Recently, researchers have employed advanced prompt engineering to enhance the general…
▽ More
Large Language Models (LLMs) have achieved remarkable success recently, displaying exceptional capabilities in creating understandable and organized text. These LLMs have been utilized in diverse fields, such as clinical research, where domain-specific models like Med-Palm have achieved human-level performance. Recently, researchers have employed advanced prompt engineering to enhance the general reasoning ability of LLMs. Despite the remarkable success of zero-shot Chain-of-Thoughts (CoT) in solving general reasoning tasks, the potential of these methods still remains paid limited attention in the financial reasoning task.To address this issue, we explore multiple prompt strategies and incorporated semantic news information to improve LLMs' performance on financial reasoning tasks.To the best of our knowledge, we are the first to explore this important issue by applying ChatGPT to the gold investment.In this work, our aim is to investigate the financial reasoning capabilities of LLMs and their capacity to generate logical and persuasive investment opinions. We will use ChatGPT, one of the most powerful LLMs recently, and prompt engineering to achieve this goal. Our research will focus on understanding the ability of LLMs in sophisticated analysis and reasoning within the context of investment decision-making. Our study finds that ChatGPT with CoT prompt can provide more explainable predictions and overcome behavioral biases, which is crucial in finance-related tasks and can achieve higher investment returns.
△ Less
Submitted 15 January, 2025; v1 submitted 19 November, 2024;
originally announced November 2024.
-
The Impact of Implicit Government Guarantee on Credit Rating of Municipal Investment Bonds
Authors:
Yan Zhang,
Yixiang Tian,
Lin Chen
Abstract:
One type of bond with the most implicit government guarantee is municipal investment bonds. In recent years, there have been an increasing number of downgrades in the credit ratings of municipal bonds, which has led some people to question whether the implicit government guarantee may affect the objectivity of the bond ratings? This paper uses text mining methods to mine relevant policy documents…
▽ More
One type of bond with the most implicit government guarantee is municipal investment bonds. In recent years, there have been an increasing number of downgrades in the credit ratings of municipal bonds, which has led some people to question whether the implicit government guarantee may affect the objectivity of the bond ratings? This paper uses text mining methods to mine relevant policy documents related to municipal investment bond issuance, and calculates the implicit guarantee strength of municipal investment bonds based on the PMC index model. It further analyzes the impact of the implicit guarantee strength of municipal bonds on their credit evaluation. The study found that the implicit government guarantee on municipal investment bonds does indeed help to raise the credit ratings assigned by credit rating agencies. The study found that, moreover, the government's implicit guarantee has a more pronounced effect in boosting credit ratings in less developed western regions.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
Implicit Government Guarantee Measurement Based on PMC Index Model
Authors:
Yan Zhang,
Yixiang Tian,
Lin Chen,
Qi Wang
Abstract:
The implicit government guarantee hampers the recognition and management of risks by all stakeholders in the bond market, and it has led to excessive debt for local governments or state-owned enterprises. To prevent the risk of local government debt defaults and reduce investors' expectations of implicit government guarantees, various regulatory departments have issued a series of policy documents…
▽ More
The implicit government guarantee hampers the recognition and management of risks by all stakeholders in the bond market, and it has led to excessive debt for local governments or state-owned enterprises. To prevent the risk of local government debt defaults and reduce investors' expectations of implicit government guarantees, various regulatory departments have issued a series of policy documents related to municipal investment bonds. By employing text mining techniques on policy documents related to municipal investment bond, and utilizing the PMC index model to assess the effectiveness of policy documents. This paper proposes a novel method for quantifying the intensity of implicit governmental guarantees based on PMC index model. The intensity of implicit governmental guarantees is inversely correlated with the PMC index of policies aimed at de-implicitizing governmental guarantees. Then as these policies become more effective, the intensity of implicit governmental guarantees diminishes correspondingly. These findings indicate that recent policies related to municipal investment bond have indeed succeeded in reducing implicit governmental guarantee intensity, and these policies have achieved the goal of risk management. Furthermore, it was showed that the intensity of implicit governmental guarantee affected by diverse aspects of these policies such as effectiveness, clarity, and specificity, as well as incentive and assurance mechanisms.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
Automate Strategy Finding with LLM in Quant Investment
Authors:
Zhizhuo Kou,
Holam Yu,
Junyu Luo,
Jingshu Peng,
Xujia Li,
Chengzhong Liu,
Juntao Dai,
Lei Chen,
Sirui Han,
Yike Guo
Abstract:
We present a novel three-stage framework leveraging Large Language Models (LLMs) within a risk-aware multi-agent system for automate strategy finding in quantitative finance. Our approach addresses the brittleness of traditional deep learning models in financial applications by: employing prompt-engineered LLMs to generate executable alpha factor candidates across diverse financial data, implement…
▽ More
We present a novel three-stage framework leveraging Large Language Models (LLMs) within a risk-aware multi-agent system for automate strategy finding in quantitative finance. Our approach addresses the brittleness of traditional deep learning models in financial applications by: employing prompt-engineered LLMs to generate executable alpha factor candidates across diverse financial data, implementing multimodal agent-based evaluation that filters factors based on market status, predictive quality while maintaining category balance, and deploying dynamic weight optimization that adapts to market conditions. Experimental results demonstrate the robust performance of the strategy in Chinese & US market regimes compared to established benchmarks. Our work extends LLMs capabilities to quantitative trading, providing a scalable architecture for financial signal extraction and portfolio construction. The overall framework significantly outperforms all benchmarks with 53.17% cumulative return on SSE50 (Jan 2023 to Jan 2024), demonstrating superior risk-adjusted performance and downside protection on the market.
△ Less
Submitted 21 May, 2025; v1 submitted 10 September, 2024;
originally announced September 2024.
-
Methods for Acquiring and Incorporating Knowledge into Stock Price Prediction: A Survey
Authors:
Liping Wang,
Jiawei Li,
Lifan Zhao,
Zhizhuo Kou,
Xiaohan Wang,
Xinyi Zhu,
Hao Wang,
Yanyan Shen,
Lei Chen
Abstract:
Predicting stock prices presents a challenging research problem due to the inherent volatility and non-linear nature of the stock market. In recent years, knowledge-enhanced stock price prediction methods have shown groundbreaking results by utilizing external knowledge to understand the stock market. Despite the importance of these methods, there is a scarcity of scholarly works that systematical…
▽ More
Predicting stock prices presents a challenging research problem due to the inherent volatility and non-linear nature of the stock market. In recent years, knowledge-enhanced stock price prediction methods have shown groundbreaking results by utilizing external knowledge to understand the stock market. Despite the importance of these methods, there is a scarcity of scholarly works that systematically synthesize previous studies from the perspective of external knowledge types. Specifically, the external knowledge can be modeled in different data structures, which we group into non-graph-based formats and graph-based formats: 1) non-graph-based knowledge captures contextual information and multimedia descriptions specifically associated with an individual stock; 2) graph-based knowledge captures interconnected and interdependent information in the stock market. This survey paper aims to provide a systematic and comprehensive description of methods for acquiring external knowledge from various unstructured data sources and then incorporating it into stock price prediction models. We also explore fusion methods for combining external knowledge with historical price features. Moreover, this paper includes a compilation of relevant datasets and delves into potential future research directions in this domain.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Naive Markowitz Policies
Authors:
Lin Chen,
Xun Yu Zhou
Abstract:
We study a continuous-time Markowitz mean-variance portfolio selection model in which a naive agent, unaware of the underlying time-inconsistency, continuously reoptimizes over time. We define the resulting naive policies through the limit of discretely naive policies that are committed only in very small time intervals, and derive them analytically and explicitly. We compare naive policies with p…
▽ More
We study a continuous-time Markowitz mean-variance portfolio selection model in which a naive agent, unaware of the underlying time-inconsistency, continuously reoptimizes over time. We define the resulting naive policies through the limit of discretely naive policies that are committed only in very small time intervals, and derive them analytically and explicitly. We compare naive policies with pre-committed optimal policies and with consistent planners' equilibrium policies in a Black-Scholes market, and find that the former are mean-variance inefficient starting from any given time and wealth, and always take riskier exposure than equilibrium policies.
△ Less
Submitted 14 December, 2022;
originally announced December 2022.
-
Community detection and portfolio optimization
Authors:
Longfeng Zhao,
Chao Wang,
Gang-Jin Wang,
H. Eugene Stanley,
Lin Chen
Abstract:
Community detection methods can be used to explore the structure of complex systems. The well-known modular configurations in complex financial systems indicate the existence of community structures. Here we analyze the community properties of correlation-based networks in worldwide stock markets and use community information to construct portfolios. Portfolios constructed using community detectio…
▽ More
Community detection methods can be used to explore the structure of complex systems. The well-known modular configurations in complex financial systems indicate the existence of community structures. Here we analyze the community properties of correlation-based networks in worldwide stock markets and use community information to construct portfolios. Portfolios constructed using community detection methods perform well. Our results can be used as new portfolio optimization and risk management tools.
△ Less
Submitted 26 December, 2021;
originally announced December 2021.
-
Portfolio optimization with idiosyncratic and systemic risks for financial networks
Authors:
Yajie Yang,
Longfeng Zhao,
Lin Chen,
Chao Wang,
Jihui Han
Abstract:
In this study, we propose a new multi-objective portfolio optimization with idiosyncratic and systemic risks for financial networks. The two risks are measured by the idiosyncratic variance and the network clustering coefficient derived from the asset correlation networks, respectively. We construct three types of financial networks in which nodes indicate assets and edges are based on three corre…
▽ More
In this study, we propose a new multi-objective portfolio optimization with idiosyncratic and systemic risks for financial networks. The two risks are measured by the idiosyncratic variance and the network clustering coefficient derived from the asset correlation networks, respectively. We construct three types of financial networks in which nodes indicate assets and edges are based on three correlation measures. Starting from the multi-objective model, we formulate and solve the asset allocation problem. We find that the optimal portfolios obtained through the multi-objective with networked approach have a significant over-performance in terms of return measures in an out-of-sample framework. This is further supported by the less drawdown during the periods of the stock market fluctuating downward. According to analyzing different datasets, we also show that improvements made to portfolio strategies are robust.
△ Less
Submitted 22 November, 2021;
originally announced November 2021.
-
Hermite Polynomial-based Valuation of American Options with General Jump-Diffusion Processes
Authors:
Li Chen,
Guang Zhang
Abstract:
We present a new approximation scheme for the price and exercise policy of American options. The scheme is based on Hermite polynomial expansions of the transition density of the underlying asset dynamics and the early exercise premium representation of the American option price. The advantages of the proposed approach are threefold. First, our approach does not require the transition density and…
▽ More
We present a new approximation scheme for the price and exercise policy of American options. The scheme is based on Hermite polynomial expansions of the transition density of the underlying asset dynamics and the early exercise premium representation of the American option price. The advantages of the proposed approach are threefold. First, our approach does not require the transition density and characteristic functions of the underlying asset dynamics to be attainable in closed form. Second, our approach is fast and accurate, while the prices and exercise policy can be jointly produced. Third, our approach has a wide range of applications. We show that the proposed approximations of the price and optimal exercise boundary converge to the true ones. We also provide a numerical method based on a step function to implement our proposed approach. Applications to nonlinear mean-reverting models, double mean-reverting models, Merton's and Kou's jump-diffusion models are presented and discussed.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
Deep Learning in Asset Pricing
Authors:
Luyang Chen,
Markus Pelger,
Jason Zhu
Abstract:
We use deep neural networks to estimate an asset pricing model for individual stock returns that takes advantage of the vast amount of conditioning information, while keeping a fully flexible form and accounting for time-variation. The key innovations are to use the fundamental no-arbitrage condition as criterion function, to construct the most informative test assets with an adversarial approach…
▽ More
We use deep neural networks to estimate an asset pricing model for individual stock returns that takes advantage of the vast amount of conditioning information, while keeping a fully flexible form and accounting for time-variation. The key innovations are to use the fundamental no-arbitrage condition as criterion function, to construct the most informative test assets with an adversarial approach and to extract the states of the economy from many macroeconomic time series. Our asset pricing model outperforms out-of-sample all benchmark approaches in terms of Sharpe ratio, explained variation and pricing errors and identifies the key factors that drive asset prices.
△ Less
Submitted 10 August, 2021; v1 submitted 10 March, 2019;
originally announced April 2019.
-
The evolving networks of debtor-creditor relationships with addition and deletion of nodes: a case of P2P lending
Authors:
Lin Chen,
Ping Li,
Qiang Li
Abstract:
P2P lending activities have grown rapidly and have caused the huge and complex networks of debtor-creditor relationships. The aim of this study was to study the underlying structural characteristics of networks formed by debtor-creditor relationships. According attributes of P2P lending, this paper model the networks of debtor-creditor relationships as an evolving networks with addition and deleti…
▽ More
P2P lending activities have grown rapidly and have caused the huge and complex networks of debtor-creditor relationships. The aim of this study was to study the underlying structural characteristics of networks formed by debtor-creditor relationships. According attributes of P2P lending, this paper model the networks of debtor-creditor relationships as an evolving networks with addition and deletion of nodes. It was found that networks of debtor-creditor relationships are scale-free networks. Moreover, the exponent of power-law was calculated by an empirical study. In addition, this paper study what factors impact on the exponent of power-law besides the number of nodes. It was found that the both interest rate and term have significantly influence on the exponent of power-law. Interest rate is negatively correlated with the exponent of power-law and term is positively correlated with the exponent of power-law. Our results enriches the application of complex networks
△ Less
Submitted 11 June, 2018;
originally announced June 2018.
-
Anomalous Scaling of Stochastic Processes and the Moses Effect
Authors:
Lijian Chen,
Kevin E. Bassler,
Joseph L. McCauley,
Gemunu H. Gunaratne
Abstract:
The state of a stochastic process evolving over a time $t$ is typically assumed to lie on a normal distribution whose width scales like $t^{1/2}$. However, processes where the probability distribution is not normal and the scaling exponent differs from $\frac{1}{2}$ are known. The search for possible origins of such "anomalous" scaling and approaches to quantify them are the motivations for the wo…
▽ More
The state of a stochastic process evolving over a time $t$ is typically assumed to lie on a normal distribution whose width scales like $t^{1/2}$. However, processes where the probability distribution is not normal and the scaling exponent differs from $\frac{1}{2}$ are known. The search for possible origins of such "anomalous" scaling and approaches to quantify them are the motivations for the work reported here. In processes with stationary increments, where the stochastic process is time-independent, auto-correlations between increments and infinite variance of increments can cause anomalous scaling. These sources have been referred to as the $\it{Joseph}$ $\it{effect}$ the $\it{Noah}$ $\it{effect}$, respectively. If the increments are non-stationary, then scaling of increments with $t$ can also lead to anomalous scaling, a mechanism we refer to as the $\it{Moses}$ $\it{effect}$. Scaling exponents quantifying the three effects are defined and related to the Hurst exponent that characterizes the overall scaling of the stochastic process. Methods of time series analysis that enable accurate independent measurement of each exponent are presented. Simple stochastic processes are used to illustrate each effect. Intraday Financial time series data is analyzed, revealing that its anomalous scaling is due only to the Moses effect. In the context of financial market data, we reiterate that the Joseph exponent, not the Hurst exponent, is the appropriate measure to test the efficient market hypothesis.
△ Less
Submitted 7 April, 2017;
originally announced April 2017.
-
Computation of the "Enrichment" of a Value Functions of an Optimization Problem on Cumulated Transaction-Costs through a Generalized Lax-Hopf Formula
Authors:
Luxi Chen
Abstract:
The Lax-Hopf formula simplifies the value function of an intertemporal optimization (infinite dimensional) problem associated with a convex transaction-cost function which depends only on the transactions (velocities) of a commodity evolution: it states that the value function is equal to the marginal fonction of a finite dimensional problem with respect to durations and average ransactions, much…
▽ More
The Lax-Hopf formula simplifies the value function of an intertemporal optimization (infinite dimensional) problem associated with a convex transaction-cost function which depends only on the transactions (velocities) of a commodity evolution: it states that the value function is equal to the marginal fonction of a finite dimensional problem with respect to durations and average ransactions, much simpler to solve. The average velocity of the value function on a investment temporal window is regarded as an enrichment, proportional to the profit and inversely proportional to the investment duration. At optimum, the Lax-Hopf formula implies that the enrichment is equal to the cost of the average transaction on the investment temporal window. In this study, we generalize the Lax-Hopf formula when the transaction-cost function depends also on time and commodity, for reducing the infinite dimensional problem to a finite dimensional problem. For that purpose, we introduce the moderated ansaction-cost function which depends only on the duration and on a commodity. Here again, the generalized Lax-Hopf formula reduces the computation of the value function to the marginal fonction of an optimization problem on durations and commodities involving the moderated transaction cost function. At optimum, the enrichment of the value function is still equal to the moderated transition cost-function of average transaction.
△ Less
Submitted 8 January, 2014;
originally announced January 2014.