-
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
Authors:
Haohang Li,
Yupeng Cao,
Yangyang Yu,
Shashidhar Reddy Javaji,
Zhiyang Deng,
Yueru He,
Yuechen Jiang,
Zining Zhu,
Koduvayur Subbalakshmi,
Guojun Xiong,
Jimin Huang,
Lingfei Qian,
Xueqing Peng,
Qianqian Xie,
Jordan W. Suchow
Abstract:
Recent advancements have underscored the potential of large language model (LLM)-based agents in financial decision-making. Despite this progress, the field currently encounters two main challenges: (1) the lack of a comprehensive LLM agent framework adaptable to a variety of financial tasks, and (2) the absence of standardized benchmarks and consistent datasets for assessing agent performance. To…
▽ More
Recent advancements have underscored the potential of large language model (LLM)-based agents in financial decision-making. Despite this progress, the field currently encounters two main challenges: (1) the lack of a comprehensive LLM agent framework adaptable to a variety of financial tasks, and (2) the absence of standardized benchmarks and consistent datasets for assessing agent performance. To tackle these issues, we introduce \textsc{InvestorBench}, the first benchmark specifically designed for evaluating LLM-based agents in diverse financial decision-making contexts. InvestorBench enhances the versatility of LLM-enabled agents by providing a comprehensive suite of tasks applicable to different financial products, including single equities like stocks, cryptocurrencies and exchange-traded funds (ETFs). Additionally, we assess the reasoning and decision-making capabilities of our agent framework using thirteen different LLMs as backbone models, across various market environments and tasks. Furthermore, we have curated a diverse collection of open-source, multi-modal datasets and developed a comprehensive suite of environments for financial decision-making. This establishes a highly accessible platform for evaluating financial agents' performance across various scenarios.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Authors:
Yuzhe Yang,
Yifei Zhang,
Yan Hu,
Yilin Guo,
Ruoli Gan,
Yueru He,
Mingcong Lei,
Xiao Zhang,
Haining Wang,
Qianqian Xie,
Jimin Huang,
Honghai Yu,
Benyou Wang
Abstract:
This paper introduces the UCFE: User-Centric Financial Expertise benchmark, an innovative framework designed to evaluate the ability of large language models (LLMs) to handle complex real-world financial tasks. UCFE benchmark adopts a hybrid approach that combines human expert evaluations with dynamic, task-specific interactions to simulate the complexities of evolving financial scenarios. Firstly…
▽ More
This paper introduces the UCFE: User-Centric Financial Expertise benchmark, an innovative framework designed to evaluate the ability of large language models (LLMs) to handle complex real-world financial tasks. UCFE benchmark adopts a hybrid approach that combines human expert evaluations with dynamic, task-specific interactions to simulate the complexities of evolving financial scenarios. Firstly, we conducted a user study involving 804 participants, collecting their feedback on financial tasks. Secondly, based on this feedback, we created our dataset that encompasses a wide range of user intents and interactions. This dataset serves as the foundation for benchmarking 11 LLMs services using the LLM-as-Judge methodology. Our results show a significant alignment between benchmark scores and human preferences, with a Pearson correlation coefficient of 0.78, confirming the effectiveness of the UCFE dataset and our evaluation approach. UCFE benchmark not only reveals the potential of LLMs in the financial domain but also provides a robust framework for assessing their performance and user satisfaction.
△ Less
Submitted 7 February, 2025; v1 submitted 17 October, 2024;
originally announced October 2024.
-
A Krasnoselskii-Mann Proximity Algorithm for Markowitz Portfolios with Adaptive Expected Return Level
Authors:
Yizun Lin,
Yongxin He,
Zhao-Rong Lai
Abstract:
Markowitz's criterion aims to balance expected return and risk when optimizing the portfolio. The expected return level is usually fixed according to the risk appetite of an investor, then the risk is minimized at this fixed return level. However, the investor may not know which return level is suitable for her/him and the current financial circumstance. It motivates us to find a novel approach th…
▽ More
Markowitz's criterion aims to balance expected return and risk when optimizing the portfolio. The expected return level is usually fixed according to the risk appetite of an investor, then the risk is minimized at this fixed return level. However, the investor may not know which return level is suitable for her/him and the current financial circumstance. It motivates us to find a novel approach that adaptively optimizes this return level and the portfolio at the same time. It not only relieves the trouble of deciding the return level during an investment but also gets more adaptive to the ever-changing financial market than a subjective return level. In order to solve the new model, we propose an exact, convergent, and efficient Krasnoselskii-Mann Proximity Algorithm based on the proximity operator and Krasnoselskii-Mann momentum technique. Extensive experiments show that the proposed method achieves significant improvements over state-of-the-art methods in portfolio optimization. This finding may contribute a new perspective on the relationship between return and risk in portfolio optimization.
△ Less
Submitted 7 November, 2024; v1 submitted 20 September, 2024;
originally announced September 2024.
-
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
Authors:
Jimin Huang,
Mengxi Xiao,
Dong Li,
Zihao Jiang,
Yuzhe Yang,
Yifei Zhang,
Lingfei Qian,
Yan Wang,
Xueqing Peng,
Yang Ren,
Ruoyu Xiang,
Zhengyu Chen,
Xiao Zhang,
Yueru He,
Weiguang Han,
Shunian Chen,
Lihang Shen,
Daniel Kim,
Yangyang Yu,
Yupeng Cao,
Zhiyang Deng,
Haohang Li,
Duanyu Feng,
Yongfu Dai,
VijayaSai Somasundaram
, et al. (19 additional authors not shown)
Abstract:
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t…
▽ More
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses.
△ Less
Submitted 6 June, 2025; v1 submitted 20 August, 2024;
originally announced August 2024.
-
Beyond the Bid-Ask: Strategic Insights into Spread Prediction and the Global Mid-Price Phenomenon
Authors:
Yifan He,
Abootaleb Shirvani,
Barret Shao,
Svetlozar Rachev,
Frank Fabozzi
Abstract:
This research extends the conventional concepts of the bid--ask spread (BAS) and mid-price to include the total market order book bid--ask spread (TMOBBAS) and the global mid-price (GMP). Using high-frequency trading data, we investigate these new constructs, finding that they have heavy tails and significant deviations from normality in the distributions of their log returns, which are confirmed…
▽ More
This research extends the conventional concepts of the bid--ask spread (BAS) and mid-price to include the total market order book bid--ask spread (TMOBBAS) and the global mid-price (GMP). Using high-frequency trading data, we investigate these new constructs, finding that they have heavy tails and significant deviations from normality in the distributions of their log returns, which are confirmed by three different methods. We shift from a static to a dynamic analysis, employing the ARMA(1,1)-GARCH(1,1) model to capture the temporal dependencies in the return time-series, with the normal inverse Gaussian distribution used to capture the heavy tails of the returns. We apply an option pricing model to address the risks associated with the low liquidity indicated by the TMOBBAS and GMP. Additionally, we employ the Rachev ratio to evaluate the risk--return performance at various depths of the limit order book and examine tail risk interdependencies across spread levels. This study provides insights into the dynamics of financial markets, offering tools for trading strategies and systemic risk management.
△ Less
Submitted 21 October, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Exploring Implied Certainty Equivalent Rates in Financial Markets: Empirical Analysis and Application to the Electric Vehicle Industry
Authors:
Yifan He,
Svetlozar Rachev
Abstract:
In this paper, we mainly study the impact of the implied certainty equivalent rate on investment in financial markets. First, we derived the mathematical expression of the implied certainty equivalent rate by using put-call parity, and then we selected some company stocks and options; we considered the best-performing and worst-performing company stocks and options from the beginning of 2023 to th…
▽ More
In this paper, we mainly study the impact of the implied certainty equivalent rate on investment in financial markets. First, we derived the mathematical expression of the implied certainty equivalent rate by using put-call parity, and then we selected some company stocks and options; we considered the best-performing and worst-performing company stocks and options from the beginning of 2023 to the present for empirical research. By visualizing the relationship between the time to maturity, moneyness, and implied certainty equivalent rate of these options, we have obtained a universal conclusion -- a positive implied certainty equivalent rate is more suitable for investment than a negative implied certainty equivalent rate, but for a positive implied certainty equivalent rate, a larger value also means a higher investment risk. Next, we applied these results to the electric vehicle industry, and by comparing several well-known US electric vehicle production companies, we further strengthened our conclusions. Finally, we give a warning concerning risk, that is, investment in the financial market should not focus solely on the implied certainty equivalent rate, because investment is not an easy task, and many factors need to be considered, including some factors that are difficult to predict with models.
△ Less
Submitted 18 July, 2023; v1 submitted 30 June, 2023;
originally announced July 2023.
-
The Implied Views of Bond Traders on the Spot Equity Market
Authors:
Yifan He,
Yuan Hu,
Svetlozar Rachev
Abstract:
This study delves into the temporal dynamics within the equity market through the lens of bond traders. Recognizing that the riskless interest rate fluctuates over time, we leverage the Black-Derman-Toy model to trace its temporal evolution. To gain insights from a bond trader's perspective, we focus on a specific type of bond: the zero-coupon bond. This paper introduces a pricing algorithm for th…
▽ More
This study delves into the temporal dynamics within the equity market through the lens of bond traders. Recognizing that the riskless interest rate fluctuates over time, we leverage the Black-Derman-Toy model to trace its temporal evolution. To gain insights from a bond trader's perspective, we focus on a specific type of bond: the zero-coupon bond. This paper introduces a pricing algorithm for this bond and presents a formula that can be used to ascertain its real value. By crafting an equation that juxtaposes the theoretical value of a zero-coupon bond with its actual value, we can deduce the risk-neutral probability. It is noteworthy that the risk-neutral probability correlates with variables like the instantaneous mean return, instantaneous volatility, and inherent upturn probability in the equity market. Examining these relationships enables us to discern the temporal shifts in these parameters. Our findings suggest that the mean starts at a negative value, eventually plateauing at a consistent level. The volatility, on the other hand, initially has a minimal positive value, peaks swiftly, and then stabilizes. Lastly, the upturn probability is initially significantly high, plunges rapidly, and ultimately reaches equilibrium.
△ Less
Submitted 17 October, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
The Gerber-Shiu discounted penalty function: A review from practical perspectives
Authors:
Yue He,
Reiichiro Kawai,
Yasutaka Shimizu,
Kazutoshi Yamazaki
Abstract:
The Gerber-Shiu function provides a unified framework for the evaluation of a variety of risk quantities. Ever since its establishment, it has attracted constantly increasing interests in actuarial science, whereas the conventional research has been focused on finding analytical or semi-analytical solutions, either of which is rarely available, except for limited classes of penalty functions on ra…
▽ More
The Gerber-Shiu function provides a unified framework for the evaluation of a variety of risk quantities. Ever since its establishment, it has attracted constantly increasing interests in actuarial science, whereas the conventional research has been focused on finding analytical or semi-analytical solutions, either of which is rarely available, except for limited classes of penalty functions on rather simple risk models. In contrast to its great generality, the Gerber-Shiu function does not seem sufficiently prevalent in practice, largely due to a variety of difficulties in numerical approximation and statistical inference. To enhance research activities on such implementation aspects, we provide a comprehensive review of existing formulations and underlying surplus processes, as well as an extensive survey of analytical, semi-analytical and asymptotic methods for the Gerber-Shiu function, which altogether shed fresh light on its numerical methods and statistical inference for further developments. On the basis of an ambitious collection of 235 references, the present survey can serve as an insightful guidebook to model and method selection from practical perspectives as well.
△ Less
Submitted 5 December, 2022; v1 submitted 20 March, 2022;
originally announced March 2022.
-
Evolutionary dynamics in financial markets with heterogeneities in strategies and risk tolerance
Authors:
Wen-Juan Xu,
Chen-Yang Zhong,
Fei Ren,
Tian Qiu,
Rong-Da Chen,
Yun-Xin He,
Li-Xin Zhong
Abstract:
In nature and human societies, the effects of homogeneous and heterogeneous characteristics on the evolution of collective behaviors are quite different from each other. It is of great importance to understand the underlying mechanisms of the occurrence of such differences. By incorporating pair pattern strategies and reference point strategies into an agent-based model, we have investigated the c…
▽ More
In nature and human societies, the effects of homogeneous and heterogeneous characteristics on the evolution of collective behaviors are quite different from each other. It is of great importance to understand the underlying mechanisms of the occurrence of such differences. By incorporating pair pattern strategies and reference point strategies into an agent-based model, we have investigated the coupled effects of heterogeneous investment strategies and heterogeneous risk tolerance on price fluctuations. In the market flooded with the investors with homogeneous investment strategies or homogeneous risk tolerance, large price fluctuations are easy to occur. In the market flooded with the investors with heterogeneous investment strategies or heterogeneous risk tolerance, the price fluctuations are suppressed. For a heterogeneous population, the coexistence of investors with pair pattern strategies and reference point strategies causes the price to have a slow fluctuation around a typical equilibrium point and both a large price fluctuation and a no-trading state are avoided, in which the pair pattern strategies push the system far away from the equilibrium while the reference point strategies pull the system back to the equilibrium. A theoretical analysis indicates that the evolutionary dynamics in the present model is governed by the competition between different strategies. The strategy that causes large price fluctuations loses more while the strategy that pulls the system back to the equilibrium gains more. Overfrequent trading does harm to one's pursuit for more wealth.
△ Less
Submitted 18 October, 2020;
originally announced October 2020.
-
Learned Sectors: A fundamentals-driven sector reclassification project
Authors:
Rukmal Weerawarana,
Yiyi Zhu,
Yuzhen He
Abstract:
Market sectors play a key role in the efficient flow of capital through the modern Global economy. We analyze existing sectorization heuristics, and observe that the most popular - the GICS (which informs the S&P 500), and the NAICS (published by the U.S. Government) - are not entirely quantitatively driven, but rather appear to be highly subjective and rooted in dogma. Building on inferences from…
▽ More
Market sectors play a key role in the efficient flow of capital through the modern Global economy. We analyze existing sectorization heuristics, and observe that the most popular - the GICS (which informs the S&P 500), and the NAICS (published by the U.S. Government) - are not entirely quantitatively driven, but rather appear to be highly subjective and rooted in dogma. Building on inferences from analysis of the capital structure irrelevance principle and the Modigliani-Miller theoretic universe conditions, we postulate that corporation fundamentals - particularly those components specific to the Modigliani-Miller universe conditions - would be optimal descriptors of the true economic domain of operation of a company. We generate a set of potential candidate learned sector universes by varying the linkage method of a hierarchical clustering algorithm, and the number of resulting sectors derived from the model (ranging from 5 to 19), resulting in a total of 60 candidate learned sector universes. We then introduce reIndexer, a backtest-driven sector universe evaluation research tool, to rank the candidate sector universes produced by our learned sector classification heuristic. This rank was utilized to identify the risk-adjusted return optimal learned sector universe as being the universe generated under CLINK (i.e. complete linkage), with 17 sectors. The optimal learned sector universe was tested against the benchmark GICS classification universe with reIndexer, outperforming on both absolute portfolio value, and risk-adjusted return over the backtest period. We conclude that our fundamentals-driven Learned Sector classification heuristic provides a superior risk-diversification profile than the status quo classification heuristic.
△ Less
Submitted 30 May, 2019;
originally announced June 2019.
-
A generalized public goods game with coupling of individual ability and project benefit
Authors:
Li-Xin Zhong,
Wen-Juan Xu,
Yun-Xin He,
Chen-Yang Zhong,
Rong-Da Chen,
Tian Qiu,
Yong-Dong Shi,
Fei Ren
Abstract:
Facing a heavy task, any single person can only make a limited contribution and team cooperation is needed. As one enjoys the benefit of the public goods, the potential benefits of the project are not always maximized and may be partly wasted. By incorporating individual ability and project benefit into the original public goods game, we study the coupling effect of the four parameters, the upper…
▽ More
Facing a heavy task, any single person can only make a limited contribution and team cooperation is needed. As one enjoys the benefit of the public goods, the potential benefits of the project are not always maximized and may be partly wasted. By incorporating individual ability and project benefit into the original public goods game, we study the coupling effect of the four parameters, the upper limit of individual contribution, the upper limit of individual benefit, the needed project cost and the upper limit of project benefit on the evolution of cooperation. Coevolving with the individual-level group size preferences, an increase in the upper limit of individual benefit promotes cooperation while an increase in the upper limit of individual contribution inhibits cooperation. The coupling of the upper limit of individual contribution and the needed project cost determines the critical point of the upper limit of project benefit, where the equilibrium frequency of cooperators reaches its highest level. Above the critical point, an increase in the upper limit of project benefit inhibits cooperation. The evolution of cooperation is closely related to the preferred group-size distribution. A functional relation between the frequency of cooperators and the dominant group size is found.
△ Less
Submitted 21 May, 2017; v1 submitted 23 February, 2017;
originally announced February 2017.