Search | arXiv e-print repository

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Authors: Jimin Huang, Mengxi Xiao, Dong Li, Zihao Jiang, Yuzhe Yang, Yifei Zhang, Lingfei Qian, Yan Wang, Xueqing Peng, Yang Ren, Ruoyu Xiang, Zhengyu Chen, Xiao Zhang, Yueru He, Weiguang Han, Shunian Chen, Lihang Shen, Daniel Kim, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram , et al. (19 additional authors not shown)

Abstract: Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t… ▽ More Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses. △ Less

Submitted 6 June, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

Comments: 33 pages, 13 figures

arXiv:2401.08094 [pdf, ps, other]

Optimal Insurance to Maximize Exponential Utility when Premium is Computed by a Convex Functional

Authors: Jingyi Cao, Dongchen Li, Virginia R. Young, Bin Zou

Abstract: We find the optimal indemnity to maximize the expected utility of terminal wealth of a buyer of insurance whose preferences are modeled by an exponential utility. The insurance premium is computed by a convex functional. We obtain a necessary condition for the optimal indemnity; then, because the candidate optimal indemnity is given implicitly, we use that necessary condition to develop a numerica… ▽ More We find the optimal indemnity to maximize the expected utility of terminal wealth of a buyer of insurance whose preferences are modeled by an exponential utility. The insurance premium is computed by a convex functional. We obtain a necessary condition for the optimal indemnity; then, because the candidate optimal indemnity is given implicitly, we use that necessary condition to develop a numerical algorithm to compute it. We prove that the numerical algorithm converges to a unique indemnity that, indeed, equals the optimal policy. We also illustrate our results with numerical examples. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 12 pages, 3 figures

MSC Class: 91G05; 93E20; 49M05

arXiv:2305.17881 [pdf, ps, other]

Integrating Different Informations for Portfolio Selection

Authors: Yi Huang, Wei Zhu, Duan Li, Shushang Zhu, Shikun Wang

Abstract: Following the idea of Bayesian learning via Gaussian mixture model, we organically combine the backward-looking information contained in the historical data and the forward-looking information implied by the market portfolio, which is affected by heterogeneous expectations and noisy trading behavior. The proposed combined estimation adaptively harmonizes these two types of information based on the… ▽ More Following the idea of Bayesian learning via Gaussian mixture model, we organically combine the backward-looking information contained in the historical data and the forward-looking information implied by the market portfolio, which is affected by heterogeneous expectations and noisy trading behavior. The proposed combined estimation adaptively harmonizes these two types of information based on the degree of market efficiency and responds quickly at turning points of the market. Both simulation experiments and a global empirical test confirm that the approach is a flexible and robust forecasting tool and is applicable to various capital markets with different degrees of efficiency. △ Less

Submitted 29 May, 2023; originally announced May 2023.

arXiv:2304.11771 [pdf, other]

Generative AI at Work

Authors: Erik Brynjolfsson, Danielle Li, Lindsey Raymond

Abstract: We study the staggered introduction of a generative AI-based conversational assistant using data from 5,172 customer support agents. Access to AI assistance increases worker productivity, as measured by issues resolved per hour, by 15\% on average, with substantial heterogeneity across workers. Less experienced and lower-skilled workers improve both the speed and quality of their output while the… ▽ More We study the staggered introduction of a generative AI-based conversational assistant using data from 5,172 customer support agents. Access to AI assistance increases worker productivity, as measured by issues resolved per hour, by 15\% on average, with substantial heterogeneity across workers. Less experienced and lower-skilled workers improve both the speed and quality of their output while the most experienced and highest-skilled workers see small gains in speed and small declines in quality. We also find evidence that AI assistance facilitates worker learning and improves English fluency, particularly among international agents. While AI systems improve with more training data, we find that the gains from AI adoption are largest for relatively rare problems, where human agents have less baseline training and experience. Finally, we provide evidence that AI assistance improves the experience of work along two key dimensions: customers are more polite and less likely to ask to speak to a manager. △ Less

Submitted 5 November, 2024; v1 submitted 23 April, 2023; originally announced April 2023.

arXiv:2303.11716 [pdf, other]

Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning

Authors: Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan

Abstract: In high-dimensional time-series analysis, it is essential to have a set of key factors (namely, the style factors) that explain the change of the observed variable. For example, volatility modeling in finance relies on a set of risk factors, and climate change studies in climatology rely on a set of causal factors. The ideal low-dimensional style factors should balance significance (with high expl… ▽ More In high-dimensional time-series analysis, it is essential to have a set of key factors (namely, the style factors) that explain the change of the observed variable. For example, volatility modeling in finance relies on a set of risk factors, and climate change studies in climatology rely on a set of causal factors. The ideal low-dimensional style factors should balance significance (with high explanatory power) and stability (consistent, no significant fluctuations). However, previous supervised and unsupervised feature extraction methods can hardly address the tradeoff. In this paper, we propose Style Miner, a reinforcement learning method to generate style factors. We first formulate the problem as a Constrained Markov Decision Process with explanatory power as the return and stability as the constraint. Then, we design fine-grained immediate rewards and costs and use a Lagrangian heuristic to balance them adaptively. Experiments on real-world financial data sets show that Style Miner outperforms existing learning-based methods by a large margin and achieves a relatively 10% gain in R-squared explanatory power compared to the industry-renowned factors proposed by human experts. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: 9 pages, 6 figures

arXiv:2012.07368 [pdf, ps, other]

Effective Algorithms for Optimal Portfolio Deleveraging Problem with Cross Impact

Authors: Hezhi Luo, Yuanyuan Chen, Xianye Zhang, Duan Li, Huixian Wu

Abstract: We investigate the optimal portfolio deleveraging (OPD) problem with permanent and temporary price impacts, where the objective is to maximize equity while meeting a prescribed debt/equity requirement. We take the real situation with cross impact among different assets into consideration. The resulting problem is, however, a non-convex quadratic program with a quadratic constraint and a box constr… ▽ More We investigate the optimal portfolio deleveraging (OPD) problem with permanent and temporary price impacts, where the objective is to maximize equity while meeting a prescribed debt/equity requirement. We take the real situation with cross impact among different assets into consideration. The resulting problem is, however, a non-convex quadratic program with a quadratic constraint and a box constraint, which is known to be NP-hard. In this paper, we first develop a successive convex optimization (SCO) approach for solving the OPD problem and show that the SCO algorithm converges to a KKT point of its transformed problem. Second, we propose an effective global algorithm for the OPD problem, which integrates the SCO method, simple convex relaxation and a branch-and-bound framework, to identify a global optimal solution to the OPD problem within a pre-specified $ε$-tolerance. We establish the global convergence of our algorithm and estimate its complexity. We also conduct numerical experiments to demonstrate the effectiveness of our proposed algorithms with both the real data and the randomly generated medium- and large-scale OPD problem instances. △ Less

Submitted 15 January, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

arXiv:2010.08985 [pdf, ps, other]

Scenario-decomposition Solution Framework for Nonseparable Stochastic Control Problems

Authors: Xin Huang, Duan Li, Daniel Zhuoyu Long

Abstract: When stochastic control problems do not possess separability and/or monotonicity, the dynamic programming pioneered by Bellman in 1950s fails to work as a time-decomposition solution method. Such cases have posted a great challenge to the control society in both theoretical foundation and solution methodologies for many years. With the help of the progressive hedging algorithm proposed by Rockafel… ▽ More When stochastic control problems do not possess separability and/or monotonicity, the dynamic programming pioneered by Bellman in 1950s fails to work as a time-decomposition solution method. Such cases have posted a great challenge to the control society in both theoretical foundation and solution methodologies for many years. With the help of the progressive hedging algorithm proposed by Rockafellar and Wets in 1991, we develop a novel scenario-decomposition solution framework for stochastic control problems which could be nonseparable and/or non-monotonic, thus extending the reach of stochastic optimal control. We discuss then some of its promising applications, including online quadratic programming problems and dynamic portfolio selection problems with smoothing properties. △ Less

Submitted 18 October, 2020; originally announced October 2020.

Comments: Working paper. Under review

arXiv:2002.07116 [pdf, ps, other]

A New Pricing Theory That Solves the St. Petersburg Paradox

Authors: Dahang Li

Abstract: The St. Petersburg Paradox, an important topic in probability theory, has not been solved in the last 280 years. Since Nicolaus Bernoulli proposed the St. Petersburg Paradox in 1738, many people had tried to solve it and had proposed various explanations, but all were not satisfactory. In this paper we propose a new pricing theory with several rules, which incidentally resolves this paradox. The n… ▽ More The St. Petersburg Paradox, an important topic in probability theory, has not been solved in the last 280 years. Since Nicolaus Bernoulli proposed the St. Petersburg Paradox in 1738, many people had tried to solve it and had proposed various explanations, but all were not satisfactory. In this paper we propose a new pricing theory with several rules, which incidentally resolves this paradox. The new pricing theory states that so-called fair (reasonable) pricing should be judged by the seller and the buyer independently. Reasonable pricing for the seller may not be appropriate for the buyer. The seller cares about costs, while the buyer is concerned about the realistic prospect of returns.The pricing theory we proposed can be applied to financial markets to solve the confusion that financial asset return with fat tails distribution will cause the option pricing formula to fail, thus making up the theoretical defects of quantitative financial pricing theory. △ Less

Submitted 3 May, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

arXiv:1906.09024 [pdf, ps, other]

BERT-based Financial Sentiment Index and LSTM-based Stock Return Predictability

Authors: Joshua Zoen Git Hiew, Xin Huang, Hao Mou, Duan Li, Qi Wu, Yabo Xu

Abstract: Traditional sentiment construction in finance relies heavily on the dictionary-based approach, with a few exceptions using simple machine learning techniques such as Naive Bayes classifier. While the current literature has not yet invoked the rapid advancement in the natural language processing, we construct in this research a textual-based sentiment index using a well-known pre-trained model BERT… ▽ More Traditional sentiment construction in finance relies heavily on the dictionary-based approach, with a few exceptions using simple machine learning techniques such as Naive Bayes classifier. While the current literature has not yet invoked the rapid advancement in the natural language processing, we construct in this research a textual-based sentiment index using a well-known pre-trained model BERT developed by Google, especially for three actively trading individual stocks in Hong Kong market with at the same time the hot discussion on Weibo.com. On the one hand, we demonstrate a significant enhancement of applying BERT in financial sentiment analysis when compared with the existing models. On the other hand, by combining with the other two commonly-used methods when it comes to building the sentiment index in the financial literature, i.e., the option-implied and the market-implied approaches, we propose a more general and comprehensive framework for the financial sentiment analysis, and further provide convincing outcomes for the predictability of individual stock return by combining LSTM (with a feature of a nonlinear mapping). It is significantly distinct with the dominating econometric methods in sentiment influence analysis which are all of a nature of linear regression. △ Less

Submitted 7 July, 2022; v1 submitted 21 June, 2019; originally announced June 2019.

Comments: Manuscript

arXiv:1709.05529 [pdf, ps, other]

Explicit Solution for Constrained Stochastic Linear-Quadratic Control with Multiplicative Noise

Authors: Weipin Wu, Jianjun Gao, Duan Li, Yun Shi

Abstract: We study in this paper a class of constrained linear-quadratic (LQ) optimal control problem formulations for the scalar-state stochastic system with multiplicative noise, which has various applications, especially in the financial risk management. The linear constraint on both the control and state variables considered in our model destroys the elegant structure of the conventional LQ formulation… ▽ More We study in this paper a class of constrained linear-quadratic (LQ) optimal control problem formulations for the scalar-state stochastic system with multiplicative noise, which has various applications, especially in the financial risk management. The linear constraint on both the control and state variables considered in our model destroys the elegant structure of the conventional LQ formulation and has blocked the derivation of an explicit control policy so far in the literature. We successfully derive in this paper the analytical control policy for such a class of problems by utilizing the state separation property induced from its structure. We reveal that the optimal control policy is a piece-wise affine function of the state and can be computed off-line efficiently by solving two coupled Riccati equations. Under some mild conditions, we also obtain the stationary control policy for infinite time horizon. We demonstrate the implementation of our method via some illustrative examples and show how to calibrate our model to solve dynamic constrained portfolio optimization problems. △ Less

Submitted 16 September, 2017; originally announced September 2017.

Comments: 32 Pages, 2 Figures

ACM Class: I.2.8; J.1

arXiv:1703.01984 [pdf, ps, other]

Optimality of Excess-Loss Reinsurance under a Mean-Variance Criterion

Authors: Danping Li, Dongchen Li, Virginia R. Young

Abstract: In this paper, we study an insurer's reinsurance-investment problem under a mean-variance criterion. We show that excess-loss is the unique equilibrium reinsurance strategy under a spectrally negative Lévy insurance model when the reinsurance premium is computed according to the expected value premium principle. Furthermore, we obtain the explicit equilibrium reinsurance-investment strategy by sol… ▽ More In this paper, we study an insurer's reinsurance-investment problem under a mean-variance criterion. We show that excess-loss is the unique equilibrium reinsurance strategy under a spectrally negative Lévy insurance model when the reinsurance premium is computed according to the expected value premium principle. Furthermore, we obtain the explicit equilibrium reinsurance-investment strategy by solving the extended Hamilton-Jacobi-Bellman equation. △ Less

Submitted 21 March, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

arXiv:1408.6070 [pdf, ps, other]

Time Consistent Behavior Portfolio Policy for Dynamic Mean-Variance Formulation

Authors: Xiangyu Cui, Xun Li, Duan Li, Yun Shi

Abstract: When we implement a portfolio selection methodology under a mean-risk formulation, it is essential to correctly model investors' risk aversion which may be time-dependent, or even state-dependent during the investment procedure. In this paper, we propose a behavior risk aversion model, which is a piecewise linear function of the current wealth level with a reference point at a preset investment ta… ▽ More When we implement a portfolio selection methodology under a mean-risk formulation, it is essential to correctly model investors' risk aversion which may be time-dependent, or even state-dependent during the investment procedure. In this paper, we propose a behavior risk aversion model, which is a piecewise linear function of the current wealth level with a reference point at a preset investment target. Due to the time inconsistency of the resulting multi-period mean-variance model with an adaptive risk aversion, we investigate in this paper the time consistent behavior portfolio policy by solving a nested mean-variance game formulation. We derive semi-analytical time consistent behavior portfolio policy which takes a piecewise linear feedback form of the current wealth level with respect to the discounted investment target. △ Less

Submitted 2 August, 2015; v1 submitted 26 August, 2014; originally announced August 2014.

arXiv:1403.0718 [pdf, ps, other]

Mean-Variance Policy for Discrete-time Cone Constrained Markets: The Consistency in Efficiency and Minimum-Variance Signed Supermartingale Measure

Authors: Xiangyu Cui, Duan Li, Xun Li

Abstract: The discrete-time mean-variance portfolio selection formulation, a representative of general dynamic mean-risk portfolio selection problems, does not satisfy time consistency in efficiency (TCIE) in general, i.e., a truncated pre-committed efficient policy may become inefficient when considering the corresponding truncated problem, thus stimulating investors' irrational investment behavior. We inv… ▽ More The discrete-time mean-variance portfolio selection formulation, a representative of general dynamic mean-risk portfolio selection problems, does not satisfy time consistency in efficiency (TCIE) in general, i.e., a truncated pre-committed efficient policy may become inefficient when considering the corresponding truncated problem, thus stimulating investors' irrational investment behavior. We investigate analytically effects of portfolio constraints on time consistency of efficiency for convex cone constrained markets. More specifically, we derive the semi-analytical expressions for the pre-committed efficient mean-variance policy and the minimum-variance signed supermartingale measure (VSSM) and reveal their close relationship. Our analysis shows that the pre-committed discrete-time efficient mean-variance policy satisfies TCIE if and only if the conditional expectation of VSSM's density (with respect to the original probability measure) is nonnegative, or once the conditional expectation becomes negative, it remains at the same negative value until the terminal time. Our findings indicate that the property of time consistency in efficiency only depends on the basic market setting, including portfolio constraints, and this fact motivates us to establish a general solution framework in constructing TCIE dynamic portfolio selection problem formulations by introducing suitable portfolio constraints. △ Less

Submitted 4 March, 2014; originally announced March 2014.

arXiv:1402.3464 [pdf, ps, other]

Dynamic Mean-LPM and Mean-CVaR Portfolio Optimization in Continuous-time

Authors: Jianjun Gao, Ke Zhou, Duan Li, Xiren Cao

Abstract: Instead of controlling "symmetric" risks measured by central moments of investment return or terminal wealth, more and more portfolio models have shifted their focus to manage "asymmetric" downside risks that the investment return is below certain threshold. Among the existing downside risk measures, the lower-partial moments (LPM) and conditional value-at-risk (CVaR) are probably most promising.… ▽ More Instead of controlling "symmetric" risks measured by central moments of investment return or terminal wealth, more and more portfolio models have shifted their focus to manage "asymmetric" downside risks that the investment return is below certain threshold. Among the existing downside risk measures, the lower-partial moments (LPM) and conditional value-at-risk (CVaR) are probably most promising. In this paper we investigate the dynamic mean-LPM and mean-CVaR portfolio optimization problems in continuous-time, while the current literature has only witnessed their static versions. Our contributions are two-fold, in both building up tractable formulations and deriving corresponding analytical solutions. By imposing a limit funding level on the terminal wealth, we conquer the ill-posedness exhibited in the class of mean-downside risk portfolio models. The limit funding level not only enables us to solve both dynamic mean-LPM and mean-CVaR portfolio optimization problems, but also offers a flexibility to tame the aggressiveness of the portfolio policies generated from such mean - downside risk models. More specifically, for a general market setting, we prove the existence and uniqueness of the Lagrangian multiplies, which is a key step in applying the martingale approach, and establish a theoretical foundation for developing efficient numerical solution approaches. Moreover, for situations where the opportunity set of the market setting is deterministic, we derive analytical portfolio policies for both dynamic mean-LPM and mean-CVaR formulations. △ Less

Submitted 14 February, 2014; originally announced February 2014.

MSC Class: 91G10

arXiv:1303.1064 [pdf, ps, other]

Unified Framework of Mean-Field Formulations for Optimal Multi-period Mean-Variance Portfolio Selection

Authors: Xiangyu Cui, Xun Li, Duan Li

Abstract: The classical dynamic programming-based optimal stochastic control methods fail to cope with nonseparable dynamic optimization problems as the principle of optimality no longer applies in such situations. Among these notorious nonseparable problems, the dynamic mean-variance portfolio selection formulation had posted a great challenge to our research community until recently. A few solution method… ▽ More The classical dynamic programming-based optimal stochastic control methods fail to cope with nonseparable dynamic optimization problems as the principle of optimality no longer applies in such situations. Among these notorious nonseparable problems, the dynamic mean-variance portfolio selection formulation had posted a great challenge to our research community until recently. A few solution methods, including the embedding scheme, have been developed in the last decade to solve the dynamic mean-variance portfolio selection formulation successfully. We propose in this paper a novel mean-field framework that offers a more efficient modeling tool and a more accurate solution scheme in tackling directly the issue of nonseparability and deriving the optimal policies analytically for the multi-period mean-variance-type portfolio selection problems. △ Less

Submitted 5 March, 2013; originally announced March 2013.

arXiv:1210.2132 [pdf, ps, other]

Equalitarian Societies are Economically Impossible

Authors: Bojin Zheng, Wenhua Du, Wanneng Shu, Jianmin Wang, Deyi Li

Abstract: The inequality of wealth distribution is a universal phenomenon in the civilized nations, and it is often imputed to the Matthew effect, that is, the rich get richer and the poor get poorer. Some philosophers unjustified this phenomenon and tried to put the human civilization upon the evenness of wealth. Noticing the facts that 1) the emergence of the centralism is the starting point of human civi… ▽ More The inequality of wealth distribution is a universal phenomenon in the civilized nations, and it is often imputed to the Matthew effect, that is, the rich get richer and the poor get poorer. Some philosophers unjustified this phenomenon and tried to put the human civilization upon the evenness of wealth. Noticing the facts that 1) the emergence of the centralism is the starting point of human civilization, i.e., people in a society were organized hierarchically, 2) the inequality of wealth emerges simultaneously, this paper proposes a wealth distribution model based on the hidden tree structure from the viewpoint of complex network. This model considers the organized structure of people in a society as a hidden tree, and the cooperations among human beings as the transactions on the hidden tree, thereby explains the distribution of wealth. This model shows that the scale-free phenomenon of wealth distribution can be produced by the cascade controlling of human society, that is, the inequality of wealth can parasitize in the social organizations, such that any actions in eliminating the unequal wealth distribution would lead to the destroy of social or economic structures, resulting in the collapse of the economic system, therefore, would fail in vain. △ Less

Submitted 7 October, 2012; originally announced October 2012.

Journal ref: Romanian Journal of Physics 2013 58(7-8):778-789

Showing 1–16 of 16 results for author: Li, D