Search | arXiv e-print repository

Integrating LLM-Generated Views into Mean-Variance Optimization Using the Black-Litterman Model

Authors: Youngbin Lee, Yejin Kim, Suin Kim, Yongjae Lee

Abstract: Portfolio optimization faces challenges due to the sensitivity in traditional mean-variance models. The Black-Litterman model mitigates this by integrating investor views, but defining these views remains difficult. This study explores the integration of views generated by large language models (LLMs) into portfolio optimization using the Black-Litterman framework. Our method leverages LLMs to estimate expected stock returns from historical prices and company metadata, incorporating uncertainty through the variance in predictions. We conduct a backtest of the LLM-optimized portfolios from June 2024 to February 2025, rebalancing biweekly using the previous two weeks of price data. As baselines, we compare against the S&P 500, an equal-weighted portfolio, and a traditional mean-variance optimized portfolio constructed using the same set of stocks. Empirical results suggest that different LLMs exhibit varying levels of predictive optimism and confidence stability, which impact portfolio performance. The source code and data are available at https://github.com/youngandbin/LLM-MVO-BLM.

Submitted 19 April, 2025; originally announced April 2025.

Comments: Presented at the ICLR 2025 Workshop on Financial AI (https://sites.google.com/view/financialaiiclr25/home)

arXiv:2502.11701

A Cholesky decomposition-based asset selection heuristic for sparse tangent portfolio optimization

Authors: Hyunglip Bae, Haeun Jeon, Minsu Park, Yongjae Lee, Woo Chang Kim

Abstract: In practice, including a large number of assets in mean-variance portfolios can lead to higher transaction costs and management fees. To address this, one common approach is to select a smaller subset of assets from the larger pool, constructing more efficient portfolios. As a solution, we propose a new asset selection heuristic which generates a pre-defined list of asset candidates using a surrogate formulation and re-optimizes the cardinality-constrained tangent portfolio with these selected assets. This method enables faster optimization and effectively constructs portfolios with fewer assets, as demonstrated by numerical analyses on historical stock returns. Finally, we discuss a quantitative metric that can provide an initial assessment of the performance of the proposed heuristic based on asset covariance.

Submitted 17 February, 2025; originally announced February 2025.

Comments: 15 pages, 3 figures

arXiv:2502.00828

Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization

Authors: Yoontae Hwang, Yaxuan Kong, Stefan Zohren, Yongjae Lee

Abstract: This paper addresses the critical disconnect between prediction and decision quality in portfolio optimization by integrating Large Language Models (LLMs) with decision-focused learning. We demonstrate both theoretically and empirically that minimizing the prediction error alone leads to suboptimal portfolio decisions. We aim to exploit the representational power of LLMs for investment decisions. An attention mechanism processes asset relationships, temporal dependencies, and macro variables, which are then directly integrated into a portfolio optimization layer. This enables the model to capture complex market dynamics and align predictions with the decision objectives. Extensive experiments on S&P100 and DOW30 datasets show that our model consistently outperforms state-of-the-art deep learning models. In addition, gradient-based analyses show that our model prioritizes the assets most crucial to decision making, thus mitigating the effects of prediction errors on portfolio performance. These findings underscore the value of integrating decision objectives into predictions for more robust and context-aware portfolio management.

Submitted 2 February, 2025; originally announced February 2025.

Comments: Submitted paper

arXiv:2410.09850

Can GANs Learn the Stylized Facts of Financial Time Series?

Authors: Sohyeon Kwon, Yongjae Lee

Abstract: In the financial sector, a sophisticated financial time series simulator is essential for evaluating financial products and investment strategies. Traditional back-testing methods have mainly relied on historical data-driven approaches or mathematical model-driven approaches, such as various stochastic processes. However, in the current era of AI, data-driven approaches, where models learn the intrinsic characteristics of data directly, have emerged as promising techniques. Generative Adversarial Networks (GANs) have surfaced as promising generative models, capturing data distributions through adversarial learning. Financial time series, characterized by 'stylized facts' such as random walks, mean-reverting patterns, unexpected jumps, and time-varying volatility, present significant challenges for deep neural networks to learn their intrinsic characteristics. This study examines the ability of GANs to learn diverse and complex temporal patterns (i.e., stylized facts) of both univariate and multivariate financial time series. Our extensive experiments revealed that GANs can capture various stylized facts of financial time series, but their performance varies significantly depending on the choice of generator architecture. This suggests that naively applying GANs might not effectively capture the intricate characteristics inherent in financial time series, highlighting the importance of carefully considering and validating the modeling choices.

Submitted 13 October, 2024; originally announced October 2024.

arXiv:2409.09684

Anatomy of Machines for Markowitz: Decision-Focused Learning for Mean-Variance Portfolio Optimization

Authors: Junhyeong Lee, Inwoo Tae, Yongjae Lee

Abstract: Markowitz laid the foundation of portfolio theory through the mean-variance optimization (MVO) framework. However, the effectiveness of MVO is contingent on the precise estimation of expected returns, variances, and covariances of asset returns, which are typically uncertain. Machine learning models are becoming useful in estimating uncertain parameters, and such models are trained to minimize prediction errors, such as mean squared errors (MSE), which treat prediction errors uniformly across assets. Recent studies have pointed out that this approach would lead to suboptimal decisions and proposed Decision-Focused Learning (DFL) as a solution, integrating prediction and optimization to improve decision-making outcomes. While studies have shown DFL's potential to enhance portfolio performance, the detailed mechanisms of how DFL modifies prediction models for MVO remain unexplored. This study aims to investigate how DFL adjusts stock return prediction models to optimize decisions in MVO, addressing the question: "MSE treats the errors of all assets equally, but how does DFL reduce errors of different assets differently?" Answering this will provide crucial insights into optimal stock return prediction for constructing efficient portfolios.

Submitted 15 September, 2024; originally announced September 2024.

Comments: 7 pages, 3 figures, 3 tables

arXiv:2407.13751

Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management

Authors: Yoontae Hwang, Stefan Zohren, Yongjae Lee

Abstract: In the era of rapid globalization and digitalization, accurate identification of similar stocks has become increasingly challenging due to the non-stationary nature of financial markets and the ambiguity in conventional regional and sector classifications. To address these challenges, we examine SimStock, a novel temporal self-supervised learning framework that combines techniques from self-supervised learning (SSL) and temporal domain generalization to learn robust and informative representations of financial time series data. The primary focus of our study is to understand the similarities between stocks from a broader perspective, considering the complex dynamics of the global financial landscape. We conduct extensive experiments on four real-world datasets with thousands of stocks and demonstrate the effectiveness of SimStock in finding similar stocks, outperforming existing methods. The practical utility of SimStock is showcased through its application to various investment strategies, such as pairs trading, index tracking, and portfolio optimization, where it leads to superior performance compared to conventional methods. Our findings empirically examine the potential of a data-driven approach to enhance investment decision-making and risk management practices by leveraging the power of temporal self-supervised learning in the face of the ever-changing global financial landscape.

Submitted 18 July, 2024; originally announced July 2024.

arXiv:2404.07223

doi 10.1145/3677052.3698662

Stock Recommendations for Individual Investors: A Temporal Graph Network Approach with Mean-Variance Efficient Sampling

Authors: Youngbin Lee, Yejin Kim, Javier Sanz-Cruzado, Richard McCreadie, Yongjae Lee

Abstract: Recommender systems can be helpful for individuals to make well-informed decisions in complex financial markets. While many studies have focused on predicting stock prices, even advanced models fall short of accurately forecasting them. Additionally, previous studies indicate that individual investors often disregard established investment theories, favoring their personal preferences instead. This presents a challenge for stock recommendation systems, which must not only provide strong investment performance but also respect these individual preferences. To create effective stock recommender systems, three critical elements must be incorporated: 1) individual preferences, 2) portfolio diversification, and 3) the temporal dynamics of the first two. In response, we propose a new model, Portfolio Temporal Graph Network Recommender (PfoTGNRec), which can handle time-varying collaborative signals and incorporates diversification-enhancing sampling. On real-world individual trading data, our approach demonstrates superior performance compared to state-of-the-art baselines, including cutting-edge dynamic embedding models and existing stock recommendation models. Indeed, we show that PfoTGNRec is an effective solution that can balance customer preferences with the need to suggest portfolios with high Return-on-Investment. The source code and data are available at https://github.com/youngandbin/PfoTGNRec.

Submitted 30 November, 2024; v1 submitted 27 March, 2024; originally announced April 2024.

Comments: ICAIF 2024 (https://dl.acm.org/doi/10.1145/3677052.3698662)

arXiv:2404.00060

Temporal Graph Networks for Graph Anomaly Detection in Financial Networks

Authors: Yejin Kim, Youngbin Lee, Minyoung Choe, Sungju Oh, Yongjae Lee

Abstract: This paper explores the utilization of Temporal Graph Networks (TGN) for financial anomaly detection, a pressing need in the era of fintech and digitized financial transactions. We present a comprehensive framework that leverages TGN, capable of capturing dynamic changes in edges within financial networks, for fraud detection. Our study compares TGN's performance against static Graph Neural Network (GNN) baselines, as well as cutting-edge hypergraph neural network baselines, using the DGraph dataset for a realistic financial context. Our results demonstrate that TGN significantly outperforms other models in terms of AUC metrics. This superior performance underlines TGN's potential as an effective tool for detecting financial fraud, showcasing its ability to adapt to the dynamic and complex nature of modern financial systems. We also experimented with various graph embedding modules within the TGN framework and compared the effectiveness of each module. In conclusion, we demonstrated that, even with variations within TGN, it is possible to achieve good performance in the anomaly detection task.

Submitted 27 March, 2024; originally announced April 2024.

Comments: Presented at the AAAI 2024 Workshop on AI in Finance for Social Impact (https://sites.google.com/view/aifin-aaai2024)

arXiv:2403.18177

Growth rate of liquidity provider's wealth in G3Ms

Authors: Cheuk Yin Lee, Shen-Ning Tung, Tai-Ho Wang

Abstract: We study how trading fees and continuous-time arbitrage affect the profitability of liquidity providers (LPs) in Geometric Mean Market Makers (G3Ms). We use stochastic reflected diffusion processes to analyze the dynamics of a G3M model under the arbitrage-driven market. Our research focuses on calculating LP wealth and extends the findings of Tassy and White related to the constant product market maker (Uniswap v2) to a wider range of G3Ms, including Balancer. This allows us to calculate the long-term expected logarithmic growth of LP wealth, offering new insights into the complex dynamics of AMMs and their implications for LPs in decentralized finance.

Submitted 8 January, 2025; v1 submitted 26 March, 2024; originally announced March 2024.

Comments: 35 pages, 5 figures

MSC Class: 91G15

arXiv:cond-mat/0403662

doi 10.1103/PhysRevE.69.056107

Common Scaling Patterns in Intertrade Times of U. S. Stocks

Authors: Plamen Ch. Ivanov, Ainslie Yuen, Boris Podobnik, Youngki Lee

Abstract: We analyze the sequence of time intervals between consecutive stock trades of thirty companies representing eight sectors of the U. S. economy over a period of four years. For all companies we find that: (i) the probability density function of intertrade times may be fit by a Weibull distribution; (ii) when appropriately rescaled the probability densities of all companies collapse onto a single curve implying a universal functional form; (iii) the intertrade times exhibit power-law correlated behavior within a trading day and a consistently greater degree of correlation over larger time scales, in agreement with the correlation behavior of the absolute price returns for the corresponding company, and (iv) the magnitude series of intertrade time increments is characterized by long-range power-law correlations suggesting the presence of nonlinear features in the trading dynamics, while the sign series is anti-correlated at small scales. Our results suggest that independent of industry sector, market capitalization and average level of trading activity, the series of intertrade times exhibit possibly universal scaling patterns, which may relate to a common mechanism underlying the trading dynamics of diverse companies. Further, our observation of long-range power-law correlations and a parallel with the crossover in the scaling of absolute price returns for each individual stock, support the hypothesis that the dynamics of transaction times may play a role in the process of price formation.

Submitted 26 March, 2004; originally announced March 2004.

Comments: 8 pages, 5 figures. Presented at The Second Nikkei Econophysics Workshop, Tokyo, 11-14 Nov. 2002. A subset appears in "The Application of Econophysics: Proceedings of the Second Nikkei Econophysics Symposium", editor H. Takayasu (Springer-Verlag, Tokyo, 2003) pp.51-57. Submitted to Phys. Rev. E on 25 June 2003

arXiv:cond-mat/9910433

doi 10.1209/epl/i2000-00540-7

Systems with Correlations in the Variance: Generating Power-Law Tails in Probability Distributions

Authors: Boris Podobnik, Plamen Ch. Ivanov, Youngki Lee, Alessandro Chessa, H. Eugene Stanley

Abstract: We study how the presence of correlations in physical variables contributes to the form of probability distributions. We investigate a process with correlations in the variance generated by (i) a Gaussian or (ii) a truncated Lévy distribution. For both (i) and (ii), we find that due to the correlations in the variance, the process "dynamically" generates power-law tails in the distributions, whose exponents can be controlled through the way the correlations in the variance are introduced. For (ii), we find that the process can extend a truncated distribution beyond the truncation cutoff, which leads to a crossover between a Lévy stable power law and the present "dynamically-generated" power law. We show that the process can explain the crossover behavior recently observed in the S&P500 stock index.

Submitted 8 May, 2000; v1 submitted 26 October, 1999; originally announced October 1999.

Comments: 7 pages, five figures. To appear in Europhysics Letters (2000)

arXiv:cond-mat/9906381

doi 10.1209/epl/i2000-00464-8

Scale-invariant Truncated Lévy Process

Authors: Boris Podobnik, Plamen Ch. Ivanov, Youngki Lee, H. Eugene Stanley

Abstract: We develop a scale-invariant truncated Lévy (STL) process to describe physical systems characterized by correlated stochastic variables. The STL process exhibits Lévy stability for the probability density, and hence shows scaling properties (as observed in empirical data); it has the advantage that all moments are finite (and so accounts for the empirical scaling of the moments). To test the potential utility of the STL process, we analyze financial data.

Submitted 29 June, 1999; v1 submitted 25 June, 1999; originally announced June 1999.

Comments: 4 pages, 6 figures; corrected typos; changed figure 3

arXiv:cond-mat/9804100

doi 10.1103/PhysRevLett.81.3275

Universal features in the growth dynamics of complex organizations

Authors: Youngki Lee, Luis A. N. Amaral, David Canning, Martin Meyer, H. Eugene Stanley

Abstract: We analyze the fluctuations in the gross domestic product (GDP) of 152 countries for the period 1950–1992. We find that (i) the distribution of annual growth rates for countries of a given GDP decays with "fatter" tails than for a Gaussian, and (ii) the width of the distribution scales as a power law of GDP with a scaling exponent $\beta \approx 0.15$. Both findings are in surprising agreement with results on firm growth. These results are consistent with the hypothesis that the evolution of organizations with complex structure is governed by similar growth mechanisms.

Submitted 8 April, 1998; originally announced April 1998.

Comments: 4 pages, 7 ps figures, using Latex2e with epsf rotate and multicol style files. Submitted to PRL

Showing 1–13 of 13 results for author: Lee, Y