-
Trading Under Uncertainty: A Distribution-Based Strategy for Futures Markets Using FutureQuant Transformer
Authors:
Wenhao Guo,
Yuda Wang,
Zeqiao Huang,
Changjiang Zhang,
Shumin ma
Abstract:
In the complex landscape of traditional futures trading, where vast data and variables like real-time Limit Order Books (LOB) complicate price predictions, we introduce the FutureQuant Transformer model, leveraging attention mechanisms to navigate these challenges. Unlike conventional models focused on point predictions, the FutureQuant model excels in forecasting the range and volatility of futur…
▽ More
In the complex landscape of traditional futures trading, where vast data and variables like real-time Limit Order Books (LOB) complicate price predictions, we introduce the FutureQuant Transformer model, leveraging attention mechanisms to navigate these challenges. Unlike conventional models focused on point predictions, the FutureQuant model excels in forecasting the range and volatility of future prices, thus offering richer insights for trading strategies. Its ability to parse and learn from intricate market patterns allows for enhanced decision-making, significantly improving risk management and achieving a notable average gain of 0.1193% per 30-minute trade over state-of-the-art models with a simple algorithm using factors such as RSI, ATR, and Bollinger Bands. This innovation marks a substantial leap forward in predictive analytics within the volatile domain of futures trading.
△ Less
Submitted 8 May, 2025;
originally announced May 2025.
-
Quantum Walks-Based Adaptive Distribution Generation with Efficient CUDA-Q Acceleration
Authors:
Yen-Jui Chang,
Wei-Ting Wang,
Chen-Yu Liu,
Yun-Yuan Wang,
Ching-Ray Chang
Abstract:
We present a novel Adaptive Distribution Generator that leverages a quantum walks-based approach to generate high precision and efficiency of target probability distributions. Our method integrates variational quantum circuits with discrete-time quantum walks, specifically, split-step quantum walks and their entangled extensions, to dynamically tune coin parameters and drive the evolution of quant…
▽ More
We present a novel Adaptive Distribution Generator that leverages a quantum walks-based approach to generate high precision and efficiency of target probability distributions. Our method integrates variational quantum circuits with discrete-time quantum walks, specifically, split-step quantum walks and their entangled extensions, to dynamically tune coin parameters and drive the evolution of quantum states towards desired distributions. This enables accurate one-dimensional probability modeling for applications such as financial simulation and structured two-dimensional pattern generation exemplified by digit representations(0~9). Implemented within the CUDA-Q framework, our approach exploits GPU acceleration to significantly reduce computational overhead and improve scalability relative to conventional methods. Extensive benchmarks demonstrate that our Quantum Walks-Based Adaptive Distribution Generator achieves high simulation fidelity and bridges the gap between theoretical quantum algorithms and practical high-performance computation.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
A note on time-inconsistent stochastic control problems with higher-order moments
Authors:
Yike Wang
Abstract:
In this paper, we extend the research on time-consistent stochastic control problems with higher-order moments, as formulated by [Y. Wang et al. SIAM J. Control. Optim., 63 (2025), in press]. We consider a linear controlled dynamic equation with state-dependent diffusion, and let the sum of a conventional mean-variance utility and a fairly general function of higher-order central moments be the ob…
▽ More
In this paper, we extend the research on time-consistent stochastic control problems with higher-order moments, as formulated by [Y. Wang et al. SIAM J. Control. Optim., 63 (2025), in press]. We consider a linear controlled dynamic equation with state-dependent diffusion, and let the sum of a conventional mean-variance utility and a fairly general function of higher-order central moments be the objective functional. We obtain both the sufficiency and necessity of the equilibrium condition for an open-loop Nash equilibrium control (ONEC), under some continuity and integrability assumptions that are more relaxed and natural than those employed before. Notably, we derive an extended version of the stochastic Lebesgue differentiation theorem for necessity, because the equilibrium condition is represented by some diagonal processes generated by a flow of backward stochastic differential equations whose the data do not necessarily satisfy the usual square-integrability. Based on the derived equilibrium condition, we obtain the algebra equation for a deterministic ONEC. In particular, we find that the mean-variance equilibrium strategy is an ONEC for our higher-order moment problem if and only if the objective functional satisfies a homogeneity condition.
△ Less
Submitted 5 April, 2025;
originally announced April 2025.
-
Rough Heston model as the scaling limit of bivariate cumulative heavy-tailed INAR($\infty$) processes and applications
Authors:
Yingli Wang,
Zhenyu Cui
Abstract:
This paper establishes a novel link between nearly unstable cumulative heavy-tailed integer-valued autoregressive (INAR($\infty$)) processes and the rough Heston model via discrete scaling limits. We prove that a sequence of bivariate cumulative INAR($\infty$) processes converge in law to the rough Heston model under appropriate scaling conditions, providing a rigorous mathematical foundation for…
▽ More
This paper establishes a novel link between nearly unstable cumulative heavy-tailed integer-valued autoregressive (INAR($\infty$)) processes and the rough Heston model via discrete scaling limits. We prove that a sequence of bivariate cumulative INAR($\infty$) processes converge in law to the rough Heston model under appropriate scaling conditions, providing a rigorous mathematical foundation for understanding how microstructural order flow drives macroscopic prices following rough volatility dynamics. Our theoretical framework extends the scaling limit techniques from Hawkes processes to the INAR($\infty$) setting. Hence we can carry out efficient Monte Carlo simulation of the rough Heston model through simulating the corresponding approximating INAR($\infty$) processes, which provides an alternative discrete-time simulation method to the Euler-Maruyama method. Extensive numerical experiments illustrate the improved accuracy and efficiency of the proposed simulation scheme as compared to the literature, in the valuation of European options, and also path-dependent options such as arithmetic Asian options, lookback options and barrier options.
△ Less
Submitted 9 April, 2025; v1 submitted 23 March, 2025;
originally announced March 2025.
-
Assessing Uncertainty in Stock Returns: A Gaussian Mixture Distribution-Based Method
Authors:
Yanlong Wang,
Jian Xu,
Shao-Lun Huang,
Danny Dongning Sun,
Xiao-Ping Zhang
Abstract:
This study seeks to advance the understanding and prediction of stock market return uncertainty through the application of advanced deep learning techniques. We introduce a novel deep learning model that utilizes a Gaussian mixture distribution to capture the complex, time-varying nature of asset return distributions in the Chinese stock market. By incorporating the Gaussian mixture distribution,…
▽ More
This study seeks to advance the understanding and prediction of stock market return uncertainty through the application of advanced deep learning techniques. We introduce a novel deep learning model that utilizes a Gaussian mixture distribution to capture the complex, time-varying nature of asset return distributions in the Chinese stock market. By incorporating the Gaussian mixture distribution, our approach effectively characterizes short-term fluctuations and non-traditional features of stock returns, such as skewness and heavy tails, that are often overlooked by traditional models. Compared to GARCH models and their variants, our method demonstrates superior performance in volatility estimation, particularly during periods of heightened market volatility. It provides more accurate volatility forecasts and offers unique risk insights for different assets, thereby deepening the understanding of return uncertainty. Additionally, we propose a novel use of Code embedding which utilizes a bag-of-words approach to train hidden representations of stock codes and transforms the uncertainty attributes of stocks into high-dimensional vectors. These vectors are subsequently reduced to two dimensions, allowing the observation of similarity among different stocks. This visualization facilitates the identification of asset clusters with similar risk profiles, offering valuable insights for portfolio management and risk mitigation. Since we predict the uncertainty of returns by estimating their latent distribution, it is challenging to evaluate the return distribution when the true distribution is unobservable. However, we can measure it through the CRPS to assess how well the predicted distribution matches the true returns, and through MSE and QLIKE metrics to evaluate the error between the volatility level of the predicted distribution and proxy measures of true volatility.
△ Less
Submitted 10 March, 2025;
originally announced March 2025.
-
FinTSBridge: A New Evaluation Suite for Real-world Financial Prediction with Advanced Time Series Models
Authors:
Yanlong Wang,
Jian Xu,
Tiantian Gao,
Hongkang Zhang,
Shao-Lun Huang,
Danny Dongning Sun,
Xiao-Ping Zhang
Abstract:
Despite the growing attention to time series forecasting in recent years, many studies have proposed various solutions to address the challenges encountered in time series prediction, aiming to improve forecasting performance. However, effectively applying these time series forecasting models to the field of financial asset pricing remains a challenging issue. There is still a need for a bridge to…
▽ More
Despite the growing attention to time series forecasting in recent years, many studies have proposed various solutions to address the challenges encountered in time series prediction, aiming to improve forecasting performance. However, effectively applying these time series forecasting models to the field of financial asset pricing remains a challenging issue. There is still a need for a bridge to connect cutting-edge time series forecasting models with financial asset pricing. To bridge this gap, we have undertaken the following efforts: 1) We constructed three datasets from the financial domain; 2) We selected over ten time series forecasting models from recent studies and validated their performance in financial time series; 3) We developed new metrics, msIC and msIR, in addition to MSE and MAE, to showcase the time series correlation captured by the models; 4) We designed financial-specific tasks for these three datasets and assessed the practical performance and application potential of these forecasting models in important financial problems. We hope the developed new evaluation suite, FinTSBridge, can provide valuable insights into the effectiveness and robustness of advanced forecasting models in finanical domains.
△ Less
Submitted 10 March, 2025;
originally announced March 2025.
-
Time-consistent portfolio selection with strictly monotone mean-variance preference
Authors:
Yike Wang,
Yusha Chen
Abstract:
This paper is devoted to time-consistent control problems of portfolio selection with strictly monotone mean-variance preferences. These preferences are variational modifications of the conventional mean-variance preferences, and remain time-inconsistent as in mean-variance optimization problems. To tackle the time-inconsistency, we study the Nash equilibrium controls of both the open-loop type an…
▽ More
This paper is devoted to time-consistent control problems of portfolio selection with strictly monotone mean-variance preferences. These preferences are variational modifications of the conventional mean-variance preferences, and remain time-inconsistent as in mean-variance optimization problems. To tackle the time-inconsistency, we study the Nash equilibrium controls of both the open-loop type and the closed-loop type, and characterize them within a random parameter setting. The problem is reduced to solving a flow of forward-backward stochastic differential equations for open-loop equilibria, and to solving extended Hamilton-Jacobi-Bellman equations for closed-loop equilibria. In particular, we derive semi-closed-form solutions for these two types of equilibria under a deterministic parameter setting. Both solutions are represented by the same function, which is independent of wealth state and random path. This function can be expressed as the conventional time-consistent mean-variance portfolio strategy multiplied by a factor greater than one. Furthermore, we find that the state-independent closed-loop Nash equilibrium control is a strong equilibrium strategy in a constant parameter setting only when the interest rate is sufficiently large.
△ Less
Submitted 16 February, 2025;
originally announced February 2025.
-
Digital transformation: A systematic review and bibliometric analysis from the corporate finance perspective
Authors:
Ping Zhang,
Yiru Wang
Abstract:
Digital transformation significantly impacts firm investment, financing, and value enhancement. A systematic investigation from the corporate finance perspective has not yet been formed. This paper combines bibliometric and content analysis methods to systematically review the evolutionary trend, status quo, hotspots and overall structure of research in digital transformation from 2011 to 2024. Th…
▽ More
Digital transformation significantly impacts firm investment, financing, and value enhancement. A systematic investigation from the corporate finance perspective has not yet been formed. This paper combines bibliometric and content analysis methods to systematically review the evolutionary trend, status quo, hotspots and overall structure of research in digital transformation from 2011 to 2024. The study reveals an emerging and rapidly growing focus on digital transformation research, particularly in developed countries. We categorize the literature into three areas according to bibliometric clustering: the measurements (qualitative and quantitative), impact factors (internal and external), and the economic consequences (investment, financing, and firm value). These areas are divided into ten sub-branches, with a detailed literature review. We also review the existing theories related to digital transformation, identify the current gaps in these papers, and provide directions for future research on each sub-branches.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications
Authors:
Yuhan Wang,
Zhen Xu,
Yue Yao,
Jinsong Liu,
Jiating Lin
Abstract:
With the development of the financial industry, credit default prediction, as an important task in financial risk management, has received increasing attention. Traditional credit default prediction methods mostly rely on machine learning models, such as decision trees and random forests, but these methods have certain limitations in processing complex data and capturing potential risk patterns. T…
▽ More
With the development of the financial industry, credit default prediction, as an important task in financial risk management, has received increasing attention. Traditional credit default prediction methods mostly rely on machine learning models, such as decision trees and random forests, but these methods have certain limitations in processing complex data and capturing potential risk patterns. To this end, this paper proposes a deep learning model based on the combination of convolutional neural networks (CNN) and Transformer for credit user default prediction. The model combines the advantages of CNN in local feature extraction with the ability of Transformer in global dependency modeling, effectively improving the accuracy and robustness of credit default prediction. Through experiments on public credit default datasets, the results show that the CNN+Transformer model outperforms traditional machine learning models, such as random forests and XGBoost, in multiple evaluation indicators such as accuracy, AUC, and KS value, demonstrating its powerful ability in complex financial data modeling. Further experimental analysis shows that appropriate optimizer selection and learning rate adjustment play a vital role in improving model performance. In addition, the ablation experiment of the model verifies the advantages of the combination of CNN and Transformer and proves the complementarity of the two in credit default prediction. This study provides a new idea for credit default prediction and provides strong support for risk assessment and intelligent decision-making in the financial field. Future research can further improve the prediction effect and generalization ability by introducing more unstructured data and improving the model architecture.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
Strictly monotone mean-variance preferences with dynamic portfolio management
Authors:
Yike Wang,
Yusha Chen
Abstract:
This paper is devoted to extending the monotone mean-variance (MMV) preference to a large class of strictly monotone mean-variance (SMMV) preferences, and illustrating its application to single-period/continuous-time portfolio selection problems. The properties and equivalent representations of the SMMV preference are also studied. To illustrate applications, we provide the gradient condition for…
▽ More
This paper is devoted to extending the monotone mean-variance (MMV) preference to a large class of strictly monotone mean-variance (SMMV) preferences, and illustrating its application to single-period/continuous-time portfolio selection problems. The properties and equivalent representations of the SMMV preference are also studied. To illustrate applications, we provide the gradient condition for the single-period portfolio problem with SMMV preferences, and investigate its association with the optimal mean-variance static strategy. For the continuous-time portfolio problem with SMMV preferences and continuous price processes, we show the condition that the solution is the same as the corresponding optimal mean-variance strategy. When this consistency condition is not satisfied, the primal problems are unbounded, and we turn to study a sequence of approximate linear-quadratic problems generated by penalty function method. The solution can be characterized by stochastic Hamilton-Jacobi-Bellman-Isaacs equation, but it is still difficult to derive a closed-form expression. We take a joint adoption of embedding method and convex duality method to derive an analytical solution. In particular, if the parameter that characterizes the strict monotonicity of SMMV preference is a constant, the solution can be given by two equations in the form of Black-Scholes formula.
△ Less
Submitted 18 December, 2024;
originally announced December 2024.
-
On stochastic control problems with higher-order moments
Authors:
Yike Wang,
Jingzhen Liu,
Alain Bensoussan,
Ka-Fai Cedric Yiu,
Jiaqin Wei
Abstract:
In this paper, we focus on a class of time-inconsistent stochastic control problems, where the objective function includes the mean and several higher-order central moments of the terminal value of state. To tackle the time-inconsistency, we seek both the closed-loop and the open-loop Nash equilibrium controls as time-consistent solutions. We establish a partial differential equation (PDE) system…
▽ More
In this paper, we focus on a class of time-inconsistent stochastic control problems, where the objective function includes the mean and several higher-order central moments of the terminal value of state. To tackle the time-inconsistency, we seek both the closed-loop and the open-loop Nash equilibrium controls as time-consistent solutions. We establish a partial differential equation (PDE) system for deriving a closed-loop Nash equilibrium control, which does not include the equilibrium value function and is different from the extended Hamilton-Jacobi-Bellman (HJB) equations as in Björk et al. (Finance Stoch. 21: 331-360, 2017). We show that our PDE system is equivalent to the extended HJB equations that seems difficult to be solved for our higher-order moment problems. In deriving an open-loop Nash equilibrium control, due to the non-separable higher-order moments in the objective function, we make some moment estimates in addition to the standard perturbation argument for developing a maximum principle. Then, the problem is reduced to solving a flow of forward-backward stochastic differential equations. In particular, we investigate linear controlled dynamics and some objective functions affine in the mean. The closed-loop and the open-loop Nash equilibrium controls are identical, which are independent of the state value, random path and the preference on the odd-order central moments. By sending the highest order of central moments to infinity, we obtain the time-consistent solutions to some control problems whose objective functions include some penalty functions for deviation.
△ Less
Submitted 30 January, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Research on Optimizing Real-Time Data Processing in High-Frequency Trading Algorithms using Machine Learning
Authors:
Yuxin Fan,
Zhuohuan Hu,
Lei Fu,
Yu Cheng,
Liyang Wang,
Yuxiang Wang
Abstract:
High-frequency trading (HFT) represents a pivotal and intensely competitive domain within the financial markets. The velocity and accuracy of data processing exert a direct influence on profitability, underscoring the significance of this field. The objective of this work is to optimise the real-time processing of data in high-frequency trading algorithms. The dynamic feature selection mechanism i…
▽ More
High-frequency trading (HFT) represents a pivotal and intensely competitive domain within the financial markets. The velocity and accuracy of data processing exert a direct influence on profitability, underscoring the significance of this field. The objective of this work is to optimise the real-time processing of data in high-frequency trading algorithms. The dynamic feature selection mechanism is responsible for monitoring and analysing market data in real time through clustering and feature weight analysis, with the objective of automatically selecting the most relevant features. This process employs an adaptive feature extraction method, which enables the system to respond and adjust its feature set in a timely manner when the data input changes, thus ensuring the efficient utilisation of data. The lightweight neural networks are designed in a modular fashion, comprising fast convolutional layers and pruning techniques that facilitate the expeditious completion of data processing and output prediction. In contrast to conventional deep learning models, the neural network architecture has been specifically designed to minimise the number of parameters and computational complexity, thereby markedly reducing the inference time. The experimental results demonstrate that the model is capable of maintaining consistent performance in the context of varying market conditions, thereby illustrating its advantages in terms of processing speed and revenue enhancement.
△ Less
Submitted 1 December, 2024;
originally announced December 2024.
-
TIMeSynC: Temporal Intent Modelling with Synchronized Context Encodings for Financial Service Applications
Authors:
Dwipam Katariya,
Juan Manuel Origgi,
Yage Wang,
Thomas Caputo
Abstract:
Users engage with financial services companies through multiple channels, often interacting with mobile applications, web platforms, call centers, and physical locations to service their accounts. The resulting interactions are recorded at heterogeneous temporal resolutions across these domains. This multi-channel data can be combined and encoded to create a comprehensive representation of the cus…
▽ More
Users engage with financial services companies through multiple channels, often interacting with mobile applications, web platforms, call centers, and physical locations to service their accounts. The resulting interactions are recorded at heterogeneous temporal resolutions across these domains. This multi-channel data can be combined and encoded to create a comprehensive representation of the customer's journey for accurate intent prediction. This demands sequential learning solutions. NMT transformers achieve state-of-the-art sequential representation learning by encoding context and decoding for the next best action to represent long-range dependencies. However, three major challenges exist while combining multi-domain sequences within an encoder-decoder transformers architecture for intent prediction applications: a) aligning sequences with different sampling rates b) learning temporal dynamics across multi-variate, multi-domain sequences c) combining dynamic and static sequences. We propose an encoder-decoder transformer model to address these challenges for contextual and sequential intent prediction in financial servicing applications. Our experiments show significant improvement over the existing tabular method.
△ Less
Submitted 3 February, 2025; v1 submitted 1 October, 2024;
originally announced October 2024.
-
KANOP: A Data-Efficient Option Pricing Model using Kolmogorov-Arnold Networks
Authors:
Rushikesh Handal,
Kazuki Matoya,
Yunzhuo Wang,
Masanori Hirano
Abstract:
Inspired by the recently proposed Kolmogorov-Arnold Networks (KANs), we introduce the KAN-based Option Pricing (KANOP) model to value American-style options, building on the conventional Least Square Monte Carlo (LSMC) algorithm. KANs, which are based on Kolmogorov-Arnold representation theorem, offer a data-efficient alternative to traditional Multi-Layer Perceptrons, requiring fewer hidden layer…
▽ More
Inspired by the recently proposed Kolmogorov-Arnold Networks (KANs), we introduce the KAN-based Option Pricing (KANOP) model to value American-style options, building on the conventional Least Square Monte Carlo (LSMC) algorithm. KANs, which are based on Kolmogorov-Arnold representation theorem, offer a data-efficient alternative to traditional Multi-Layer Perceptrons, requiring fewer hidden layers to achieve a higher level of performance. By leveraging the flexibility of KANs, KANOP provides a learnable alternative to the conventional set of basis functions used in the LSMC model, allowing the model to adapt to the pricing task and effectively estimate the expected continuation value. Using examples of standard American and Asian-American options, we demonstrate that KANOP produces more reliable option value estimates, both for single-dimensional cases and in more complex scenarios involving multiple input variables. The delta estimated by the KANOP model is also more accurate than that obtained using conventional basis functions, which is crucial for effective option hedging. Graphical illustrations further validate KANOP's ability to accurately model the expected continuation value for American-style options.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Ethereum Fraud Detection via Joint Transaction Language Model and Graph Representation Learning
Authors:
Jianguo Sun,
Yifan Jia,
Yanbin Wang,
Yiwei Liu,
Zhang Sheng,
Ye Tian
Abstract:
Ethereum faces growing fraud threats. Current fraud detection methods, whether employing graph neural networks or sequence models, fail to consider the semantic information and similarity patterns within transactions. Moreover, these approaches do not leverage the potential synergistic benefits of combining both types of models. To address these challenges, we propose TLMG4Eth that combines a tran…
▽ More
Ethereum faces growing fraud threats. Current fraud detection methods, whether employing graph neural networks or sequence models, fail to consider the semantic information and similarity patterns within transactions. Moreover, these approaches do not leverage the potential synergistic benefits of combining both types of models. To address these challenges, we propose TLMG4Eth that combines a transaction language model with graph-based methods to capture semantic, similarity, and structural features of transaction data in Ethereum. We first propose a transaction language model that converts numerical transaction data into meaningful transaction sentences, enabling the model to learn explicit transaction semantics. Then, we propose a transaction attribute similarity graph to learn transaction similarity information, enabling us to capture intuitive insights into transaction anomalies. Additionally, we construct an account interaction graph to capture the structural information of the account transaction network. We employ a deep multi-head attention network to fuse transaction semantic and similarity embeddings, and ultimately propose a joint training approach for the multi-head attention network and the account interaction graph to obtain the synergistic benefits of both.
△ Less
Submitted 18 February, 2025; v1 submitted 9 September, 2024;
originally announced September 2024.
-
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
Authors:
Jimin Huang,
Mengxi Xiao,
Dong Li,
Zihao Jiang,
Yuzhe Yang,
Yifei Zhang,
Lingfei Qian,
Yan Wang,
Xueqing Peng,
Yang Ren,
Ruoyu Xiang,
Zhengyu Chen,
Xiao Zhang,
Yueru He,
Weiguang Han,
Shunian Chen,
Lihang Shen,
Daniel Kim,
Yangyang Yu,
Yupeng Cao,
Zhiyang Deng,
Haohang Li,
Duanyu Feng,
Yongfu Dai,
VijayaSai Somasundaram
, et al. (19 additional authors not shown)
Abstract:
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t…
▽ More
Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses.
△ Less
Submitted 2 April, 2025; v1 submitted 20 August, 2024;
originally announced August 2024.
-
Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach
Authors:
Haowei Ni,
Shuchen Meng,
Xupeng Chen,
Ziqing Zhao,
Andi Chen,
Panfeng Li,
Shiyao Zhang,
Qifu Yin,
Yuanqing Wang,
Yuxi Chan
Abstract:
Accurate stock market predictions following earnings reports are crucial for investors. Traditional methods, particularly classical machine learning models, struggle with these predictions because they cannot effectively process and interpret extensive textual data contained in earnings reports and often overlook nuances that influence market movements. This paper introduces an advanced approach b…
▽ More
Accurate stock market predictions following earnings reports are crucial for investors. Traditional methods, particularly classical machine learning models, struggle with these predictions because they cannot effectively process and interpret extensive textual data contained in earnings reports and often overlook nuances that influence market movements. This paper introduces an advanced approach by employing Large Language Models (LLMs) instruction fine-tuned with a novel combination of instruction-based techniques and quantized low-rank adaptation (QLoRA) compression. Our methodology integrates 'base factors', such as financial metric growth and earnings transcripts, with 'external factors', including recent market indices performances and analyst grades, to create a rich, supervised dataset. This comprehensive dataset enables our models to achieve superior predictive performance in terms of accuracy, weighted F1, and Matthews correlation coefficient (MCC), especially evident in the comparison with benchmarks such as GPT-4. We specifically highlight the efficacy of the llama-3-8b-Instruct-4bit model, which showcases significant improvements over baseline models. The paper also discusses the potential of expanding the output capabilities to include a 'Hold' option and extending the prediction horizon, aiming to accommodate various investment styles and time frames. This study not only demonstrates the power of integrating cutting-edge AI with fine-tuned financial data but also paves the way for future research in enhancing AI-driven financial analysis tools.
△ Less
Submitted 12 November, 2024; v1 submitted 13 August, 2024;
originally announced August 2024.
-
Quantifying the Blockchain Trilemma: A Comparative Analysis of Algorand, Ethereum 2.0, and Beyond
Authors:
Yihang Fu,
Mingwei Jing,
Jiaolun Zhou,
Peilin Wu,
Ye Wang,
Luyao Zhang,
Chuang Hu
Abstract:
Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets. However, its potential is constrained by the "Blockchain Trilemma," which necessitates balancing decentralization, security, and scalability. This study evaluates and compares two leading proof-of-stake (PoS) systems, Algorand and Ethereum 2.0, against the…
▽ More
Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets. However, its potential is constrained by the "Blockchain Trilemma," which necessitates balancing decentralization, security, and scalability. This study evaluates and compares two leading proof-of-stake (PoS) systems, Algorand and Ethereum 2.0, against these critical metrics. Our research interprets existing indices to measure decentralization, evaluates scalability through transactional data, and assesses security by identifying potential vulnerabilities. Utilizing real-world data, we analyze each platform's strategies in a structured manner to understand their effectiveness in addressing trilemma challenges. The findings highlight each platform's strengths and propose general methodologies for evaluating key blockchain characteristics applicable to other systems. This research advances the understanding of blockchain technologies and their implications for the future digital economy. Data and code are available on GitHub as open source.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance
Authors:
Xi-Ning Zhuang,
Zhao-Yun Chen,
Cheng Xue,
Xiao-Fan Xu,
Chao Wang,
Huan-Yu Liu,
Tai-Ping Sun,
Yun-Jie Wang,
Yu-Chun Wu,
Guo-Ping Guo
Abstract:
Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-i…
▽ More
Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-informed parameterized quantum circuit (SI-PQC) for efficiently preparing and training of quantum computational statistical models, including arbitrary distributions and their weighted mixtures. The SI-PQC features a static structure with trainable parameters, enabling in-depth optimized circuit compilation, exponential reductions in resource and time consumption, and improved trainability and interpretability for learning quantum states and classical model parameters simultaneously. As an efficient subroutine for preparing and learning in various quantum algorithms, the SI-PQC addresses the input bottleneck and facilitates the injection of prior knowledge.
△ Less
Submitted 18 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
A K-means Algorithm for Financial Market Risk Forecasting
Authors:
Jinxin Xu,
Kaixian Xu,
Yue Wang,
Qinyan Shen,
Ruisi Li
Abstract:
Financial market risk forecasting involves applying mathematical models, historical data analysis and statistical methods to estimate the impact of future market movements on investments. This process is crucial for investors to develop strategies, financial institutions to manage assets and regulators to formulate policy. In today's society, there are problems of high error rate and low precision…
▽ More
Financial market risk forecasting involves applying mathematical models, historical data analysis and statistical methods to estimate the impact of future market movements on investments. This process is crucial for investors to develop strategies, financial institutions to manage assets and regulators to formulate policy. In today's society, there are problems of high error rate and low precision in financial market risk prediction, which greatly affect the accuracy of financial market risk prediction. K-means algorithm in machine learning is an effective risk prediction technique for financial market. This study uses K-means algorithm to develop a financial market risk prediction system, which significantly improves the accuracy and efficiency of financial market risk prediction. Ultimately, the outcomes of the experiments confirm that the K-means algorithm operates with user-friendly simplicity and achieves a 94.61% accuracy rate
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
RVRAE: A Dynamic Factor Model Based on Variational Recurrent Autoencoder for Stock Returns Prediction
Authors:
Yilun Wang,
Shengjie Guo
Abstract:
In recent years, the dynamic factor model has emerged as a dominant tool in economics and finance, particularly for investment strategies. This model offers improved handling of complex, nonlinear, and noisy market conditions compared to traditional static factor models. The advancement of machine learning, especially in dealing with nonlinear data, has further enhanced asset pricing methodologies…
▽ More
In recent years, the dynamic factor model has emerged as a dominant tool in economics and finance, particularly for investment strategies. This model offers improved handling of complex, nonlinear, and noisy market conditions compared to traditional static factor models. The advancement of machine learning, especially in dealing with nonlinear data, has further enhanced asset pricing methodologies. This paper introduces a groundbreaking dynamic factor model named RVRAE. This model is a probabilistic approach that addresses the temporal dependencies and noise in market data. RVRAE ingeniously combines the principles of dynamic factor modeling with the variational recurrent autoencoder (VRAE) from deep learning. A key feature of RVRAE is its use of a prior-posterior learning method. This method fine-tunes the model's learning process by seeking an optimal posterior factor model informed by future data. Notably, RVRAE is adept at risk modeling in volatile stock markets, estimating variances from latent space distributions while also predicting returns. Our empirical tests with real stock market data underscore RVRAE's superior performance compared to various established baseline methods.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Learning the Market: Sentiment-Based Ensemble Trading Agents
Authors:
Andrew Ye,
James Xu,
Vidyut Veedgav,
Yi Wang,
Yifan Yu,
Daniel Yan,
Ryan Chen,
Vipin Chaudhary,
Shuai Xu
Abstract:
We propose and study the integration of sentiment analysis and deep reinforcement learning ensemble algorithms for stock trading by evaluating strategies capable of dynamically altering their active agent given the concurrent market environment. In particular, we design a simple-yet-effective method for extracting financial sentiment and combine this with improvements on existing trading agents, r…
▽ More
We propose and study the integration of sentiment analysis and deep reinforcement learning ensemble algorithms for stock trading by evaluating strategies capable of dynamically altering their active agent given the concurrent market environment. In particular, we design a simple-yet-effective method for extracting financial sentiment and combine this with improvements on existing trading agents, resulting in a strategy that effectively considers both qualitative market factors and quantitative stock data. We show that our approach results in a strategy that is profitable, robust, and risk-minimal - outperforming the traditional ensemble strategy as well as single agent algorithms and market metrics. Our findings suggest that the conventional practice of switching and reevaluating agents in ensemble every fixed-number of months is sub-optimal, and that a dynamic sentiment-based framework greatly unlocks additional performance. Furthermore, as we have designed our algorithm with simplicity and efficiency in mind, we hypothesize that the transition of our method from historical evaluation towards real-time trading with live data to be relatively simple.
△ Less
Submitted 20 November, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Shai: A large language model for asset management
Authors:
Zhongyang Guo,
Guanran Jiang,
Zhongdan Zhang,
Peng Li,
Zhefeng Wang,
Yinchun Wang
Abstract:
This paper introduces "Shai" a 10B level large language model specifically designed for the asset management industry, built upon an open-source foundational model. With continuous pre-training and fine-tuning using a targeted corpus, Shai demonstrates enhanced performance in tasks relevant to its domain, outperforming baseline models. Our research includes the development of an innovative evaluat…
▽ More
This paper introduces "Shai" a 10B level large language model specifically designed for the asset management industry, built upon an open-source foundational model. With continuous pre-training and fine-tuning using a targeted corpus, Shai demonstrates enhanced performance in tasks relevant to its domain, outperforming baseline models. Our research includes the development of an innovative evaluation framework, which integrates professional qualification exams, tailored tasks, open-ended question answering, and safety assessments, to comprehensively assess Shai's capabilities. Furthermore, we discuss the challenges and implications of utilizing large language models like GPT-4 for performance assessment in asset management, suggesting a combination of automated evaluation and human judgment. Shai's development, showcasing the potential and versatility of 10B-level large language models in the financial sector with significant performance and modest computational requirements, hopes to provide practical insights and methodologies to assist industry peers in their similar endeavors.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Predictable Relative Forward Performance Processes: Multi-Agent and Mean Field Games for Portfolio Management
Authors:
Gechun Liang,
Moris S. Strub,
Yuwei Wang
Abstract:
We consider a new framework of predictable relative forward performance processes (PRFPP) to study portfolio management within a competitive environment. Each agent trades a distinct stock following a binomial distribution with probabilities for a positive return depending on the market regime characterized by a binomial common noise. For both the finite population and mean field games, we constru…
▽ More
We consider a new framework of predictable relative forward performance processes (PRFPP) to study portfolio management within a competitive environment. Each agent trades a distinct stock following a binomial distribution with probabilities for a positive return depending on the market regime characterized by a binomial common noise. For both the finite population and mean field games, we construct and analyse PRFPPs for initial data of the CARA class along with the associated equilibrium strategies. We find that relative performance concerns do not necessarily lead to more investment in the risky asset. Under some parameter constellations, agents short a stock with positive expected excess return.
△ Less
Submitted 2 December, 2023; v1 submitted 8 November, 2023;
originally announced November 2023.
-
Topological Portfolio Selection and Optimization
Authors:
Yuanrong Wang,
Antonio Briola,
Tomaso Aste
Abstract:
Modern portfolio optimization is centered around creating a low-risk portfolio with extensive asset diversification. Following the seminal work of Markowitz, optimal asset allocation can be computed using a constrained optimization model based on empirical covariance. However, covariance is typically estimated from historical lookback observations, and it is prone to noise and may inadequately rep…
▽ More
Modern portfolio optimization is centered around creating a low-risk portfolio with extensive asset diversification. Following the seminal work of Markowitz, optimal asset allocation can be computed using a constrained optimization model based on empirical covariance. However, covariance is typically estimated from historical lookback observations, and it is prone to noise and may inadequately represent future market behavior. As a remedy, information filtering networks from network science can be used to mitigate the noise in empirical covariance estimation, and therefore, can bring added value to the portfolio construction process. In this paper, we propose the use of the Statistically Robust Information Filtering Network (SR-IFN) which leverages the bootstrapping techniques to eliminate unnecessary edges during the network formation and enhances the network's noise reduction capability further. We apply SR-IFN to index component stock pools in the US, UK, and China to assess its effectiveness. The SR-IFN network is partially disconnected with isolated nodes representing lesser-correlated assets, facilitating the selection of peripheral, diversified and higher-performing portfolios. Further optimization of performance can be achieved by inversely proportioning asset weights to their centrality based on the resultant network.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Quantum-Enhanced Forecasting: Leveraging Quantum Gramian Angular Field and CNNs for Stock Return Predictions
Authors:
Zhengmeng Xu,
Yujie Wang,
Xiaotong Feng,
Yilin Wang,
Yanli Li,
Hai Lin
Abstract:
We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF). This approach merges the advantages of quantum computing technology with deep learning, aiming to enhance the precision of time series classification and forecasting. We successfully transformed stock return time series data into two-dimensional images suitable for Convolutional Neural Network (CNN) training by…
▽ More
We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF). This approach merges the advantages of quantum computing technology with deep learning, aiming to enhance the precision of time series classification and forecasting. We successfully transformed stock return time series data into two-dimensional images suitable for Convolutional Neural Network (CNN) training by designing specific quantum circuits. Distinct from the classical Gramian Angular Field (GAF) approach, QGAF's uniqueness lies in eliminating the need for data normalization and inverse cosine calculations, simplifying the transformation process from time series data to two-dimensional images. To validate the effectiveness of this method, we conducted experiments on datasets from three major stock markets: the China A-share market, the Hong Kong stock market, and the US stock market. Experimental results revealed that compared to the classical GAF method, the QGAF approach significantly improved time series prediction accuracy, reducing prediction errors by an average of 25% for Mean Absolute Error (MAE) and 48% for Mean Squared Error (MSE). This research confirms the potential and promising prospects of integrating quantum computing with deep learning techniques in financial time series forecasting.
△ Less
Submitted 11 December, 2023; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Market Crowds' Trading Behaviors, Agreement Prices, and the Implications of Trading Volume
Authors:
Leilei Shi,
Bing Han,
Yingzi Zhu,
Liyan Han,
Yiwen Wang,
Yan Piao
Abstract:
It has been long that literature in financial academics focuses mainly on price and return but much less on trading volume. In the past twenty years, it has already linked both price and trading volume to economic fundamentals, and explored the behavioral implications of trading volume such as investor's attitude toward risks, overconfidence, disagreement, and attention etc. However, what is surpr…
▽ More
It has been long that literature in financial academics focuses mainly on price and return but much less on trading volume. In the past twenty years, it has already linked both price and trading volume to economic fundamentals, and explored the behavioral implications of trading volume such as investor's attitude toward risks, overconfidence, disagreement, and attention etc. However, what is surprising is how little we really know about trading volume. Here we show that trading volume probability represents the frequency of market crowd's trading action in terms of behavior analysis, and test two adaptive hypotheses relevant to the volume uncertainty associated with price in China stock market. The empirical work reveals that market crowd trade a stock in efficient adaptation except for simple heuristics, gradually tend to achieve agreement on an outcome or an asset price widely on a trading day, and generate such a stationary equilibrium price very often in interaction and competition among themselves no matter whether it is highly overestimated or underestimated. This suggests that asset prices include not only a fundamental value but also private information, speculative, sentiment, attention, gamble, and entertainment values etc. Moreover, market crowd adapt to gain and loss by trading volume increase or decrease significantly in interaction with environment in any two consecutive trading days. Our results demonstrate how interaction between information and news, the trading action, and return outcomes in the three-term feedback loop produces excessive trading volume which includes various internal and external causes.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
A Markovian empirical model for the VIX index and the pricing of the corresponding derivatives
Authors:
Ying-Li Wang,
Cheng-Long Xu,
Ping He
Abstract:
In this paper, we propose an empirical model for the VIX index. Our findings indicate that the VIX has a long-term empirical distribution. To model its dynamics, we utilize a continuous-time Markov process with a uniform distribution as its invariant distribution and a suitable function $h$. We determined that $h$ is the inverse function of the VIX data's empirical distribution. Additionally, we u…
▽ More
In this paper, we propose an empirical model for the VIX index. Our findings indicate that the VIX has a long-term empirical distribution. To model its dynamics, we utilize a continuous-time Markov process with a uniform distribution as its invariant distribution and a suitable function $h$. We determined that $h$ is the inverse function of the VIX data's empirical distribution. Additionally, we use the method of variables of separation to get the exact solution to the pricing problem for VIX futures and call options.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Domain-adapted Learning and Imitation: DRL for Power Arbitrage
Authors:
Yuanrong Wang,
Vignesh Raja Swaminathan,
Nikita P. Granger,
Carlos Ros Perez,
Christian Michler
Abstract:
In this paper, we discuss the Dutch power market, which is comprised of a day-ahead market and an intraday balancing market that operates like an auction. Due to fluctuations in power supply and demand, there is often an imbalance that leads to different prices in the two markets, providing an opportunity for arbitrage. To address this issue, we restructure the problem and propose a collaborative…
▽ More
In this paper, we discuss the Dutch power market, which is comprised of a day-ahead market and an intraday balancing market that operates like an auction. Due to fluctuations in power supply and demand, there is often an imbalance that leads to different prices in the two markets, providing an opportunity for arbitrage. To address this issue, we restructure the problem and propose a collaborative dual-agent reinforcement learning approach for this bi-level simulation and optimization of European power arbitrage trading. We also introduce two new implementations designed to incorporate domain-specific knowledge by imitating the trading behaviours of power traders. By utilizing reward engineering to imitate domain expertise, we are able to reform the reward system for the RL agent, which improves convergence during training and enhances overall performance. Additionally, the tranching of orders increases bidding success rates and significantly boosts profit and loss (P&L). Our study demonstrates that by leveraging domain expertise in a general learning problem, the performance can be improved substantially, and the final integrated approach leads to a three-fold improvement in cumulative P&L compared to the original agent. Furthermore, our methodology outperforms the highest benchmark policy by around 50% while maintaining efficient computational performance.
△ Less
Submitted 10 September, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
Domain-adapted Learning and Interpretability: DRL for Gas Trading
Authors:
Yuanrong Wang,
Yinsen Miao,
Alexander CY Wong,
Nikita P Granger,
Christian Michler
Abstract:
Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but e…
▽ More
Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but effective ensemble learning scheme for trading, which significantly improves performance through enhanced model stability and robustness as well as lower turnover and hence lower transaction cost. We discuss the resulting Deep RL strategy in terms of model explainability, trading frequency and risk measures.
△ Less
Submitted 10 September, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
Regime-based Implied Stochastic Volatility Model for Crypto Option Pricing
Authors:
Danial Saef,
Yuanrong Wang,
Tomaso Aste
Abstract:
The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), rises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets…
▽ More
The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), rises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets. However, they are either prone to the curse of dimensionality, as additional complexity is required to employ traditional theories, or they overfit historical patterns that may never repeat. Instead, we leverage recent advances in market regime (MR) clustering with the Implied Stochastic Volatility Model (ISVM). Time-regime clustering is a temporal clustering method, that clusters the historic evolution of a market into different volatility periods accounting for non-stationarity. ISVM can incorporate investor expectations in each of the sentiment-driven periods by using implied volatility (IV) data. In this paper, we applied this integrated time-regime clustering and ISVM method (termed MR-ISVM) to high-frequency data on BTC options at the popular trading platform Deribit. We demonstrate that MR-ISVM contributes to overcome the burden of complex adaption to jumps in higher order characteristics of option pricing models. This allows us to price the market based on the expectations of its participants in an adaptive fashion.
△ Less
Submitted 27 September, 2022; v1 submitted 15 August, 2022;
originally announced August 2022.
-
Anatomy of a Stablecoin's failure: the Terra-Luna case
Authors:
Antonio Briola,
David Vidal-Tomás,
Yuanrong Wang,
Tomaso Aste
Abstract:
We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finall…
▽ More
We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finally, using state-of-the-art techniques from network science, we study the evolution of dependency structures for 61 highly capitalised cryptocurrencies during the down-market and we also highlight the absence of herding behaviour analysing cross-sectional absolute deviation of returns.
△ Less
Submitted 25 September, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Predicting Stock Price Movement after Disclosure of Corporate Annual Reports: A Case Study of 2021 China CSI 300 Stocks
Authors:
Fengyu Han,
Yue Wang
Abstract:
In the current stock market, computer science and technology are more and more widely used to analyse stocks. Not same as most related machine learning stock price prediction work, this work study the predicting the tendency of the stock price on the second day right after the disclosure of the companies' annual reports. We use a variety of different models, including decision tree, logistic regre…
▽ More
In the current stock market, computer science and technology are more and more widely used to analyse stocks. Not same as most related machine learning stock price prediction work, this work study the predicting the tendency of the stock price on the second day right after the disclosure of the companies' annual reports. We use a variety of different models, including decision tree, logistic regression, random forest, neural network, prototypical networks. We use two sets of financial indicators (key and expanded) to conduct experiments, these financial indicators are obtained from the EastMoney website disclosed by companies, and finally we find that these models are not well behaved to predict the tendency. In addition, we also filter stocks with ROE greater than 0.15 and net cash ratio greater than 0.9. We conclude that according to the financial indicators based on the just-released annual report of the company, the predictability of the stock price movement on the second day after disclosure is weak, with maximum accuracy about 59.6% and maximum precision about 0.56 on our test set by the random forest classifier, and the stock filtering does not improve the performance. And random forests perform best in general among all these models which conforms to some work's findings.
△ Less
Submitted 21 July, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Static Replication of Impermanent Loss for Concentrated Liquidity Provision in Decentralised Markets
Authors:
Jun Deng,
Hua Zong,
Yun Wang
Abstract:
This article analytically characterizes the impermanent loss of concentrated liquidity provision for automatic market makers in decentralised markets such as Uniswap. We propose two static replication formulas for the impermanent loss by a combination of European calls or puts with strike prices supported on the liquidity provision price interval. It facilitates liquidity providers to hedge perman…
▽ More
This article analytically characterizes the impermanent loss of concentrated liquidity provision for automatic market makers in decentralised markets such as Uniswap. We propose two static replication formulas for the impermanent loss by a combination of European calls or puts with strike prices supported on the liquidity provision price interval. It facilitates liquidity providers to hedge permanent loss by trading crypto options in more liquid centralised exchanges such as Deribit. Numerical examples illustrate the astonishing accuracy of the static replication.
△ Less
Submitted 2 March, 2023; v1 submitted 24 May, 2022;
originally announced May 2022.
-
Bivariate Distribution Regression with Application to Insurance Data
Authors:
Yunyun Wang,
Tatsushi Oka,
Dan Zhu
Abstract:
Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution reg…
▽ More
Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution regression and factorization methods. This method is considered semiparametric in that it allows for flexible modeling of both the marginal and joint distributions conditional on covariates without imposing global parametric assumptions across the entire distribution. In contrast to existing parametric approaches, our method can accommodate discrete, continuous, or mixed variables, and provides a simple yet effective way to capture distributional dependence structures between bivariate outcomes and covariates. Various simulation results confirm that our method can perform similarly or better in finite samples compared to the alternative methods. In an application to the study of a motor third-party liability insurance portfolio, the proposed method effectively estimates risk measures such as the conditional Value-at-Risk and Expected Shortfall. This result suggests that this semiparametric approach can serve as an alternative in insurance risk management.
△ Less
Submitted 3 September, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series
Authors:
Yuanrong Wang,
Tomaso Aste
Abstract:
We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural network, our model explicitly leverag…
▽ More
We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural network, our model explicitly leverage time-series filtering to overcome the low signal-to-noise ratio typical of complex systems data. We present a set of experiments, where we predict future sales from a synthetic time-series sales dataset. The proposed spatial-temporal graph neural network displays superior performances with respect to baseline approaches, with no graphical information, and with fully connected, disconnected graphs and unfiltered graphs.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
The Evolution of Blockchain: from Lit to Dark
Authors:
Agostino Capponi,
Ruizhe Jia,
Ye Wang
Abstract:
Transactions submitted through the blockchain peer-to-peer (P2P) network may leak out exploitable information. We study the economic incentives behind the adoption of blockchain dark venues, where users' transactions are observable only by miners on these venues. We show that miners may not fully adopt dark venues to preserve rents extracted from arbitrageurs, hence creating execution risk for use…
▽ More
Transactions submitted through the blockchain peer-to-peer (P2P) network may leak out exploitable information. We study the economic incentives behind the adoption of blockchain dark venues, where users' transactions are observable only by miners on these venues. We show that miners may not fully adopt dark venues to preserve rents extracted from arbitrageurs, hence creating execution risk for users. The dark venue neither eliminates frontrunning risk nor reduces transaction costs. It strictly increases the payoff of miners, weakly increases the payoff of users, and weakly reduces arbitrageurs' profits. We provide empirical support for our main implications, and show that they are economically significant. A 1% increase in the probability of being frontrun raises users' adoption rate of the dark venue by 0.6%. Arbitrageurs' cost-to-revenue ratio increases by a third with a dark venue.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
Smooth Nested Simulation: Bridging Cubic and Square Root Convergence Rates in High Dimensions
Authors:
Wenjia Wang,
Yanyuan Wang,
Xiaowei Zhang
Abstract:
Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the con…
▽ More
Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the convergence rate as the simulation budget increases, provided that the conditional expectation is sufficiently smooth. The smoothness bridges the gap between the cubic root convergence rate (that is, the optimal rate for the standard nested simulation) and the square root convergence rate (that is, the canonical rate for the standard Monte Carlo simulation). We demonstrate the performance of the proposed method via numerical examples from portfolio risk management and input uncertainty quantification.
△ Less
Submitted 11 October, 2023; v1 submitted 9 January, 2022;
originally announced January 2022.
-
Dynamic Portfolio Optimization with Inverse Covariance Clustering
Authors:
Yuanrong Wang,
Tomaso Aste
Abstract:
Market conditions change continuously. However, in portfolio's investment strategies, it is hard to account for this intrinsic non-stationarity. In this paper, we propose to address this issue by using the Inverse Covariance Clustering (ICC) method to identify inherent market states and then integrate such states into a dynamic portfolio optimization process. Extensive experiments across three dif…
▽ More
Market conditions change continuously. However, in portfolio's investment strategies, it is hard to account for this intrinsic non-stationarity. In this paper, we propose to address this issue by using the Inverse Covariance Clustering (ICC) method to identify inherent market states and then integrate such states into a dynamic portfolio optimization process. Extensive experiments across three different markets, NASDAQ, FTSE and HS300, over a period of ten years, demonstrate the advantages of our proposed algorithm, termed Inverse Covariance Clustering-Portfolio Optimization (ICC-PO). The core of the ICC-PO methodology concerns the identification and clustering of market states from the analytics of past data and the forecasting of the future market state. It is therefore agnostic to the specific portfolio optimization method of choice. By applying the same portfolio optimization technique on a ICC temporal cluster, instead of the whole train period, we show that one can generate portfolios with substantially higher Sharpe Ratios, which are statistically more robust and resilient with great reductions in maximum loss in extreme situations. This is shown to be consistent across markets, periods, optimization methods and selection of portfolio assets.
△ Less
Submitted 14 January, 2022; v1 submitted 31 December, 2021;
originally announced December 2021.
-
Predictable Forward Performance Processes: Infrequent Evaluation and Applications to Human-Machine Interactions
Authors:
Gechun Liang,
Moris S. Strub,
Yuwei Wang
Abstract:
We study discrete-time predictable forward processes when trading times do not coincide with performance evaluation times in a binomial tree model for the financial market. The key step in the construction of these processes is to solve a linear functional equation of higher order associated with the inverse problem driving the evolution of the predictable forward process. We provide sufficient co…
▽ More
We study discrete-time predictable forward processes when trading times do not coincide with performance evaluation times in a binomial tree model for the financial market. The key step in the construction of these processes is to solve a linear functional equation of higher order associated with the inverse problem driving the evolution of the predictable forward process. We provide sufficient conditions for the existence and uniqueness and an explicit construction of the predictable forward process under these conditions. Furthermore, we find that these processes are inherently myopic in the sense that optimal strategies do not make use of future model parameters even if these are known. Finally, we argue that predictable forward preferences are a viable framework to model human-machine interactions occuring in automated trading or robo-advising. For both applications, we determine an optimal interaction schedule of a human agent interacting infrequently with a machine that is in charge of trading.
△ Less
Submitted 2 December, 2023; v1 submitted 17 October, 2021;
originally announced October 2021.
-
Robust Risk-Aware Reinforcement Learning
Authors:
Sebastian Jaimungal,
Silvana Pesenti,
Ye Sheng Wang,
Hariom Tatsat
Abstract:
We present a reinforcement learning (RL) approach for robust optimisation of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows the agent to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against mode…
▽ More
We present a reinforcement learning (RL) approach for robust optimisation of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows the agent to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against model uncertainty, we assess a policy not by its distribution, but rather, by the worst possible distribution that lies within a Wasserstein ball around it. Thus, our problem formulation may be viewed as an actor/agent choosing a policy (the outer problem), and the adversary then acting to worsen the performance of that strategy (the inner problem). We develop explicit policy gradient formulae for the inner and outer problems, and show its efficacy on three prototypical financial problems: robust portfolio allocation, optimising a benchmark, and statistical arbitrage.
△ Less
Submitted 14 December, 2021; v1 submitted 23 August, 2021;
originally announced August 2021.
-
Adaptive Gradient Descent Methods for Computing Implied Volatility
Authors:
Yixiao Lu,
Yihong Wang,
Tinggan Yang
Abstract:
In this paper, a new numerical method based on adaptive gradient descent optimizers is provided for computing the implied volatility from the Black-Scholes (B-S) option pricing model. It is shown that the new method is more accurate than the close form approximation. Compared with the Newton-Raphson method, the new method obtains a reliable rate of convergence and tends to be less sensitive to the…
▽ More
In this paper, a new numerical method based on adaptive gradient descent optimizers is provided for computing the implied volatility from the Black-Scholes (B-S) option pricing model. It is shown that the new method is more accurate than the close form approximation. Compared with the Newton-Raphson method, the new method obtains a reliable rate of convergence and tends to be less sensitive to the beginning point.
△ Less
Submitted 22 March, 2023; v1 submitted 16 August, 2021;
originally announced August 2021.
-
Behavior of Liquidity Providers in Decentralized Exchanges
Authors:
Lioba Heimbach,
Ye Wang,
Roger Wattenhofer
Abstract:
Decentralized exchanges (DEXes) have introduced an innovative trading mechanism, where it is not necessary to match buy-orders and sell-orders to execute a trade. DEXes execute each trade individually, and the exchange rate is automatically determined by the ratio of assets reserved in the market. Therefore, apart from trading, financial players can also liquidity providers, benefiting from transa…
▽ More
Decentralized exchanges (DEXes) have introduced an innovative trading mechanism, where it is not necessary to match buy-orders and sell-orders to execute a trade. DEXes execute each trade individually, and the exchange rate is automatically determined by the ratio of assets reserved in the market. Therefore, apart from trading, financial players can also liquidity providers, benefiting from transaction fees from trades executed in DEXes. Although liquidity providers are essential for the functionality of DEXes, it is not clear how liquidity providers behave in such markets. In this paper, we aim to understand how liquidity providers react to market information and how they benefit from providing liquidity in DEXes. We measure the operations of liquidity providers on Uniswap and analyze how they determine their investment strategy based on market changes. We also reveal their returns and risks of investments in different trading pair categories, i.e., stable pairs, normal pairs, and exotic pairs. Further, we investigate the movement of liquidity between trading pools. To the best of our knowledge, this is the first work that systematically studies the behavior of liquidity providers in DEXes.
△ Less
Submitted 11 October, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Cyclic Arbitrage in Decentralized Exchanges
Authors:
Ye Wang,
Yan Chen,
Haotian Wu,
Liyi Zhou,
Shuiguang Deng,
Roger Wattenhofer
Abstract:
Decentralized Exchanges (DEXes) enable users to create markets for exchanging any pair of cryptocurrencies. The direct exchange rate of two tokens may not match the cross-exchange rate in the market, and such price discrepancies open up arbitrage possibilities with trading through different cryptocurrencies cyclically. In this paper, we conduct a systematic investigation on cyclic arbitrages in DE…
▽ More
Decentralized Exchanges (DEXes) enable users to create markets for exchanging any pair of cryptocurrencies. The direct exchange rate of two tokens may not match the cross-exchange rate in the market, and such price discrepancies open up arbitrage possibilities with trading through different cryptocurrencies cyclically. In this paper, we conduct a systematic investigation on cyclic arbitrages in DEXes. We propose a theoretical framework for studying cyclic arbitrage. With our framework, we analyze the profitability conditions and optimal trading strategies of cyclic transactions. We further examine exploitable arbitrage opportunities and the market size of cyclic arbitrages with transaction-level data of Uniswap V2. We find that traders have executed 292,606 cyclic arbitrages over eleven months and exploited more than 138 million USD in revenue. However, the revenue of the most profitable unexploited opportunity is persistently higher than 1 ETH (4,000 USD), which indicates that DEX markets may not be efficient enough. By analyzing how traders implement cyclic arbitrages, we find that traders can utilize smart contracts to issue atomic transactions and the atomic implementations could mitigate users' financial loss in cyclic arbitrage from the price impact.
△ Less
Submitted 14 January, 2022; v1 submitted 21 April, 2021;
originally announced May 2021.
-
A Black-Scholes user's guide to the Bachelier model
Authors:
Jaehyuk Choi,
Minsuk Kwak,
Chyng Wen Tee,
Yumeng Wang
Abstract:
To cope with the negative oil futures price caused by the COVID-19 recession, global commodity futures exchanges temporarily switched the option model from Black--Scholes to Bachelier in 2020. This study reviews the literature on Bachelier's pioneering option pricing model and summarizes the practical results on volatility conversion, risk management, stochastic volatility, and barrier options pri…
▽ More
To cope with the negative oil futures price caused by the COVID-19 recession, global commodity futures exchanges temporarily switched the option model from Black--Scholes to Bachelier in 2020. This study reviews the literature on Bachelier's pioneering option pricing model and summarizes the practical results on volatility conversion, risk management, stochastic volatility, and barrier options pricing to facilitate the model transition. In particular, using the displaced Black-Scholes model as a model family with the Black-Scholes and Bachelier models as special cases, we not only connect the two models but also present a continuous spectrum of model choices.
△ Less
Submitted 6 February, 2022; v1 submitted 17 April, 2021;
originally announced April 2021.
-
Overnight GARCH-Itô Volatility Models
Authors:
Donggyu Kim,
Minseok Shin,
Yazhen Wang
Abstract:
Various parametric volatility models for financial data have been developed to incorporate high-frequency realized volatilities and better capture market dynamics. However, because high-frequency trading data are not available during the close-to-open period, the volatility models often ignore volatility information over the close-to-open period and thus may suffer from loss of important informati…
▽ More
Various parametric volatility models for financial data have been developed to incorporate high-frequency realized volatilities and better capture market dynamics. However, because high-frequency trading data are not available during the close-to-open period, the volatility models often ignore volatility information over the close-to-open period and thus may suffer from loss of important information relevant to market dynamics. In this paper, to account for whole-day market dynamics, we propose an overnight volatility model based on Itô diffusions to accommodate two different instantaneous volatility processes for the open-to-close and close-to-open periods. We develop a weighted least squares method to estimate model parameters for two different periods and investigate its asymptotic properties. We conduct a simulation study to check the finite sample performance of the proposed model and method. Finally, we apply the proposed approaches to real trading data.
△ Less
Submitted 17 June, 2022; v1 submitted 24 February, 2021;
originally announced February 2021.
-
Insights into Fairness through Trust: Multi-scale Trust Quantification for Financial Deep Learning
Authors:
Alexander Wong,
Andrew Hryniowski,
Xiao Yu Wang
Abstract:
The success of deep learning in recent years have led to a significant increase in interest and prevalence for its adoption to tackle financial services tasks. One particular question that often arises as a barrier to adopting deep learning for financial services is whether the developed financial deep learning models are fair in their predictions, particularly in light of strong governance and re…
▽ More
The success of deep learning in recent years have led to a significant increase in interest and prevalence for its adoption to tackle financial services tasks. One particular question that often arises as a barrier to adopting deep learning for financial services is whether the developed financial deep learning models are fair in their predictions, particularly in light of strong governance and regulatory compliance requirements in the financial services industry. A fundamental aspect of fairness that has not been explored in financial deep learning is the concept of trust, whose variations may point to an egocentric view of fairness and thus provide insights into the fairness of models. In this study we explore the feasibility and utility of a multi-scale trust quantification strategy to gain insights into the fairness of a financial deep learning model, particularly under different scenarios at different scales. More specifically, we conduct multi-scale trust quantification on a deep neural network for the purpose of credit card default prediction to study: 1) the overall trustworthiness of the model 2) the trust level under all possible prediction-truth relationships, 3) the trust level across the spectrum of possible predictions, 4) the trust level across different demographic groups (e.g., age, gender, and education), and 5) distribution of overall trust for an individual prediction scenario. The insights for this proof-of-concept study demonstrate that such a multi-scale trust quantification strategy may be helpful for data scientists and regulators in financial services as part of the verification and certification of financial deep learning solutions to gain insights into fairness and trust of these solutions.
△ Less
Submitted 3 November, 2020;
originally announced November 2020.
-
Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network
Authors:
Xing Wang,
Yijun Wang,
Bin Weng,
Aleksandr Vinel
Abstract:
We have proposed to develop a global hybrid deep learning framework to predict the daily prices in the stock market. With representation learning, we derived an embedding called Stock2Vec, which gives us insight for the relationship among different stocks, while the temporal convolutional layers are used for automatically capturing effective temporal patterns both within and across series. Evaluat…
▽ More
We have proposed to develop a global hybrid deep learning framework to predict the daily prices in the stock market. With representation learning, we derived an embedding called Stock2Vec, which gives us insight for the relationship among different stocks, while the temporal convolutional layers are used for automatically capturing effective temporal patterns both within and across series. Evaluated on S&P 500, our hybrid framework integrates both advantages and achieves better performance on the stock price prediction task than several popular benchmarked models.
△ Less
Submitted 29 September, 2020;
originally announced October 2020.
-
Improving Investment Suggestions for Peer-to-Peer (P2P) Lending via Integrating Credit Scoring into Profit Scoring
Authors:
Yan Wang,
Xuelei Sherry Ni
Abstract:
In the peer-to-peer (P2P) lending market, lenders lend the money to the borrowers through a virtual platform and earn the possible profit generated by the interest rate. From the perspective of lenders, they want to maximize the profit while minimizing the risk. Therefore, many studies have used machine learning algorithms to help the lenders identify the "best" loans for making investments. The s…
▽ More
In the peer-to-peer (P2P) lending market, lenders lend the money to the borrowers through a virtual platform and earn the possible profit generated by the interest rate. From the perspective of lenders, they want to maximize the profit while minimizing the risk. Therefore, many studies have used machine learning algorithms to help the lenders identify the "best" loans for making investments. The studies have mainly focused on two categories to guide the lenders' investments: one aims at minimizing the risk of investment (i.e., the credit scoring perspective) while the other aims at maximizing the profit (i.e., the profit scoring perspective). However, they have all focused on one category only and there is seldom research trying to integrate the two categories together. Motivated by this, we propose a two-stage framework that incorporates the credit information into a profit scoring modeling. We conducted the empirical experiment on a real-world P2P lending data from the US P2P market and used the Light Gradient Boosting Machine (lightGBM) algorithm in the two-stage framework. Results show that the proposed two-stage method could identify more profitable loans and thereby provide better investment guidance to the investors compared to the existing one-stage profit scoring alone approach. Therefore, the proposed framework serves as an innovative perspective for making investment decisions in P2P lending.
△ Less
Submitted 9 September, 2020;
originally announced September 2020.
-
Automated Market Makers for Decentralized Finance (DeFi)
Authors:
Yongge Wang
Abstract:
This paper compares mathematical models for automated market makers including logarithmic market scoring rule (LMSR), liquidity sensitive LMSR (LS-LMSR), constant product/mean/sum, and others. It is shown that though LMSR may not be a good model for Decentralized Finance (DeFi) applications, LS-LMSR has several advantages over constant product/mean based automated market makers. However, LS-LMSR r…
▽ More
This paper compares mathematical models for automated market makers including logarithmic market scoring rule (LMSR), liquidity sensitive LMSR (LS-LMSR), constant product/mean/sum, and others. It is shown that though LMSR may not be a good model for Decentralized Finance (DeFi) applications, LS-LMSR has several advantages over constant product/mean based automated market makers. However, LS-LMSR requires complicated computation (i.e., logarithm and exponentiation) and the cost function curve is concave. In certain DeFi applications, it is preferred to have computationally efficient cost functions with convex curves to conform with the principle of supply and demand. This paper proposes and analyzes constant circle/ellipse based cost functions for automated market makers. The proposed cost functions are computationally efficient (only requires multiplication and square root calculation) and have several advantages over widely deployed constant product cost functions. For example, the proposed market makers are more robust against front-runner (slippage) attacks.
△ Less
Submitted 18 May, 2024; v1 submitted 3 September, 2020;
originally announced September 2020.