Search | arXiv e-print repository

Quantum Stochastic Walks for Portfolio Optimization: Theory and Implementation on Financial Networks

Authors: Yen Jui Chang, Wei-Ting Wang, Yun-Yuan Wang, Chen-Yu Liu, Kuan-Cheng Chen, Ching-Ray Chang

Abstract: Financial markets are noisy yet contain a latent graph-theoretic structure that can be exploited for superior risk-adjusted returns. We propose a quantum stochastic walk (QSW) optimizer that embeds assets in a weighted graph: nodes represent securities while edges encode the return-covariance kernel. Portfolio weights are derived from the walk's stationary distribution. Three empirical studies sup… ▽ More Financial markets are noisy yet contain a latent graph-theoretic structure that can be exploited for superior risk-adjusted returns. We propose a quantum stochastic walk (QSW) optimizer that embeds assets in a weighted graph: nodes represent securities while edges encode the return-covariance kernel. Portfolio weights are derived from the walk's stationary distribution. Three empirical studies support the approach. (i) For the top 100 S\&P 500 constituents over 2016-2024, six scenario portfolios calibrated on 1- and 2-year windows lift the out-of-sample Sharpe ratio by up to 27\% while cutting annual turnover from 480\% (mean-variance) to 2-90%. (ii) A $5^{4}=625$-point grid search identifies a robust sweet spot, $α,λ\lesssim0.5$ and $ω\in[0.2,0.4]$, that delivers Sharpe $\approx0.97$ at $\le 5\%$ turnover and Herfindahl-Hirschman index $\sim0.01$. (iii) Repeating the full grid on 50 random 100-stock subsets of the S\&P 500 adds 31\,350 back-tests: the best-per-draw QSW beats re-optimised mean-variance on Sharpe in 54\% of cases and always wins on trading efficiency, with median turnover 36\% versus 351\%. Overall, QSW raises the annualized Sharpe ratio by 15\% and cuts turnover by 90\% relative to classical optimisation, all while respecting the UCITS 5/10/40 rule. These results show that hybrid quantum-classical dynamics can uncover non-linear dependencies overlooked by quadratic models and offer a practical, low-cost weighting engine for themed ETFs and other systematic mandates. △ Less

Submitted 5 July, 2025; originally announced July 2025.

Comments: 56 pages. 25 Figures

arXiv:2505.05595 [pdf, ps, other]

Trading Under Uncertainty: A Distribution-Based Strategy for Futures Markets Using FutureQuant Transformer

Authors: Wenhao Guo, Yuda Wang, Zeqiao Huang, Changjiang Zhang, Shumin ma

Abstract: In the complex landscape of traditional futures trading, where vast data and variables like real-time Limit Order Books (LOB) complicate price predictions, we introduce the FutureQuant Transformer model, leveraging attention mechanisms to navigate these challenges. Unlike conventional models focused on point predictions, the FutureQuant model excels in forecasting the range and volatility of futur… ▽ More In the complex landscape of traditional futures trading, where vast data and variables like real-time Limit Order Books (LOB) complicate price predictions, we introduce the FutureQuant Transformer model, leveraging attention mechanisms to navigate these challenges. Unlike conventional models focused on point predictions, the FutureQuant model excels in forecasting the range and volatility of future prices, thus offering richer insights for trading strategies. Its ability to parse and learn from intricate market patterns allows for enhanced decision-making, significantly improving risk management and achieving a notable average gain of 0.1193% per 30-minute trade over state-of-the-art models with a simple algorithm using factors such as RSI, ATR, and Bollinger Bands. This innovation marks a substantial leap forward in predictive analytics within the volatile domain of futures trading. △ Less

Submitted 8 May, 2025; originally announced May 2025.

Comments: 16 pages, 12 figures

arXiv:2504.13532 [pdf, other]

Quantum Walks-Based Adaptive Distribution Generation with Efficient CUDA-Q Acceleration

Authors: Yen-Jui Chang, Wei-Ting Wang, Chen-Yu Liu, Yun-Yuan Wang, Ching-Ray Chang

Abstract: We present a novel Adaptive Distribution Generator that leverages a quantum walks-based approach to generate high precision and efficiency of target probability distributions. Our method integrates variational quantum circuits with discrete-time quantum walks, specifically, split-step quantum walks and their entangled extensions, to dynamically tune coin parameters and drive the evolution of quant… ▽ More We present a novel Adaptive Distribution Generator that leverages a quantum walks-based approach to generate high precision and efficiency of target probability distributions. Our method integrates variational quantum circuits with discrete-time quantum walks, specifically, split-step quantum walks and their entangled extensions, to dynamically tune coin parameters and drive the evolution of quantum states towards desired distributions. This enables accurate one-dimensional probability modeling for applications such as financial simulation and structured two-dimensional pattern generation exemplified by digit representations(0~9). Implemented within the CUDA-Q framework, our approach exploits GPU acceleration to significantly reduce computational overhead and improve scalability relative to conventional methods. Extensive benchmarks demonstrate that our Quantum Walks-Based Adaptive Distribution Generator achieves high simulation fidelity and bridges the gap between theoretical quantum algorithms and practical high-performance computation. △ Less

Submitted 18 April, 2025; originally announced April 2025.

Comments: 17 pages, 5 figures

arXiv:2504.04113 [pdf, ps, other]

A note on time-inconsistent stochastic control problems with higher-order moments

Authors: Yike Wang

Abstract: In this paper, we extend the research on time-consistent stochastic control problems with higher-order moments, as formulated by [Y. Wang et al. SIAM J. Control. Optim., 63 (2025), in press]. We consider a linear controlled dynamic equation with state-dependent diffusion, and let the sum of a conventional mean-variance utility and a fairly general function of higher-order central moments be the ob… ▽ More In this paper, we extend the research on time-consistent stochastic control problems with higher-order moments, as formulated by [Y. Wang et al. SIAM J. Control. Optim., 63 (2025), in press]. We consider a linear controlled dynamic equation with state-dependent diffusion, and let the sum of a conventional mean-variance utility and a fairly general function of higher-order central moments be the objective functional. We obtain both the sufficiency and necessity of the equilibrium condition for an open-loop Nash equilibrium control (ONEC), under some continuity and integrability assumptions that are more relaxed and natural than those employed before. Notably, we derive an extended version of the stochastic Lebesgue differentiation theorem for necessity, because the equilibrium condition is represented by some diagonal processes generated by a flow of backward stochastic differential equations whose the data do not necessarily satisfy the usual square-integrability. Based on the derived equilibrium condition, we obtain the algebra equation for a deterministic ONEC. In particular, we find that the mean-variance equilibrium strategy is an ONEC for our higher-order moment problem if and only if the objective functional satisfies a homogeneity condition. △ Less

Submitted 5 April, 2025; originally announced April 2025.

Comments: 20 pages

MSC Class: Primary: 93E20; 91G80; Secondary: 91B08; 49N90

arXiv:2503.18259 [pdf, ps, other]

Rough Heston model as the scaling limit of bivariate cumulative heavy-tailed INAR($\infty$) processes and applications

Authors: Yingli Wang, Zhenyu Cui

Abstract: This paper establishes a novel link between nearly unstable cumulative heavy-tailed integer-valued autoregressive (INAR($\infty$)) processes and the rough Heston model via discrete scaling limits. We prove that a sequence of bivariate cumulative INAR($\infty$) processes converge in law to the rough Heston model under appropriate scaling conditions, providing a rigorous mathematical foundation for… ▽ More This paper establishes a novel link between nearly unstable cumulative heavy-tailed integer-valued autoregressive (INAR($\infty$)) processes and the rough Heston model via discrete scaling limits. We prove that a sequence of bivariate cumulative INAR($\infty$) processes converge in law to the rough Heston model under appropriate scaling conditions, providing a rigorous mathematical foundation for understanding how microstructural order flow drives macroscopic prices following rough volatility dynamics. Our theoretical framework extends the scaling limit techniques from Hawkes processes to the INAR($\infty$) setting. Hence we can carry out efficient Monte Carlo simulation of the rough Heston model through simulating the corresponding approximating INAR($\infty$) processes, which provides an alternative discrete-time simulation method to the Euler-Maruyama method. Extensive numerical experiments illustrate the improved accuracy and efficiency of the proposed simulation scheme as compared to the literature, in the valuation of European options, and also path-dependent options such as arithmetic Asian options, lookback options and barrier options. △ Less

Submitted 9 April, 2025; v1 submitted 23 March, 2025; originally announced March 2025.

MSC Class: 60G22; 60H35; 91G20; 62M10; 60F17

arXiv:2503.06929 [pdf, other]

Assessing Uncertainty in Stock Returns: A Gaussian Mixture Distribution-Based Method

Authors: Yanlong Wang, Jian Xu, Shao-Lun Huang, Danny Dongning Sun, Xiao-Ping Zhang

Abstract: This study seeks to advance the understanding and prediction of stock market return uncertainty through the application of advanced deep learning techniques. We introduce a novel deep learning model that utilizes a Gaussian mixture distribution to capture the complex, time-varying nature of asset return distributions in the Chinese stock market. By incorporating the Gaussian mixture distribution,… ▽ More This study seeks to advance the understanding and prediction of stock market return uncertainty through the application of advanced deep learning techniques. We introduce a novel deep learning model that utilizes a Gaussian mixture distribution to capture the complex, time-varying nature of asset return distributions in the Chinese stock market. By incorporating the Gaussian mixture distribution, our approach effectively characterizes short-term fluctuations and non-traditional features of stock returns, such as skewness and heavy tails, that are often overlooked by traditional models. Compared to GARCH models and their variants, our method demonstrates superior performance in volatility estimation, particularly during periods of heightened market volatility. It provides more accurate volatility forecasts and offers unique risk insights for different assets, thereby deepening the understanding of return uncertainty. Additionally, we propose a novel use of Code embedding which utilizes a bag-of-words approach to train hidden representations of stock codes and transforms the uncertainty attributes of stocks into high-dimensional vectors. These vectors are subsequently reduced to two dimensions, allowing the observation of similarity among different stocks. This visualization facilitates the identification of asset clusters with similar risk profiles, offering valuable insights for portfolio management and risk mitigation. Since we predict the uncertainty of returns by estimating their latent distribution, it is challenging to evaluate the return distribution when the true distribution is unobservable. However, we can measure it through the CRPS to assess how well the predicted distribution matches the true returns, and through MSE and QLIKE metrics to evaluate the error between the volatility level of the predicted distribution and proxy measures of true volatility. △ Less

Submitted 10 March, 2025; originally announced March 2025.

Comments: 23 pages

arXiv:2503.06928 [pdf, ps, other]

FinTSBridge: A New Evaluation Suite for Real-world Financial Prediction with Advanced Time Series Models

Authors: Yanlong Wang, Jian Xu, Tiantian Gao, Hongkang Zhang, Shao-Lun Huang, Danny Dongning Sun, Xiao-Ping Zhang

Abstract: Despite the growing attention to time series forecasting in recent years, many studies have proposed various solutions to address the challenges encountered in time series prediction, aiming to improve forecasting performance. However, effectively applying these time series forecasting models to the field of financial asset pricing remains a challenging issue. There is still a need for a bridge to… ▽ More Despite the growing attention to time series forecasting in recent years, many studies have proposed various solutions to address the challenges encountered in time series prediction, aiming to improve forecasting performance. However, effectively applying these time series forecasting models to the field of financial asset pricing remains a challenging issue. There is still a need for a bridge to connect cutting-edge time series forecasting models with financial asset pricing. To bridge this gap, we have undertaken the following efforts: 1) We constructed three datasets from the financial domain; 2) We selected over ten time series forecasting models from recent studies and validated their performance in financial time series; 3) We developed new metrics, msIC and msIR, in addition to MSE and MAE, to showcase the time series correlation captured by the models; 4) We designed financial-specific tasks for these three datasets and assessed the practical performance and application potential of these forecasting models in important financial problems. We hope the developed new evaluation suite, FinTSBridge, can provide valuable insights into the effectiveness and robustness of advanced forecasting models in finanical domains. △ Less

Submitted 11 June, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

Comments: ICLR 2025 Workshop Advances in Financial AI

arXiv:2502.11052 [pdf, other]

Time-consistent portfolio selection with strictly monotone mean-variance preference

Authors: Yike Wang, Yusha Chen

Abstract: This paper is devoted to time-consistent control problems of portfolio selection with strictly monotone mean-variance preferences. These preferences are variational modifications of the conventional mean-variance preferences, and remain time-inconsistent as in mean-variance optimization problems. To tackle the time-inconsistency, we study the Nash equilibrium controls of both the open-loop type an… ▽ More This paper is devoted to time-consistent control problems of portfolio selection with strictly monotone mean-variance preferences. These preferences are variational modifications of the conventional mean-variance preferences, and remain time-inconsistent as in mean-variance optimization problems. To tackle the time-inconsistency, we study the Nash equilibrium controls of both the open-loop type and the closed-loop type, and characterize them within a random parameter setting. The problem is reduced to solving a flow of forward-backward stochastic differential equations for open-loop equilibria, and to solving extended Hamilton-Jacobi-Bellman equations for closed-loop equilibria. In particular, we derive semi-closed-form solutions for these two types of equilibria under a deterministic parameter setting. Both solutions are represented by the same function, which is independent of wealth state and random path. This function can be expressed as the conventional time-consistent mean-variance portfolio strategy multiplied by a factor greater than one. Furthermore, we find that the state-independent closed-loop Nash equilibrium control is a strong equilibrium strategy in a constant parameter setting only when the interest rate is sufficiently large. △ Less

Submitted 16 February, 2025; originally announced February 2025.

Comments: 25 pages, 2 figures

MSC Class: Primary: 91G10; 49N10; Secondary: 91B05; 49N90

arXiv:2412.19817 [pdf]

Digital transformation: A systematic review and bibliometric analysis from the corporate finance perspective

Authors: Ping Zhang, Yiru Wang

Abstract: Digital transformation significantly impacts firm investment, financing, and value enhancement. A systematic investigation from the corporate finance perspective has not yet been formed. This paper combines bibliometric and content analysis methods to systematically review the evolutionary trend, status quo, hotspots and overall structure of research in digital transformation from 2011 to 2024. Th… ▽ More Digital transformation significantly impacts firm investment, financing, and value enhancement. A systematic investigation from the corporate finance perspective has not yet been formed. This paper combines bibliometric and content analysis methods to systematically review the evolutionary trend, status quo, hotspots and overall structure of research in digital transformation from 2011 to 2024. The study reveals an emerging and rapidly growing focus on digital transformation research, particularly in developed countries. We categorize the literature into three areas according to bibliometric clustering: the measurements (qualitative and quantitative), impact factors (internal and external), and the economic consequences (investment, financing, and firm value). These areas are divided into ten sub-branches, with a detailed literature review. We also review the existing theories related to digital transformation, identify the current gaps in these papers, and provide directions for future research on each sub-branches. △ Less

Submitted 12 December, 2024; originally announced December 2024.

arXiv:2412.18222 [pdf]

Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications

Authors: Yuhan Wang, Zhen Xu, Yue Yao, Jinsong Liu, Jiating Lin

Abstract: With the development of the financial industry, credit default prediction, as an important task in financial risk management, has received increasing attention. Traditional credit default prediction methods mostly rely on machine learning models, such as decision trees and random forests, but these methods have certain limitations in processing complex data and capturing potential risk patterns. T… ▽ More With the development of the financial industry, credit default prediction, as an important task in financial risk management, has received increasing attention. Traditional credit default prediction methods mostly rely on machine learning models, such as decision trees and random forests, but these methods have certain limitations in processing complex data and capturing potential risk patterns. To this end, this paper proposes a deep learning model based on the combination of convolutional neural networks (CNN) and Transformer for credit user default prediction. The model combines the advantages of CNN in local feature extraction with the ability of Transformer in global dependency modeling, effectively improving the accuracy and robustness of credit default prediction. Through experiments on public credit default datasets, the results show that the CNN+Transformer model outperforms traditional machine learning models, such as random forests and XGBoost, in multiple evaluation indicators such as accuracy, AUC, and KS value, demonstrating its powerful ability in complex financial data modeling. Further experimental analysis shows that appropriate optimizer selection and learning rate adjustment play a vital role in improving model performance. In addition, the ablation experiment of the model verifies the advantages of the combination of CNN and Transformer and proves the complementarity of the two in credit default prediction. This study provides a new idea for credit default prediction and provides strong support for risk assessment and intelligent decision-making in the financial field. Future research can further improve the prediction effect and generalization ability by introducing more unstructured data and improving the model architecture. △ Less

Submitted 24 December, 2024; originally announced December 2024.

arXiv:2412.13523 [pdf, other]

Strictly monotone mean-variance preferences with applications to portfolio selection

Authors: Yike Wang, Yusha Chen

Abstract: This paper extends the monotone mean-variance (MMV) preference to a broader class of strictly monotone mean-variance (SMMV) preferences, and demonstrates its applications to portfolio selection problems. For the single-period portfolio problem under the SMMV preference, we derive the gradient condition for the optimal strategy, and investigate its association with the optimal mean-variance (MV) st… ▽ More This paper extends the monotone mean-variance (MMV) preference to a broader class of strictly monotone mean-variance (SMMV) preferences, and demonstrates its applications to portfolio selection problems. For the single-period portfolio problem under the SMMV preference, we derive the gradient condition for the optimal strategy, and investigate its association with the optimal mean-variance (MV) static strategy. A novel contribution of this work is the reduction of the problem to solving a set of linear equations by analyzing the saddle point of some minimax problem. Building on this advancement, we conduct numerical experiments and compare our results with those of Maccheroni, et al. (Math. Finance 19(3): 487-521, 2009). The findings indicate that our SMMV preferences provide a more rational basis for assessing given prospects. For the continuous-time portfolio problem with the SMMV preference, we consider continuous price processes with random coefficients. We establish the condition under which the optimal dynamic strategies for SMMV and MV preferences coincide, and characterize the optimal solution using the dynamic programming principle and the martingale convex duality method, respectively. Consequently, the problem is reduced to solving a stochastic Hamilton-Jacobi-Bellman-Isaacs equation, or a multi-stage linear-quadratic optimization problem with the embedding technique. △ Less

Submitted 27 May, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

Comments: 47 pages

MSC Class: Primary: 91G10; 49N10; Secondary: 91B05; 49N90

arXiv:2412.13521 [pdf, ps, other]

doi 10.1137/23M1621058

On stochastic control problems with higher-order moments

Authors: Yike Wang, Jingzhen Liu, Alain Bensoussan, Ka-Fai Cedric Yiu, Jiaqin Wei

Abstract: In this paper, we focus on a class of time-inconsistent stochastic control problems, where the objective function includes the mean and several higher-order central moments of the terminal value of state. To tackle the time-inconsistency, we seek both the closed-loop and the open-loop Nash equilibrium controls as time-consistent solutions. We establish a partial differential equation (PDE) system… ▽ More In this paper, we focus on a class of time-inconsistent stochastic control problems, where the objective function includes the mean and several higher-order central moments of the terminal value of state. To tackle the time-inconsistency, we seek both the closed-loop and the open-loop Nash equilibrium controls as time-consistent solutions. We establish a partial differential equation (PDE) system for deriving a closed-loop Nash equilibrium control, which does not include the equilibrium value function and is different from the extended Hamilton-Jacobi-Bellman (HJB) equations as in Björk et al. (Finance Stoch. 21: 331-360, 2017). We show that our PDE system is equivalent to the extended HJB equations that seems difficult to be solved for our higher-order moment problems. In deriving an open-loop Nash equilibrium control, due to the non-separable higher-order moments in the objective function, we make some moment estimates in addition to the standard perturbation argument for developing a maximum principle. Then, the problem is reduced to solving a flow of forward-backward stochastic differential equations. In particular, we investigate linear controlled dynamics and some objective functions affine in the mean. The closed-loop and the open-loop Nash equilibrium controls are identical, which are independent of the state value, random path and the preference on the odd-order central moments. By sending the highest order of central moments to infinity, we obtain the time-consistent solutions to some control problems whose objective functions include some penalty functions for deviation. △ Less

Submitted 30 January, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

Comments: 31 pages

MSC Class: Primary: 93E20; 91G80; Secondary: 91B08; 49N90

arXiv:2412.01062 [pdf]

Research on Optimizing Real-Time Data Processing in High-Frequency Trading Algorithms using Machine Learning

Authors: Yuxin Fan, Zhuohuan Hu, Lei Fu, Yu Cheng, Liyang Wang, Yuxiang Wang

Abstract: High-frequency trading (HFT) represents a pivotal and intensely competitive domain within the financial markets. The velocity and accuracy of data processing exert a direct influence on profitability, underscoring the significance of this field. The objective of this work is to optimise the real-time processing of data in high-frequency trading algorithms. The dynamic feature selection mechanism i… ▽ More High-frequency trading (HFT) represents a pivotal and intensely competitive domain within the financial markets. The velocity and accuracy of data processing exert a direct influence on profitability, underscoring the significance of this field. The objective of this work is to optimise the real-time processing of data in high-frequency trading algorithms. The dynamic feature selection mechanism is responsible for monitoring and analysing market data in real time through clustering and feature weight analysis, with the objective of automatically selecting the most relevant features. This process employs an adaptive feature extraction method, which enables the system to respond and adjust its feature set in a timely manner when the data input changes, thus ensuring the efficient utilisation of data. The lightweight neural networks are designed in a modular fashion, comprising fast convolutional layers and pruning techniques that facilitate the expeditious completion of data processing and output prediction. In contrast to conventional deep learning models, the neural network architecture has been specifically designed to minimise the number of parameters and computational complexity, thereby markedly reducing the inference time. The experimental results demonstrate that the model is capable of maintaining consistent performance in the context of varying market conditions, thereby illustrating its advantages in terms of processing speed and revenue enhancement. △ Less

Submitted 1 December, 2024; originally announced December 2024.

arXiv:2410.12825 [pdf, other]

TIMeSynC: Temporal Intent Modelling with Synchronized Context Encodings for Financial Service Applications

Authors: Dwipam Katariya, Juan Manuel Origgi, Yage Wang, Thomas Caputo

Abstract: Users engage with financial services companies through multiple channels, often interacting with mobile applications, web platforms, call centers, and physical locations to service their accounts. The resulting interactions are recorded at heterogeneous temporal resolutions across these domains. This multi-channel data can be combined and encoded to create a comprehensive representation of the cus… ▽ More Users engage with financial services companies through multiple channels, often interacting with mobile applications, web platforms, call centers, and physical locations to service their accounts. The resulting interactions are recorded at heterogeneous temporal resolutions across these domains. This multi-channel data can be combined and encoded to create a comprehensive representation of the customer's journey for accurate intent prediction. This demands sequential learning solutions. NMT transformers achieve state-of-the-art sequential representation learning by encoding context and decoding for the next best action to represent long-range dependencies. However, three major challenges exist while combining multi-domain sequences within an encoder-decoder transformers architecture for intent prediction applications: a) aligning sequences with different sampling rates b) learning temporal dynamics across multi-variate, multi-domain sequences c) combining dynamic and static sequences. We propose an encoder-decoder transformer model to address these challenges for contextual and sequential intent prediction in financial servicing applications. Our experiments show significant improvement over the existing tabular method. △ Less

Submitted 3 February, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

Comments: 6 pages, Accepted at RecTemp @ RecSys 2024

arXiv:2410.00419 [pdf, other]

KANOP: A Data-Efficient Option Pricing Model using Kolmogorov-Arnold Networks

Authors: Rushikesh Handal, Kazuki Matoya, Yunzhuo Wang, Masanori Hirano

Abstract: Inspired by the recently proposed Kolmogorov-Arnold Networks (KANs), we introduce the KAN-based Option Pricing (KANOP) model to value American-style options, building on the conventional Least Square Monte Carlo (LSMC) algorithm. KANs, which are based on Kolmogorov-Arnold representation theorem, offer a data-efficient alternative to traditional Multi-Layer Perceptrons, requiring fewer hidden layer… ▽ More Inspired by the recently proposed Kolmogorov-Arnold Networks (KANs), we introduce the KAN-based Option Pricing (KANOP) model to value American-style options, building on the conventional Least Square Monte Carlo (LSMC) algorithm. KANs, which are based on Kolmogorov-Arnold representation theorem, offer a data-efficient alternative to traditional Multi-Layer Perceptrons, requiring fewer hidden layers to achieve a higher level of performance. By leveraging the flexibility of KANs, KANOP provides a learnable alternative to the conventional set of basis functions used in the LSMC model, allowing the model to adapt to the pricing task and effectively estimate the expected continuation value. Using examples of standard American and Asian-American options, we demonstrate that KANOP produces more reliable option value estimates, both for single-dimensional cases and in more complex scenarios involving multiple input variables. The delta estimated by the KANOP model is also more accurate than that obtained using conventional basis functions, which is crucial for effective option hedging. Graphical illustrations further validate KANOP's ability to accurately model the expected continuation value for American-style options. △ Less

Submitted 1 October, 2024; originally announced October 2024.

arXiv:2409.07494 [pdf, other]

Ethereum Fraud Detection via Joint Transaction Language Model and Graph Representation Learning

Authors: Jianguo Sun, Yifan Jia, Yanbin Wang, Yiwei Liu, Zhang Sheng, Ye Tian

Abstract: Ethereum faces growing fraud threats. Current fraud detection methods, whether employing graph neural networks or sequence models, fail to consider the semantic information and similarity patterns within transactions. Moreover, these approaches do not leverage the potential synergistic benefits of combining both types of models. To address these challenges, we propose TLMG4Eth that combines a tran… ▽ More Ethereum faces growing fraud threats. Current fraud detection methods, whether employing graph neural networks or sequence models, fail to consider the semantic information and similarity patterns within transactions. Moreover, these approaches do not leverage the potential synergistic benefits of combining both types of models. To address these challenges, we propose TLMG4Eth that combines a transaction language model with graph-based methods to capture semantic, similarity, and structural features of transaction data in Ethereum. We first propose a transaction language model that converts numerical transaction data into meaningful transaction sentences, enabling the model to learn explicit transaction semantics. Then, we propose a transaction attribute similarity graph to learn transaction similarity information, enabling us to capture intuitive insights into transaction anomalies. Additionally, we construct an account interaction graph to capture the structural information of the account transaction network. We employ a deep multi-head attention network to fuse transaction semantic and similarity embeddings, and ultimately propose a joint training approach for the multi-head attention network and the account interaction graph to obtain the synergistic benefits of both. △ Less

Submitted 18 February, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

arXiv:2408.11878 [pdf, ps, other]

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Authors: Jimin Huang, Mengxi Xiao, Dong Li, Zihao Jiang, Yuzhe Yang, Yifei Zhang, Lingfei Qian, Yan Wang, Xueqing Peng, Yang Ren, Ruoyu Xiang, Zhengyu Chen, Xiao Zhang, Yueru He, Weiguang Han, Shunian Chen, Lihang Shen, Daniel Kim, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram , et al. (19 additional authors not shown)

Abstract: Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, t… ▽ More Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses. △ Less

Submitted 6 June, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

Comments: 33 pages, 13 figures

arXiv:2408.06634 [pdf, other]

doi 10.1109/DOCS63458.2024.10704454

Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach

Authors: Haowei Ni, Shuchen Meng, Xupeng Chen, Ziqing Zhao, Andi Chen, Panfeng Li, Shiyao Zhang, Qifu Yin, Yuanqing Wang, Yuxi Chan

Abstract: Accurate stock market predictions following earnings reports are crucial for investors. Traditional methods, particularly classical machine learning models, struggle with these predictions because they cannot effectively process and interpret extensive textual data contained in earnings reports and often overlook nuances that influence market movements. This paper introduces an advanced approach b… ▽ More Accurate stock market predictions following earnings reports are crucial for investors. Traditional methods, particularly classical machine learning models, struggle with these predictions because they cannot effectively process and interpret extensive textual data contained in earnings reports and often overlook nuances that influence market movements. This paper introduces an advanced approach by employing Large Language Models (LLMs) instruction fine-tuned with a novel combination of instruction-based techniques and quantized low-rank adaptation (QLoRA) compression. Our methodology integrates 'base factors', such as financial metric growth and earnings transcripts, with 'external factors', including recent market indices performances and analyst grades, to create a rich, supervised dataset. This comprehensive dataset enables our models to achieve superior predictive performance in terms of accuracy, weighted F1, and Matthews correlation coefficient (MCC), especially evident in the comparison with benchmarks such as GPT-4. We specifically highlight the efficacy of the llama-3-8b-Instruct-4bit model, which showcases significant improvements over baseline models. The paper also discusses the potential of expanding the output capabilities to include a 'Hold' option and extending the prediction horizon, aiming to accommodate various investment styles and time frames. This study not only demonstrates the power of integrating cutting-edge AI with fine-tuned financial data but also paves the way for future research in enhancing AI-driven financial analysis tools. △ Less

Submitted 12 November, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

Comments: Accepted by 2024 6th International Conference on Data-driven Optimization of Complex Systems

Journal ref: Proceedings of the 2024 6th International Conference on Data-driven Optimization of Complex Systems (DOCS), 2024, pp. 909-915

arXiv:2407.14335 [pdf, other]

doi 10.1109/MetaCom62920.2024.00028

Quantifying the Blockchain Trilemma: A Comparative Analysis of Algorand, Ethereum 2.0, and Beyond

Authors: Yihang Fu, Mingwei Jing, Jiaolun Zhou, Peilin Wu, Ye Wang, Luyao Zhang, Chuang Hu

Abstract: Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets. However, its potential is constrained by the "Blockchain Trilemma," which necessitates balancing decentralization, security, and scalability. This study evaluates and compares two leading proof-of-stake (PoS) systems, Algorand and Ethereum 2.0, against the… ▽ More Blockchain technology is essential for the digital economy and metaverse, supporting applications from decentralized finance to virtual assets. However, its potential is constrained by the "Blockchain Trilemma," which necessitates balancing decentralization, security, and scalability. This study evaluates and compares two leading proof-of-stake (PoS) systems, Algorand and Ethereum 2.0, against these critical metrics. Our research interprets existing indices to measure decentralization, evaluates scalability through transactional data, and assesses security by identifying potential vulnerabilities. Utilizing real-world data, we analyze each platform's strategies in a structured manner to understand their effectiveness in addressing trilemma challenges. The findings highlight each platform's strengths and propose general methodologies for evaluating key blockchain characteristics applicable to other systems. This research advances the understanding of blockchain technologies and their implications for the future digital economy. Data and code are available on GitHub as open source. △ Less

Submitted 19 July, 2024; originally announced July 2024.

arXiv:2406.01335 [pdf, other]

Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance

Authors: Xi-Ning Zhuang, Zhao-Yun Chen, Cheng Xue, Xiao-Fan Xu, Chao Wang, Huan-Yu Liu, Tai-Ping Sun, Yun-Jie Wang, Yu-Chun Wu, Guo-Ping Guo

Abstract: Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-i… ▽ More Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-informed parameterized quantum circuit (SI-PQC) for efficiently preparing and training of quantum computational statistical models, including arbitrary distributions and their weighted mixtures. The SI-PQC features a static structure with trainable parameters, enabling in-depth optimized circuit compilation, exponential reductions in resource and time consumption, and improved trainability and interpretability for learning quantum states and classical model parameters simultaneously. As an efficient subroutine for preparing and learning in various quantum algorithms, the SI-PQC addresses the input bottleneck and facilitates the injection of prior knowledge. △ Less

Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 19 pages, 5 figures

arXiv:2405.13076 [pdf]

A K-means Algorithm for Financial Market Risk Forecasting

Authors: Jinxin Xu, Kaixian Xu, Yue Wang, Qinyan Shen, Ruisi Li

Abstract: Financial market risk forecasting involves applying mathematical models, historical data analysis and statistical methods to estimate the impact of future market movements on investments. This process is crucial for investors to develop strategies, financial institutions to manage assets and regulators to formulate policy. In today's society, there are problems of high error rate and low precision… ▽ More Financial market risk forecasting involves applying mathematical models, historical data analysis and statistical methods to estimate the impact of future market movements on investments. This process is crucial for investors to develop strategies, financial institutions to manage assets and regulators to formulate policy. In today's society, there are problems of high error rate and low precision in financial market risk prediction, which greatly affect the accuracy of financial market risk prediction. K-means algorithm in machine learning is an effective risk prediction technique for financial market. This study uses K-means algorithm to develop a financial market risk prediction system, which significantly improves the accuracy and efficiency of financial market risk prediction. Ultimately, the outcomes of the experiments confirm that the K-means algorithm operates with user-friendly simplicity and achieves a 94.61% accuracy rate △ Less

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2403.02500 [pdf, ps, other]

RVRAE: A Dynamic Factor Model Based on Variational Recurrent Autoencoder for Stock Returns Prediction

Authors: Yilun Wang, Shengjie Guo

Abstract: In recent years, the dynamic factor model has emerged as a dominant tool in economics and finance, particularly for investment strategies. This model offers improved handling of complex, nonlinear, and noisy market conditions compared to traditional static factor models. The advancement of machine learning, especially in dealing with nonlinear data, has further enhanced asset pricing methodologies… ▽ More In recent years, the dynamic factor model has emerged as a dominant tool in economics and finance, particularly for investment strategies. This model offers improved handling of complex, nonlinear, and noisy market conditions compared to traditional static factor models. The advancement of machine learning, especially in dealing with nonlinear data, has further enhanced asset pricing methodologies. This paper introduces a groundbreaking dynamic factor model named RVRAE. This model is a probabilistic approach that addresses the temporal dependencies and noise in market data. RVRAE ingeniously combines the principles of dynamic factor modeling with the variational recurrent autoencoder (VRAE) from deep learning. A key feature of RVRAE is its use of a prior-posterior learning method. This method fine-tunes the model's learning process by seeking an optimal posterior factor model informed by future data. Notably, RVRAE is adept at risk modeling in volatile stock markets, estimating variances from latent space distributions while also predicting returns. Our empirical tests with real stock market data underscore RVRAE's superior performance compared to various established baseline methods. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.01441 [pdf, ps, other]

Learning the Market: Sentiment-Based Ensemble Trading Agents

Authors: Andrew Ye, James Xu, Vidyut Veedgav, Yi Wang, Yifan Yu, Daniel Yan, Ryan Chen, Vipin Chaudhary, Shuai Xu

Abstract: We propose and study the integration of sentiment analysis and deep reinforcement learning ensemble algorithms for stock trading by evaluating strategies capable of dynamically altering their active agent given the concurrent market environment. In particular, we design a simple-yet-effective method for extracting financial sentiment and combine this with improvements on existing trading agents, r… ▽ More We propose and study the integration of sentiment analysis and deep reinforcement learning ensemble algorithms for stock trading by evaluating strategies capable of dynamically altering their active agent given the concurrent market environment. In particular, we design a simple-yet-effective method for extracting financial sentiment and combine this with improvements on existing trading agents, resulting in a strategy that effectively considers both qualitative market factors and quantitative stock data. We show that our approach results in a strategy that is profitable, robust, and risk-minimal - outperforming the traditional ensemble strategy as well as single agent algorithms and market metrics. Our findings suggest that the conventional practice of switching and reevaluating agents in ensemble every fixed-number of months is sub-optimal, and that a dynamic sentiment-based framework greatly unlocks additional performance. Furthermore, as we have designed our algorithm with simplicity and efficiency in mind, we hypothesize that the transition of our method from historical evaluation towards real-time trading with live data to be relatively simple. △ Less

Submitted 20 November, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

arXiv:2312.14203 [pdf, other]

Shai: A large language model for asset management

Authors: Zhongyang Guo, Guanran Jiang, Zhongdan Zhang, Peng Li, Zhefeng Wang, Yinchun Wang

Abstract: This paper introduces "Shai" a 10B level large language model specifically designed for the asset management industry, built upon an open-source foundational model. With continuous pre-training and fine-tuning using a targeted corpus, Shai demonstrates enhanced performance in tasks relevant to its domain, outperforming baseline models. Our research includes the development of an innovative evaluat… ▽ More This paper introduces "Shai" a 10B level large language model specifically designed for the asset management industry, built upon an open-source foundational model. With continuous pre-training and fine-tuning using a targeted corpus, Shai demonstrates enhanced performance in tasks relevant to its domain, outperforming baseline models. Our research includes the development of an innovative evaluation framework, which integrates professional qualification exams, tailored tasks, open-ended question answering, and safety assessments, to comprehensively assess Shai's capabilities. Furthermore, we discuss the challenges and implications of utilizing large language models like GPT-4 for performance assessment in asset management, suggesting a combination of automated evaluation and human judgment. Shai's development, showcasing the potential and versatility of 10B-level large language models in the financial sector with significant performance and modest computational requirements, hopes to provide practical insights and methodologies to assist industry peers in their similar endeavors. △ Less

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2311.04841 [pdf, ps, other]

Predictable Relative Forward Performance Processes: Multi-Agent and Mean Field Games for Portfolio Management

Authors: Gechun Liang, Moris S. Strub, Yuwei Wang

Abstract: We consider a new framework of predictable relative forward performance processes (PRFPP) to study portfolio management within a competitive environment. Each agent trades a distinct stock following a binomial distribution with probabilities for a positive return depending on the market regime characterized by a binomial common noise. For both the finite population and mean field games, we constru… ▽ More We consider a new framework of predictable relative forward performance processes (PRFPP) to study portfolio management within a competitive environment. Each agent trades a distinct stock following a binomial distribution with probabilities for a positive return depending on the market regime characterized by a binomial common noise. For both the finite population and mean field games, we construct and analyse PRFPPs for initial data of the CARA class along with the associated equilibrium strategies. We find that relative performance concerns do not necessarily lead to more investment in the risky asset. Under some parameter constellations, agents short a stock with positive expected excess return. △ Less

Submitted 2 December, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

arXiv:2310.14881 [pdf, other]

Topological Portfolio Selection and Optimization

Authors: Yuanrong Wang, Antonio Briola, Tomaso Aste

Abstract: Modern portfolio optimization is centered around creating a low-risk portfolio with extensive asset diversification. Following the seminal work of Markowitz, optimal asset allocation can be computed using a constrained optimization model based on empirical covariance. However, covariance is typically estimated from historical lookback observations, and it is prone to noise and may inadequately rep… ▽ More Modern portfolio optimization is centered around creating a low-risk portfolio with extensive asset diversification. Following the seminal work of Markowitz, optimal asset allocation can be computed using a constrained optimization model based on empirical covariance. However, covariance is typically estimated from historical lookback observations, and it is prone to noise and may inadequately represent future market behavior. As a remedy, information filtering networks from network science can be used to mitigate the noise in empirical covariance estimation, and therefore, can bring added value to the portfolio construction process. In this paper, we propose the use of the Statistically Robust Information Filtering Network (SR-IFN) which leverages the bootstrapping techniques to eliminate unnecessary edges during the network formation and enhances the network's noise reduction capability further. We apply SR-IFN to index component stock pools in the US, UK, and China to assess its effectiveness. The SR-IFN network is partially disconnected with isolated nodes representing lesser-correlated assets, facilitating the selection of peripheral, diversified and higher-performing portfolios. Further optimization of performance can be achieved by inversely proportioning asset weights to their centrality based on the resultant network. △ Less

Submitted 23 October, 2023; originally announced October 2023.

arXiv:2310.07427 [pdf, other]

Quantum-Enhanced Forecasting: Leveraging Quantum Gramian Angular Field and CNNs for Stock Return Predictions

Authors: Zhengmeng Xu, Yujie Wang, Xiaotong Feng, Yilin Wang, Yanli Li, Hai Lin

Abstract: We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF). This approach merges the advantages of quantum computing technology with deep learning, aiming to enhance the precision of time series classification and forecasting. We successfully transformed stock return time series data into two-dimensional images suitable for Convolutional Neural Network (CNN) training by… ▽ More We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF). This approach merges the advantages of quantum computing technology with deep learning, aiming to enhance the precision of time series classification and forecasting. We successfully transformed stock return time series data into two-dimensional images suitable for Convolutional Neural Network (CNN) training by designing specific quantum circuits. Distinct from the classical Gramian Angular Field (GAF) approach, QGAF's uniqueness lies in eliminating the need for data normalization and inverse cosine calculations, simplifying the transformation process from time series data to two-dimensional images. To validate the effectiveness of this method, we conducted experiments on datasets from three major stock markets: the China A-share market, the Hong Kong stock market, and the US stock market. Experimental results revealed that compared to the classical GAF method, the QGAF approach significantly improved time series prediction accuracy, reducing prediction errors by an average of 25% for Mean Absolute Error (MAE) and 48% for Mean Squared Error (MSE). This research confirms the potential and promising prospects of integrating quantum computing with deep learning techniques in financial time series forecasting. △ Less

Submitted 11 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2310.05322 [pdf]

Market Crowds' Trading Behaviors, Agreement Prices, and the Implications of Trading Volume

Authors: Leilei Shi, Bing Han, Yingzi Zhu, Liyan Han, Yiwen Wang, Yan Piao

Abstract: It has been long that literature in financial academics focuses mainly on price and return but much less on trading volume. In the past twenty years, it has already linked both price and trading volume to economic fundamentals, and explored the behavioral implications of trading volume such as investor's attitude toward risks, overconfidence, disagreement, and attention etc. However, what is surpr… ▽ More It has been long that literature in financial academics focuses mainly on price and return but much less on trading volume. In the past twenty years, it has already linked both price and trading volume to economic fundamentals, and explored the behavioral implications of trading volume such as investor's attitude toward risks, overconfidence, disagreement, and attention etc. However, what is surprising is how little we really know about trading volume. Here we show that trading volume probability represents the frequency of market crowd's trading action in terms of behavior analysis, and test two adaptive hypotheses relevant to the volume uncertainty associated with price in China stock market. The empirical work reveals that market crowd trade a stock in efficient adaptation except for simple heuristics, gradually tend to achieve agreement on an outcome or an asset price widely on a trading day, and generate such a stationary equilibrium price very often in interaction and competition among themselves no matter whether it is highly overestimated or underestimated. This suggests that asset prices include not only a fundamental value but also private information, speculative, sentiment, attention, gamble, and entertainment values etc. Moreover, market crowd adapt to gain and loss by trading volume increase or decrease significantly in interaction with environment in any two consecutive trading days. Our results demonstrate how interaction between information and news, the trading action, and return outcomes in the three-term feedback loop produces excessive trading volume which includes various internal and external causes. △ Less

Submitted 8 October, 2023; originally announced October 2023.

Comments: 57 pages, 11 figures, 5 tables

Journal ref: Proceedings of 2013 China Finance Review International Conference, 845-897 (2013)

arXiv:2309.08175 [pdf, other]

Closed-form solutions for VIX derivatives in a Legendre empirical model

Authors: Ying-Li Wang, Cheng-Long Xu, Ping He

Abstract: In this paper, we introduce a data-driven, single-parameter Markov diffusion model for the VIX. The volatility factor evolves in $(-1,1)$ with a uniform invariant distribution ensured by Legendre polynomials, mapped to the empirical distribution. We derive analytical series solutions for VIX futures and options using separation of variables to solve the Feynman-Kac PDE. Compared to the 3/2 model,… ▽ More In this paper, we introduce a data-driven, single-parameter Markov diffusion model for the VIX. The volatility factor evolves in $(-1,1)$ with a uniform invariant distribution ensured by Legendre polynomials, mapped to the empirical distribution. We derive analytical series solutions for VIX futures and options using separation of variables to solve the Feynman-Kac PDE. Compared to the 3/2 model, our approach offers equal or superior accuracy and flexibility, providing an efficient, robust alternative for VIX pricing and risk management. Code and data are available at github.com/gagawjbytw/empirical-VIX. △ Less

Submitted 21 May, 2025; v1 submitted 15 September, 2023; originally announced September 2023.

MSC Class: 91G20; 60J25; 65C30

arXiv:2301.08360 [pdf, other]

Domain-adapted Learning and Imitation: DRL for Power Arbitrage

Authors: Yuanrong Wang, Vignesh Raja Swaminathan, Nikita P. Granger, Carlos Ros Perez, Christian Michler

Abstract: In this paper, we discuss the Dutch power market, which is comprised of a day-ahead market and an intraday balancing market that operates like an auction. Due to fluctuations in power supply and demand, there is often an imbalance that leads to different prices in the two markets, providing an opportunity for arbitrage. To address this issue, we restructure the problem and propose a collaborative… ▽ More In this paper, we discuss the Dutch power market, which is comprised of a day-ahead market and an intraday balancing market that operates like an auction. Due to fluctuations in power supply and demand, there is often an imbalance that leads to different prices in the two markets, providing an opportunity for arbitrage. To address this issue, we restructure the problem and propose a collaborative dual-agent reinforcement learning approach for this bi-level simulation and optimization of European power arbitrage trading. We also introduce two new implementations designed to incorporate domain-specific knowledge by imitating the trading behaviours of power traders. By utilizing reward engineering to imitate domain expertise, we are able to reform the reward system for the RL agent, which improves convergence during training and enhances overall performance. Additionally, the tranching of orders increases bidding success rates and significantly boosts profit and loss (P&L). Our study demonstrates that by leveraging domain expertise in a general learning problem, the performance can be improved substantially, and the final integrated approach leads to a three-fold improvement in cumulative P&L compared to the original agent. Furthermore, our methodology outperforms the highest benchmark policy by around 50% while maintaining efficient computational performance. △ Less

Submitted 10 September, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

arXiv:2301.08359 [pdf, other]

Domain-adapted Learning and Interpretability: DRL for Gas Trading

Authors: Yuanrong Wang, Yinsen Miao, Alexander CY Wong, Nikita P Granger, Christian Michler

Abstract: Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but e… ▽ More Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but effective ensemble learning scheme for trading, which significantly improves performance through enhanced model stability and robustness as well as lower turnover and hence lower transaction cost. We discuss the resulting Deep RL strategy in terms of model explainability, trading frequency and risk measures. △ Less

Submitted 10 September, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

arXiv:2208.12614 [pdf, other]

Regime-based Implied Stochastic Volatility Model for Crypto Option Pricing

Authors: Danial Saef, Yuanrong Wang, Tomaso Aste

Abstract: The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), rises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets… ▽ More The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), rises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets. However, they are either prone to the curse of dimensionality, as additional complexity is required to employ traditional theories, or they overfit historical patterns that may never repeat. Instead, we leverage recent advances in market regime (MR) clustering with the Implied Stochastic Volatility Model (ISVM). Time-regime clustering is a temporal clustering method, that clusters the historic evolution of a market into different volatility periods accounting for non-stationarity. ISVM can incorporate investor expectations in each of the sentiment-driven periods by using implied volatility (IV) data. In this paper, we applied this integrated time-regime clustering and ISVM method (termed MR-ISVM) to high-frequency data on BTC options at the popular trading platform Deribit. We demonstrate that MR-ISVM contributes to overcome the burden of complex adaption to jumps in higher order characteristics of option pricing models. This allows us to price the market based on the expectations of its participants in an adaptive fashion. △ Less

Submitted 27 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

ACM Class: G.3

arXiv:2207.13914 [pdf, other]

doi 10.1016/j.frl.2022.103358

Anatomy of a Stablecoin's failure: the Terra-Luna case

Authors: Antonio Briola, David Vidal-Tomás, Yuanrong Wang, Tomaso Aste

Abstract: We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finall… ▽ More We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finally, using state-of-the-art techniques from network science, we study the evolution of dependency structures for 61 highly capitalised cryptocurrencies during the down-market and we also highlight the absence of herding behaviour analysing cross-sectional absolute deviation of returns. △ Less

Submitted 25 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

Comments: 17 pages, 7 figures, 6 tables, 1 appendix

arXiv:2206.12528

Predicting Stock Price Movement after Disclosure of Corporate Annual Reports: A Case Study of 2021 China CSI 300 Stocks

Authors: Fengyu Han, Yue Wang

Abstract: In the current stock market, computer science and technology are more and more widely used to analyse stocks. Not same as most related machine learning stock price prediction work, this work study the predicting the tendency of the stock price on the second day right after the disclosure of the companies' annual reports. We use a variety of different models, including decision tree, logistic regre… ▽ More In the current stock market, computer science and technology are more and more widely used to analyse stocks. Not same as most related machine learning stock price prediction work, this work study the predicting the tendency of the stock price on the second day right after the disclosure of the companies' annual reports. We use a variety of different models, including decision tree, logistic regression, random forest, neural network, prototypical networks. We use two sets of financial indicators (key and expanded) to conduct experiments, these financial indicators are obtained from the EastMoney website disclosed by companies, and finally we find that these models are not well behaved to predict the tendency. In addition, we also filter stocks with ROE greater than 0.15 and net cash ratio greater than 0.9. We conclude that according to the financial indicators based on the just-released annual report of the company, the predictability of the stock price movement on the second day after disclosure is weak, with maximum accuracy about 59.6% and maximum precision about 0.56 on our test set by the random forest classifier, and the stock filtering does not improve the performance. And random forests perform best in general among all these models which conforms to some work's findings. △ Less

Submitted 21 July, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

Comments: My experimental conditions were not set correctly, and almost all the data in the table were filled in incorrectly. I had to repeat all the experiments and make updated descriptions, but the wrong data and descriptions caused confusion to others. I may need several months to redo the experiment, so I hope to withdraw my manuscript first

arXiv:2205.12043 [pdf, ps, other]

Static Replication of Impermanent Loss for Concentrated Liquidity Provision in Decentralised Markets

Authors: Jun Deng, Hua Zong, Yun Wang

Abstract: This article analytically characterizes the impermanent loss of concentrated liquidity provision for automatic market makers in decentralised markets such as Uniswap. We propose two static replication formulas for the impermanent loss by a combination of European calls or puts with strike prices supported on the liquidity provision price interval. It facilitates liquidity providers to hedge perman… ▽ More This article analytically characterizes the impermanent loss of concentrated liquidity provision for automatic market makers in decentralised markets such as Uniswap. We propose two static replication formulas for the impermanent loss by a combination of European calls or puts with strike prices supported on the liquidity provision price interval. It facilitates liquidity providers to hedge permanent loss by trading crypto options in more liquid centralised exchanges such as Deribit. Numerical examples illustrate the astonishing accuracy of the static replication. △ Less

Submitted 2 March, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: 12pages, 1 figure

arXiv:2203.12228 [pdf, ps, other]

Bivariate Distribution Regression with Application to Insurance Data

Authors: Yunyun Wang, Tatsushi Oka, Dan Zhu

Abstract: Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution reg… ▽ More Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution regression and factorization methods. This method is considered semiparametric in that it allows for flexible modeling of both the marginal and joint distributions conditional on covariates without imposing global parametric assumptions across the entire distribution. In contrast to existing parametric approaches, our method can accommodate discrete, continuous, or mixed variables, and provides a simple yet effective way to capture distributional dependence structures between bivariate outcomes and covariates. Various simulation results confirm that our method can perform similarly or better in finite samples compared to the alternative methods. In an application to the study of a motor third-party liability insurance portfolio, the proposed method effectively estimates risk measures such as the conditional Value-at-Risk and Expected Shortfall. This result suggests that this semiparametric approach can serve as an alternative in insurance risk management. △ Less

Submitted 3 September, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

arXiv:2203.03991 [pdf, other]

Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series

Authors: Yuanrong Wang, Tomaso Aste

Abstract: We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural network, our model explicitly leverag… ▽ More We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural network, our model explicitly leverage time-series filtering to overcome the low signal-to-noise ratio typical of complex systems data. We present a set of experiments, where we predict future sales from a synthetic time-series sales dataset. The proposed spatial-temporal graph neural network displays superior performances with respect to baseline approaches, with no graphical information, and with fully connected, disconnected graphs and unfiltered graphs. △ Less

Submitted 8 March, 2022; originally announced March 2022.

Comments: 7 pages, 1 figure, 3tables

arXiv:2202.05779 [pdf, other]

The Evolution of Blockchain: from Lit to Dark

Authors: Agostino Capponi, Ruizhe Jia, Ye Wang

Abstract: Transactions submitted through the blockchain peer-to-peer (P2P) network may leak out exploitable information. We study the economic incentives behind the adoption of blockchain dark venues, where users' transactions are observable only by miners on these venues. We show that miners may not fully adopt dark venues to preserve rents extracted from arbitrageurs, hence creating execution risk for use… ▽ More Transactions submitted through the blockchain peer-to-peer (P2P) network may leak out exploitable information. We study the economic incentives behind the adoption of blockchain dark venues, where users' transactions are observable only by miners on these venues. We show that miners may not fully adopt dark venues to preserve rents extracted from arbitrageurs, hence creating execution risk for users. The dark venue neither eliminates frontrunning risk nor reduces transaction costs. It strictly increases the payoff of miners, weakly increases the payoff of users, and weakly reduces arbitrageurs' profits. We provide empirical support for our main implications, and show that they are economically significant. A 1% increase in the probability of being frontrun raises users' adoption rate of the dark venue by 0.6%. Arbitrageurs' cost-to-revenue ratio increases by a third with a dark venue. △ Less

Submitted 11 February, 2022; originally announced February 2022.

arXiv:2201.02958 [pdf, other]

Smooth Nested Simulation: Bridging Cubic and Square Root Convergence Rates in High Dimensions

Authors: Wenjia Wang, Yanyuan Wang, Xiaowei Zhang

Abstract: Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the con… ▽ More Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the convergence rate as the simulation budget increases, provided that the conditional expectation is sufficiently smooth. The smoothness bridges the gap between the cubic root convergence rate (that is, the optimal rate for the standard nested simulation) and the square root convergence rate (that is, the canonical rate for the standard Monte Carlo simulation). We demonstrate the performance of the proposed method via numerical examples from portfolio risk management and input uncertainty quantification. △ Less

Submitted 11 October, 2023; v1 submitted 9 January, 2022; originally announced January 2022.

Comments: Main body: 46 pages, 5 figures, 5 tables; Supplemental material: 28 pages

arXiv:2112.15499 [pdf, other]

Dynamic Portfolio Optimization with Inverse Covariance Clustering

Authors: Yuanrong Wang, Tomaso Aste

Abstract: Market conditions change continuously. However, in portfolio's investment strategies, it is hard to account for this intrinsic non-stationarity. In this paper, we propose to address this issue by using the Inverse Covariance Clustering (ICC) method to identify inherent market states and then integrate such states into a dynamic portfolio optimization process. Extensive experiments across three dif… ▽ More Market conditions change continuously. However, in portfolio's investment strategies, it is hard to account for this intrinsic non-stationarity. In this paper, we propose to address this issue by using the Inverse Covariance Clustering (ICC) method to identify inherent market states and then integrate such states into a dynamic portfolio optimization process. Extensive experiments across three different markets, NASDAQ, FTSE and HS300, over a period of ten years, demonstrate the advantages of our proposed algorithm, termed Inverse Covariance Clustering-Portfolio Optimization (ICC-PO). The core of the ICC-PO methodology concerns the identification and clustering of market states from the analytics of past data and the forecasting of the future market state. It is therefore agnostic to the specific portfolio optimization method of choice. By applying the same portfolio optimization technique on a ICC temporal cluster, instead of the whole train period, we show that one can generate portfolios with substantially higher Sharpe Ratios, which are statistically more robust and resilient with great reductions in maximum loss in extreme situations. This is shown to be consistent across markets, periods, optimization methods and selection of portfolio assets. △ Less

Submitted 14 January, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

Comments: 12 pages, 2 figures, 2 tables

arXiv:2110.08900 [pdf, other]

Predictable Forward Performance Processes: Infrequent Evaluation and Applications to Human-Machine Interactions

Authors: Gechun Liang, Moris S. Strub, Yuwei Wang

Abstract: We study discrete-time predictable forward processes when trading times do not coincide with performance evaluation times in a binomial tree model for the financial market. The key step in the construction of these processes is to solve a linear functional equation of higher order associated with the inverse problem driving the evolution of the predictable forward process. We provide sufficient co… ▽ More We study discrete-time predictable forward processes when trading times do not coincide with performance evaluation times in a binomial tree model for the financial market. The key step in the construction of these processes is to solve a linear functional equation of higher order associated with the inverse problem driving the evolution of the predictable forward process. We provide sufficient conditions for the existence and uniqueness and an explicit construction of the predictable forward process under these conditions. Furthermore, we find that these processes are inherently myopic in the sense that optimal strategies do not make use of future model parameters even if these are known. Finally, we argue that predictable forward preferences are a viable framework to model human-machine interactions occuring in automated trading or robo-advising. For both applications, we determine an optimal interaction schedule of a human agent interacting infrequently with a machine that is in charge of trading. △ Less

Submitted 2 December, 2023; v1 submitted 17 October, 2021; originally announced October 2021.

arXiv:2108.10403 [pdf, other]

Robust Risk-Aware Reinforcement Learning

Authors: Sebastian Jaimungal, Silvana Pesenti, Ye Sheng Wang, Hariom Tatsat

Abstract: We present a reinforcement learning (RL) approach for robust optimisation of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows the agent to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against mode… ▽ More We present a reinforcement learning (RL) approach for robust optimisation of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we assess the value of a policy using rank dependent expected utility (RDEU). RDEU allows the agent to seek gains, while simultaneously protecting themselves against downside risk. To robustify optimal policies against model uncertainty, we assess a policy not by its distribution, but rather, by the worst possible distribution that lies within a Wasserstein ball around it. Thus, our problem formulation may be viewed as an actor/agent choosing a policy (the outer problem), and the adversary then acting to worsen the performance of that strategy (the inner problem). We develop explicit policy gradient formulae for the inner and outer problems, and show its efficacy on three prototypical financial problems: robust portfolio allocation, optimising a benchmark, and statistical arbitrage. △ Less

Submitted 14 December, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

Comments: 12 pages, 5 figures

MSC Class: 91G70; 91-10; 91-08; 90C17; 93E35

Journal ref: SIAM J. Financial Mathematics, Forthcoming 2021

arXiv:2108.07035

Adaptive Gradient Descent Methods for Computing Implied Volatility

Authors: Yixiao Lu, Yihong Wang, Tinggan Yang

Abstract: In this paper, a new numerical method based on adaptive gradient descent optimizers is provided for computing the implied volatility from the Black-Scholes (B-S) option pricing model. It is shown that the new method is more accurate than the close form approximation. Compared with the Newton-Raphson method, the new method obtains a reliable rate of convergence and tends to be less sensitive to the… ▽ More In this paper, a new numerical method based on adaptive gradient descent optimizers is provided for computing the implied volatility from the Black-Scholes (B-S) option pricing model. It is shown that the new method is more accurate than the close form approximation. Compared with the Newton-Raphson method, the new method obtains a reliable rate of convergence and tends to be less sensitive to the beginning point. △ Less

Submitted 22 March, 2023; v1 submitted 16 August, 2021; originally announced August 2021.

Comments: Our implement of Newton-Raphson iteration has defects. After correcting the code implement, we find Newton-Raphson won't be non-convergent. See https://github.com/cloudy-sfu/Newton-Raphson-Implied-Volatility for details

arXiv:2105.13822 [pdf, other]

Behavior of Liquidity Providers in Decentralized Exchanges

Authors: Lioba Heimbach, Ye Wang, Roger Wattenhofer

Abstract: Decentralized exchanges (DEXes) have introduced an innovative trading mechanism, where it is not necessary to match buy-orders and sell-orders to execute a trade. DEXes execute each trade individually, and the exchange rate is automatically determined by the ratio of assets reserved in the market. Therefore, apart from trading, financial players can also liquidity providers, benefiting from transa… ▽ More Decentralized exchanges (DEXes) have introduced an innovative trading mechanism, where it is not necessary to match buy-orders and sell-orders to execute a trade. DEXes execute each trade individually, and the exchange rate is automatically determined by the ratio of assets reserved in the market. Therefore, apart from trading, financial players can also liquidity providers, benefiting from transaction fees from trades executed in DEXes. Although liquidity providers are essential for the functionality of DEXes, it is not clear how liquidity providers behave in such markets. In this paper, we aim to understand how liquidity providers react to market information and how they benefit from providing liquidity in DEXes. We measure the operations of liquidity providers on Uniswap and analyze how they determine their investment strategy based on market changes. We also reveal their returns and risks of investments in different trading pair categories, i.e., stable pairs, normal pairs, and exotic pairs. Further, we investigate the movement of liquidity between trading pools. To the best of our knowledge, this is the first work that systematically studies the behavior of liquidity providers in DEXes. △ Less

Submitted 11 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

arXiv:2105.02784 [pdf, other]

Cyclic Arbitrage in Decentralized Exchanges

Authors: Ye Wang, Yan Chen, Haotian Wu, Liyi Zhou, Shuiguang Deng, Roger Wattenhofer

Abstract: Decentralized Exchanges (DEXes) enable users to create markets for exchanging any pair of cryptocurrencies. The direct exchange rate of two tokens may not match the cross-exchange rate in the market, and such price discrepancies open up arbitrage possibilities with trading through different cryptocurrencies cyclically. In this paper, we conduct a systematic investigation on cyclic arbitrages in DE… ▽ More Decentralized Exchanges (DEXes) enable users to create markets for exchanging any pair of cryptocurrencies. The direct exchange rate of two tokens may not match the cross-exchange rate in the market, and such price discrepancies open up arbitrage possibilities with trading through different cryptocurrencies cyclically. In this paper, we conduct a systematic investigation on cyclic arbitrages in DEXes. We propose a theoretical framework for studying cyclic arbitrage. With our framework, we analyze the profitability conditions and optimal trading strategies of cyclic transactions. We further examine exploitable arbitrage opportunities and the market size of cyclic arbitrages with transaction-level data of Uniswap V2. We find that traders have executed 292,606 cyclic arbitrages over eleven months and exploited more than 138 million USD in revenue. However, the revenue of the most profitable unexploited opportunity is persistently higher than 1 ETH (4,000 USD), which indicates that DEX markets may not be efficient enough. By analyzing how traders implement cyclic arbitrages, we find that traders can utilize smart contracts to issue atomic transactions and the atomic implementations could mitigate users' financial loss in cyclic arbitrage from the price impact. △ Less

Submitted 14 January, 2022; v1 submitted 21 April, 2021; originally announced May 2021.

arXiv:2104.08686 [pdf, other]

doi 10.1002/fut.22315

A Black-Scholes user's guide to the Bachelier model

Authors: Jaehyuk Choi, Minsuk Kwak, Chyng Wen Tee, Yumeng Wang

Abstract: To cope with the negative oil futures price caused by the COVID-19 recession, global commodity futures exchanges temporarily switched the option model from Black--Scholes to Bachelier in 2020. This study reviews the literature on Bachelier's pioneering option pricing model and summarizes the practical results on volatility conversion, risk management, stochastic volatility, and barrier options pri… ▽ More To cope with the negative oil futures price caused by the COVID-19 recession, global commodity futures exchanges temporarily switched the option model from Black--Scholes to Bachelier in 2020. This study reviews the literature on Bachelier's pioneering option pricing model and summarizes the practical results on volatility conversion, risk management, stochastic volatility, and barrier options pricing to facilitate the model transition. In particular, using the displaced Black-Scholes model as a model family with the Black-Scholes and Bachelier models as special cases, we not only connect the two models but also present a continuous spectrum of model choices. △ Less

Submitted 6 February, 2022; v1 submitted 17 April, 2021; originally announced April 2021.

Journal ref: Journal of Futures Markets, 42(5):959-980, 2022

arXiv:2102.13467 [pdf, other]

Overnight GARCH-Itô Volatility Models

Authors: Donggyu Kim, Minseok Shin, Yazhen Wang

Abstract: Various parametric volatility models for financial data have been developed to incorporate high-frequency realized volatilities and better capture market dynamics. However, because high-frequency trading data are not available during the close-to-open period, the volatility models often ignore volatility information over the close-to-open period and thus may suffer from loss of important informati… ▽ More Various parametric volatility models for financial data have been developed to incorporate high-frequency realized volatilities and better capture market dynamics. However, because high-frequency trading data are not available during the close-to-open period, the volatility models often ignore volatility information over the close-to-open period and thus may suffer from loss of important information relevant to market dynamics. In this paper, to account for whole-day market dynamics, we propose an overnight volatility model based on Itô diffusions to accommodate two different instantaneous volatility processes for the open-to-close and close-to-open periods. We develop a weighted least squares method to estimate model parameters for two different periods and investigate its asymptotic properties. We conduct a simulation study to check the finite sample performance of the proposed model and method. Finally, we apply the proposed approaches to real trading data. △ Less

Submitted 17 June, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

arXiv:2011.01961 [pdf, other]

Insights into Fairness through Trust: Multi-scale Trust Quantification for Financial Deep Learning

Authors: Alexander Wong, Andrew Hryniowski, Xiao Yu Wang

Abstract: The success of deep learning in recent years have led to a significant increase in interest and prevalence for its adoption to tackle financial services tasks. One particular question that often arises as a barrier to adopting deep learning for financial services is whether the developed financial deep learning models are fair in their predictions, particularly in light of strong governance and re… ▽ More The success of deep learning in recent years have led to a significant increase in interest and prevalence for its adoption to tackle financial services tasks. One particular question that often arises as a barrier to adopting deep learning for financial services is whether the developed financial deep learning models are fair in their predictions, particularly in light of strong governance and regulatory compliance requirements in the financial services industry. A fundamental aspect of fairness that has not been explored in financial deep learning is the concept of trust, whose variations may point to an egocentric view of fairness and thus provide insights into the fairness of models. In this study we explore the feasibility and utility of a multi-scale trust quantification strategy to gain insights into the fairness of a financial deep learning model, particularly under different scenarios at different scales. More specifically, we conduct multi-scale trust quantification on a deep neural network for the purpose of credit card default prediction to study: 1) the overall trustworthiness of the model 2) the trust level under all possible prediction-truth relationships, 3) the trust level across the spectrum of possible predictions, 4) the trust level across different demographic groups (e.g., age, gender, and education), and 5) distribution of overall trust for an individual prediction scenario. The insights for this proof-of-concept study demonstrate that such a multi-scale trust quantification strategy may be helpful for data scientists and regulators in financial services as part of the verification and certification of financial deep learning solutions to gain insights into fairness and trust of these solutions. △ Less

Submitted 3 November, 2020; originally announced November 2020.

Comments: 9 pages

arXiv:2010.01197 [pdf, other]

Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network

Authors: Xing Wang, Yijun Wang, Bin Weng, Aleksandr Vinel

Abstract: We have proposed to develop a global hybrid deep learning framework to predict the daily prices in the stock market. With representation learning, we derived an embedding called Stock2Vec, which gives us insight for the relationship among different stocks, while the temporal convolutional layers are used for automatically capturing effective temporal patterns both within and across series. Evaluat… ▽ More We have proposed to develop a global hybrid deep learning framework to predict the daily prices in the stock market. With representation learning, we derived an embedding called Stock2Vec, which gives us insight for the relationship among different stocks, while the temporal convolutional layers are used for automatically capturing effective temporal patterns both within and across series. Evaluated on S&P 500, our hybrid framework integrates both advantages and achieves better performance on the stock price prediction task than several popular benchmarked models. △ Less

Submitted 29 September, 2020; originally announced October 2020.

arXiv:2009.04536 [pdf, other]

doi 10.1145/3374135.3385272

Improving Investment Suggestions for Peer-to-Peer (P2P) Lending via Integrating Credit Scoring into Profit Scoring

Authors: Yan Wang, Xuelei Sherry Ni

Abstract: In the peer-to-peer (P2P) lending market, lenders lend the money to the borrowers through a virtual platform and earn the possible profit generated by the interest rate. From the perspective of lenders, they want to maximize the profit while minimizing the risk. Therefore, many studies have used machine learning algorithms to help the lenders identify the "best" loans for making investments. The s… ▽ More In the peer-to-peer (P2P) lending market, lenders lend the money to the borrowers through a virtual platform and earn the possible profit generated by the interest rate. From the perspective of lenders, they want to maximize the profit while minimizing the risk. Therefore, many studies have used machine learning algorithms to help the lenders identify the "best" loans for making investments. The studies have mainly focused on two categories to guide the lenders' investments: one aims at minimizing the risk of investment (i.e., the credit scoring perspective) while the other aims at maximizing the profit (i.e., the profit scoring perspective). However, they have all focused on one category only and there is seldom research trying to integrate the two categories together. Motivated by this, we propose a two-stage framework that incorporates the credit information into a profit scoring modeling. We conducted the empirical experiment on a real-world P2P lending data from the US P2P market and used the Light Gradient Boosting Machine (lightGBM) algorithm in the two-stage framework. Results show that the proposed two-stage method could identify more profitable loans and thereby provide better investment guidance to the investors compared to the existing one-stage profit scoring alone approach. Therefore, the proposed framework serves as an innovative perspective for making investment decisions in P2P lending. △ Less

Submitted 9 September, 2020; originally announced September 2020.

Showing 1–50 of 81 results for author: Wang, Y