-
Fast Learning in Quantitative Finance with Extreme Learning Machine
Authors:
Liexin Cheng,
Xue Cheng,
Shuaiqiang Liu
Abstract:
A critical factor in adopting machine learning for time-sensitive financial tasks is computational speed, including model training and inference. This paper demonstrates that a broad class of such problems, especially those previously addressed using deep neural networks, can be efficiently solved using single-layer neural networks without iterative gradient-based training. This is achieved throug…
▽ More
A critical factor in adopting machine learning for time-sensitive financial tasks is computational speed, including model training and inference. This paper demonstrates that a broad class of such problems, especially those previously addressed using deep neural networks, can be efficiently solved using single-layer neural networks without iterative gradient-based training. This is achieved through the extreme learning machine (ELM) framework. ELM utilizes a single-layer network with randomly initialized hidden nodes and output weights obtained via convex optimization, enabling rapid training and inference. We present various applications in both supervised and unsupervised learning settings, including option pricing, intraday return prediction, volatility surface fitting, and numerical solution of partial differential equations. Across these examples, ELM demonstrates notable improvements in computational efficiency while maintaining comparable accuracy and generalization compared to deep neural networks and classical machine learning methods. We also briefly discuss theoretical aspects of ELM implementation and its generalization capabilities.
△ Less
Submitted 24 May, 2025; v1 submitted 14 May, 2025;
originally announced May 2025.
-
Discrimination-free Insurance Pricing with Privatized Sensitive Attributes
Authors:
Tianhe Zhang,
Suhan Liu,
Peng Shi
Abstract:
Fairness has emerged as a critical consideration in the landscape of machine learning algorithms, particularly as AI continues to transform decision-making across societal domains. To ensure that these algorithms are free from bias and do not discriminate against individuals based on sensitive attributes such as gender and race, the field of algorithmic bias has introduced various fairness concept…
▽ More
Fairness has emerged as a critical consideration in the landscape of machine learning algorithms, particularly as AI continues to transform decision-making across societal domains. To ensure that these algorithms are free from bias and do not discriminate against individuals based on sensitive attributes such as gender and race, the field of algorithmic bias has introduced various fairness concepts, along with methodologies to achieve these notions in different contexts. Despite the rapid advancement, not all sectors have embraced these fairness principles to the same extent. One specific sector that merits attention in this regard is insurance. Within the realm of insurance pricing, fairness is defined through a distinct and specialized framework. Consequently, achieving fairness according to established notions does not automatically ensure fair pricing in insurance. In particular, regulators are increasingly emphasizing transparency in pricing algorithms and imposing constraints on insurance companies on the collection and utilization of sensitive consumer attributes. These factors present additional challenges in the implementation of fairness in pricing algorithms. To address these complexities and comply with regulatory demands, we propose an efficient method for constructing fair models that are tailored to the insurance domain, using only privatized sensitive attributes. Notably, our approach ensures statistical guarantees, does not require direct access to sensitive attributes, and adapts to varying transparency requirements, addressing regulatory demands while ensuring fairness in insurance pricing.
△ Less
Submitted 16 April, 2025;
originally announced April 2025.
-
Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment
Authors:
Shuoling Liu,
Gaoguo Jia,
Yuhang Jiang,
Liyuan Chen,
Qiang Yang
Abstract:
Large Language Models (LLMs) have achieved remarkable success recently, displaying exceptional capabilities in creating understandable and organized text. These LLMs have been utilized in diverse fields, such as clinical research, where domain-specific models like Med-Palm have achieved human-level performance. Recently, researchers have employed advanced prompt engineering to enhance the general…
▽ More
Large Language Models (LLMs) have achieved remarkable success recently, displaying exceptional capabilities in creating understandable and organized text. These LLMs have been utilized in diverse fields, such as clinical research, where domain-specific models like Med-Palm have achieved human-level performance. Recently, researchers have employed advanced prompt engineering to enhance the general reasoning ability of LLMs. Despite the remarkable success of zero-shot Chain-of-Thoughts (CoT) in solving general reasoning tasks, the potential of these methods still remains paid limited attention in the financial reasoning task.To address this issue, we explore multiple prompt strategies and incorporated semantic news information to improve LLMs' performance on financial reasoning tasks.To the best of our knowledge, we are the first to explore this important issue by applying ChatGPT to the gold investment.In this work, our aim is to investigate the financial reasoning capabilities of LLMs and their capacity to generate logical and persuasive investment opinions. We will use ChatGPT, one of the most powerful LLMs recently, and prompt engineering to achieve this goal. Our research will focus on understanding the ability of LLMs in sophisticated analysis and reasoning within the context of investment decision-making. Our study finds that ChatGPT with CoT prompt can provide more explainable predictions and overcome behavioral biases, which is crucial in finance-related tasks and can achieve higher investment returns.
△ Less
Submitted 15 January, 2025; v1 submitted 19 November, 2024;
originally announced November 2024.
-
StockTime: A Time Series Specialized Large Language Model Architecture for Stock Price Prediction
Authors:
Shengkun Wang,
Taoran Ji,
Linhan Wang,
Yanshen Sun,
Shang-Ching Liu,
Amit Kumar,
Chang-Tien Lu
Abstract:
The stock price prediction task holds a significant role in the financial domain and has been studied for a long time. Recently, large language models (LLMs) have brought new ways to improve these predictions. While recent financial large language models (FinLLMs) have shown considerable progress in financial NLP tasks compared to smaller pre-trained language models (PLMs), challenges persist in s…
▽ More
The stock price prediction task holds a significant role in the financial domain and has been studied for a long time. Recently, large language models (LLMs) have brought new ways to improve these predictions. While recent financial large language models (FinLLMs) have shown considerable progress in financial NLP tasks compared to smaller pre-trained language models (PLMs), challenges persist in stock price forecasting. Firstly, effectively integrating the modalities of time series data and natural language to fully leverage these capabilities remains complex. Secondly, FinLLMs focus more on analysis and interpretability, which can overlook the essential features of time series data. Moreover, due to the abundance of false and redundant information in financial markets, models often produce less accurate predictions when faced with such input data. In this paper, we introduce StockTime, a novel LLM-based architecture designed specifically for stock price data. Unlike recent FinLLMs, StockTime is specifically designed for stock price time series data. It leverages the natural ability of LLMs to predict the next token by treating stock prices as consecutive tokens, extracting textual information such as stock correlations, statistical trends and timestamps directly from these stock prices. StockTime then integrates both textual and time series data into the embedding space. By fusing this multimodal data, StockTime effectively predicts stock prices across arbitrary look-back periods. Our experiments demonstrate that StockTime outperforms recent LLMs, as it gives more accurate predictions while reducing memory usage and runtime costs.
△ Less
Submitted 24 August, 2024;
originally announced September 2024.
-
Improved model-free bounds for multi-asset options using option-implied information and deep learning
Authors:
Evangelia Dragazi,
Shuaiqiang Liu,
Antonis Papapantoleon
Abstract:
We consider the computation of model-free bounds for multi-asset options in a setting that combines dependence uncertainty with additional information on the dependence structure. More specifically, we consider the setting where the marginal distributions are known and partial information, in the form of known prices for multi-asset options, is also available in the market. We provide a fundamenta…
▽ More
We consider the computation of model-free bounds for multi-asset options in a setting that combines dependence uncertainty with additional information on the dependence structure. More specifically, we consider the setting where the marginal distributions are known and partial information, in the form of known prices for multi-asset options, is also available in the market. We provide a fundamental theorem of asset pricing in this setting, as well as a superhedging duality that allows to transform the maximization problem over probability measures in a more tractable minimization problem over trading strategies. The latter is solved using a penalization approach combined with a deep learning approximation using artificial neural networks. The numerical method is fast and the computational time scales linearly with respect to the number of traded assets. We finally examine the significance of various pieces of additional information. Empirical evidence suggests that "relevant" information, i.e. prices of derivatives with the same payoff structure as the target payoff, are more useful that other information, and should be prioritized in view of the trade-off between accuracy and computational efficiency.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Financial Time-Series Forecasting: Towards Synergizing Performance And Interpretability Within a Hybrid Machine Learning Approach
Authors:
Shun Liu,
Kexin Wu,
Chufeng Jiang,
Bin Huang,
Danqing Ma
Abstract:
In the realm of cryptocurrency, the prediction of Bitcoin prices has garnered substantial attention due to its potential impact on financial markets and investment strategies. This paper propose a comparative study on hybrid machine learning algorithms and leverage on enhancing model interpretability. Specifically, linear regression(OLS, LASSO), long-short term memory(LSTM), decision tree regresso…
▽ More
In the realm of cryptocurrency, the prediction of Bitcoin prices has garnered substantial attention due to its potential impact on financial markets and investment strategies. This paper propose a comparative study on hybrid machine learning algorithms and leverage on enhancing model interpretability. Specifically, linear regression(OLS, LASSO), long-short term memory(LSTM), decision tree regressors are introduced. Through the grounded experiments, we observe linear regressor achieves the best performance among candidate models. For the interpretability, we carry out a systematic overview on the preprocessing techniques of time-series statistics, including decomposition, auto-correlational function, exponential triple forecasting, which aim to excavate latent relations and complex patterns appeared in the financial time-series forecasting. We believe this work may derive more attention and inspire more researches in the realm of time-series analysis and its realistic applications.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Optimal Market Making in the Chinese Stock Market: A Stochastic Control and Scenario Analysis
Authors:
Shiqi Gong,
Shuaiqiang Liu,
Danny D. Sun
Abstract:
Market making plays a crucial role in providing liquidity and maintaining stability in financial markets, making it an essential component of well-functioning capital markets. Despite its importance, there is limited research on market making in the Chinese stock market, which is one of the largest and most rapidly growing markets globally. To address this gap, we employ an optimal market making f…
▽ More
Market making plays a crucial role in providing liquidity and maintaining stability in financial markets, making it an essential component of well-functioning capital markets. Despite its importance, there is limited research on market making in the Chinese stock market, which is one of the largest and most rapidly growing markets globally. To address this gap, we employ an optimal market making framework with an exponential CARA-type (Constant Absolute Risk Aversion) utility function that accounts for various market conditions, such as price drift, volatility, and stamp duty, and is capable of describing 3 major risks (i.e., inventory, execution and adverse selection risks) in market making practice, and provide an in-depth quantitative and scenario analysis of market making in the Chinese stock market. Our numerical experiments explore the impact of volatility on the market maker's inventory. Furthermore, we find that the stamp duty rate is a critical factor in market making, with a negative impact on both the profit of the market maker and the liquidity of the market. Additionally, our analysis emphasizes the significance of accurately estimating stock drift for managing inventory and adverse selection risks effectively and enhancing profit for the market maker. These findings offer valuable insights for both market makers and policymakers in the Chinese stock market and provide directions for further research in designing effective market making strategies and policies.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
GPU acceleration of the Seven-League Scheme for large time step simulations of stochastic differential equations
Authors:
Shuaiqiang Liu,
Graziana Colonna,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
Monte Carlo simulation is widely used to numerically solve stochastic differential equations. Although the method is flexible and easy to implement, it may be slow to converge. Moreover, an inaccurate solution will result when using large time steps. The Seven League scheme, a deep learning-based numerical method, has been proposed to address these issues. This paper generalizes the scheme regardi…
▽ More
Monte Carlo simulation is widely used to numerically solve stochastic differential equations. Although the method is flexible and easy to implement, it may be slow to converge. Moreover, an inaccurate solution will result when using large time steps. The Seven League scheme, a deep learning-based numerical method, has been proposed to address these issues. This paper generalizes the scheme regarding parallel computing, particularly on Graphics Processing Units (GPUs), improving the computational speed.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks
Authors:
Jie Zou,
Jiashu Lou,
Baohua Wang,
Sixue Liu
Abstract:
More and more stock trading strategies are constructed using deep reinforcement learning (DRL) algorithms, but DRL methods originally widely used in the gaming community are not directly adaptable to financial data with low signal-to-noise ratios and unevenness, and thus suffer from performance shortcomings. In this paper, to capture the hidden information, we propose a DRL based stock trading sys…
▽ More
More and more stock trading strategies are constructed using deep reinforcement learning (DRL) algorithms, but DRL methods originally widely used in the gaming community are not directly adaptable to financial data with low signal-to-noise ratios and unevenness, and thus suffer from performance shortcomings. In this paper, to capture the hidden information, we propose a DRL based stock trading system using cascaded LSTM, which first uses LSTM to extract the time-series features from stock daily data, and then the features extracted are fed to the agent for training, while the strategy functions in reinforcement learning also use another LSTM for training. Experiments in DJI in the US market and SSE50 in the Chinese stock market show that our model outperforms previous baseline models in terms of cumulative returns and Sharp ratio, and this advantage is more significant in the Chinese stock market, a merging market. It indicates that our proposed method is a promising way to build a automated stock trading system.
△ Less
Submitted 26 July, 2023; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Incorporating Interactive Facts for Stock Selection via Neural Recursive ODEs
Authors:
Qiang Gao,
Xinzhu Zhou,
Kunpeng Zhang,
Li Huang,
Siyuan Liu,
Fan Zhou
Abstract:
Stock selection attempts to rank a list of stocks for optimizing investment decision making, aiming at minimizing investment risks while maximizing profit returns. Recently, researchers have developed various (recurrent) neural network-based methods to tackle this problem. Without exceptions, they primarily leverage historical market volatility to enhance the selection performance. However, these…
▽ More
Stock selection attempts to rank a list of stocks for optimizing investment decision making, aiming at minimizing investment risks while maximizing profit returns. Recently, researchers have developed various (recurrent) neural network-based methods to tackle this problem. Without exceptions, they primarily leverage historical market volatility to enhance the selection performance. However, these approaches greatly rely on discrete sampled market observations, which either fail to consider the uncertainty of stock fluctuations or predict continuous stock dynamics in the future. Besides, some studies have considered the explicit stock interdependence derived from multiple domains (e.g., industry and shareholder). Nevertheless, the implicit cross-dependencies among different domains are under-explored. To address such limitations, we present a novel stock selection solution -- StockODE, a latent variable model with Gaussian prior. Specifically, we devise a Movement Trend Correlation module to expose the time-varying relationships regarding stock movements. We design Neural Recursive Ordinary Differential Equation Networks (NRODEs) to capture the temporal evolution of stock volatility in a continuous dynamic manner. Moreover, we build a hierarchical hypergraph to incorporate the domain-aware dependencies among the stocks. Experiments conducted on two real-world stock market datasets demonstrate that StockODE significantly outperforms several baselines, such as up to 18.57% average improvement regarding Sharpe Ratio.
△ Less
Submitted 28 October, 2022;
originally announced October 2022.
-
Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution
Authors:
Feiyang Pan,
Tongzhe Zhang,
Ling Luo,
Jia He,
Shuoling Liu
Abstract:
Optimal execution is a sequential decision-making problem for cost-saving in algorithmic trading. Studies have found that reinforcement learning (RL) can help decide the order-splitting sizes. However, a problem remains unsolved: how to place limit orders at appropriate limit prices? The key challenge lies in the "continuous-discrete duality" of the action space. On the one hand, the continuous ac…
▽ More
Optimal execution is a sequential decision-making problem for cost-saving in algorithmic trading. Studies have found that reinforcement learning (RL) can help decide the order-splitting sizes. However, a problem remains unsolved: how to place limit orders at appropriate limit prices? The key challenge lies in the "continuous-discrete duality" of the action space. On the one hand, the continuous action space using percentage changes in prices is preferred for generalization. On the other hand, the trader eventually needs to choose limit prices discretely due to the existence of the tick size, which requires specialization for every single stock with different characteristics (e.g., the liquidity and the price range). So we need continuous control for generalization and discrete control for specialization. To this end, we propose a hybrid RL method to combine the advantages of both of them. We first use a continuous control agent to scope an action subset, then deploy a fine-grained agent to choose a specific limit price. Extensive experiments show that our method has higher sample efficiency and better training stability than existing RL algorithms and significantly outperforms previous learning-based methods for order execution.
△ Less
Submitted 22 July, 2022;
originally announced July 2022.
-
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Predictions
Authors:
Yong Xie,
Dakuo Wang,
Pin-Yu Chen,
Jinjun Xiong,
Sijia Liu,
Sanmi Koyejo
Abstract:
More and more investors and machine learning models rely on social media (e.g., Twitter and Reddit) to gather real-time information and sentiment to predict stock price movements. Although text-based models are known to be vulnerable to adversarial attacks, whether stock prediction models have similar vulnerability is underexplored. In this paper, we experiment with a variety of adversarial attack…
▽ More
More and more investors and machine learning models rely on social media (e.g., Twitter and Reddit) to gather real-time information and sentiment to predict stock price movements. Although text-based models are known to be vulnerable to adversarial attacks, whether stock prediction models have similar vulnerability is underexplored. In this paper, we experiment with a variety of adversarial attack configurations to fool three stock prediction victim models. We address the task of adversarial generation by solving combinatorial optimization problems with semantics and budget constraints. Our results show that the proposed attack method can achieve consistent success rates and cause significant monetary loss in trading simulation by simply concatenating a perturbed but semantically similar tweet.
△ Less
Submitted 12 July, 2022; v1 submitted 1 May, 2022;
originally announced May 2022.
-
Solution of integrals with fractional Brownian motion for different Hurst indices
Authors:
Fei Gao,
Shuaiqiang Liu,
Cornelis W. Oosterlee,
Nico M. Temme
Abstract:
In this paper, we will evaluate integrals that define the conditional expectation, variance and characteristic function of stochastic processes with respect to fractional Brownian motion (fBm) for all relevant Hurst indices, i.e. $H \in (0,1)$. The fractional Ornstein-Uhlenbeck (fOU) process, for example, gives rise to highly nontrivial integration formulas that need careful analysis when consider…
▽ More
In this paper, we will evaluate integrals that define the conditional expectation, variance and characteristic function of stochastic processes with respect to fractional Brownian motion (fBm) for all relevant Hurst indices, i.e. $H \in (0,1)$. The fractional Ornstein-Uhlenbeck (fOU) process, for example, gives rise to highly nontrivial integration formulas that need careful analysis when considering the whole range of Hurst indices. We will show that the classical technique of analytic continuation, from complex analysis, provides a way of extending the domain of validity of an integral, from $H\in(1/2,1)$, to the larger domain, $H\in(0,1)$. Numerical experiments for different Hurst indices confirm the robustness and efficiency of the integral formulations presented here. Moreover, we provide accurate and highly efficient financial option pricing results for processes that are related to the fOU process, with the help of Fourier cosine expansions.
△ Less
Submitted 11 March, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Superhedging duality for multi-action options under model uncertainty with information delay
Authors:
Anna Aksamit,
Ivan Guo,
Shidan Liu,
Zhou Zhou
Abstract:
We consider the superhedging price of an exotic option under nondominated model uncertainty in discrete time in which the option buyer chooses some action from an (uncountable) action space at each time step. By introducing an enlarged space we reformulate the superhedging problem for such an exotic option as a problem for a European option, which enables us to prove the pricing-hedging duality. N…
▽ More
We consider the superhedging price of an exotic option under nondominated model uncertainty in discrete time in which the option buyer chooses some action from an (uncountable) action space at each time step. By introducing an enlarged space we reformulate the superhedging problem for such an exotic option as a problem for a European option, which enables us to prove the pricing-hedging duality. Next, we present a duality result that, when the option buyers action is observed by the seller up to $l$ periods later, the superhedging price equals the model-based price where the option buyer has the power to look into the future for $l$ periods.
△ Less
Submitted 1 November, 2023; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Monte Carlo Simulation of SDEs using GANs
Authors:
Jorino van Rhijn,
Cornelis W. Oosterlee,
Lech A. Grzelak,
Shuaiqiang Liu
Abstract:
Generative adversarial networks (GANs) have shown promising results when applied on partial differential equations and financial time series generation. We investigate if GANs can also be used to approximate one-dimensional Ito stochastic differential equations (SDEs). We propose a scheme that approximates the path-wise conditional distribution of SDEs for large time steps. Standard GANs are only…
▽ More
Generative adversarial networks (GANs) have shown promising results when applied on partial differential equations and financial time series generation. We investigate if GANs can also be used to approximate one-dimensional Ito stochastic differential equations (SDEs). We propose a scheme that approximates the path-wise conditional distribution of SDEs for large time steps. Standard GANs are only able to approximate processes in distribution, yielding a weak approximation to the SDE. A conditional GAN architecture is proposed that enables strong approximation. We inform the discriminator of this GAN with the map between the prior input to the generator and the corresponding output samples, i.e. we introduce a `supervised GAN'. We compare the input-output map obtained with the standard GAN and supervised GAN and show experimentally that the standard GAN may fail to provide a path-wise approximation. The GAN is trained on a dataset obtained with exact simulation. The architecture was tested on geometric Brownian motion (GBM) and the Cox-Ingersoll-Ross (CIR) process. The supervised GAN outperformed the Euler and Milstein schemes in strong error on a discretisation with large time steps. It also outperformed the standard conditional GAN when approximating the conditional distribution. We also demonstrate how standard GANs may give rise to non-parsimonious input-output maps that are sensitive to perturbations, which motivates the need for constraints and regularisation on GAN generators.
△ Less
Submitted 3 April, 2021;
originally announced April 2021.
-
Knowledge Discovery in Cryptocurrency Transactions: A Survey
Authors:
Xiao Fan Liu,
Xin-Jian Jiang,
Si-Hao Liu,
Chi Kong Tse
Abstract:
Cryptocurrencies gain trust in users by publicly disclosing the full creation and transaction history. In return, the transaction history faithfully records the whole spectrum of cryptocurrency user behaviors. This article analyzes and summarizes the existing research on knowledge discovery in the cryptocurrency transactions using data mining techniques. Specifically, we classify the existing rese…
▽ More
Cryptocurrencies gain trust in users by publicly disclosing the full creation and transaction history. In return, the transaction history faithfully records the whole spectrum of cryptocurrency user behaviors. This article analyzes and summarizes the existing research on knowledge discovery in the cryptocurrency transactions using data mining techniques. Specifically, we classify the existing research into three aspects, i.e., transaction tracings and blockchain address linking, the analyses of collective user behaviors, and the study of individual user behaviors. For each aspect, we present the problems, summarize the methodologies, and discuss major findings in the literature. Furthermore, an enumeration of transaction data parsing and visualization tools and services is also provided. Finally, we outline several future directions in this research area, such as the current rapid development of Decentralized Finance (De-Fi) and digital fiat money.
△ Less
Submitted 2 October, 2020;
originally announced October 2020.
-
The Seven-League Scheme: Deep learning for large time step Monte Carlo simulations of stochastic differential equations
Authors:
Shuaiqiang Liu,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
We propose an accurate data-driven numerical scheme to solve Stochastic Differential Equations (SDEs), by taking large time steps. The SDE discretization is built up by means of a polynomial chaos expansion method, on the basis of accurately determined stochastic collocation (SC) points. By employing an artificial neural network to learn these SC points, we can perform Monte Carlo simulations with…
▽ More
We propose an accurate data-driven numerical scheme to solve Stochastic Differential Equations (SDEs), by taking large time steps. The SDE discretization is built up by means of a polynomial chaos expansion method, on the basis of accurately determined stochastic collocation (SC) points. By employing an artificial neural network to learn these SC points, we can perform Monte Carlo simulations with large time steps. Error analysis confirms that this data-driven scheme results in accurate SDE solutions in the sense of strong convergence, provided the learning methodology is robust and accurate. With a method variant called the compression-decompression collocation and interpolation technique, we can drastically reduce the number of neural network functions that have to be learned, so that computational speed is enhanced. Numerical experiments confirm a high-quality strong convergence error when using large time steps, and the novel scheme outperforms some classical numerical SDE discretizations. Some applications, here in financial option valuation, are also presented.
△ Less
Submitted 23 September, 2021; v1 submitted 7 September, 2020;
originally announced September 2020.
-
On Calibration Neural Networks for extracting implied information from American options
Authors:
Shuaiqiang Liu,
Álvaro Leitao,
Anastasia Borovykh,
Cornelis W. Oosterlee
Abstract:
Extracting implied information, like volatility and/or dividend, from observed option prices is a challenging task when dealing with American options, because of the computational costs needed to solve the corresponding mathematical problem many thousands of times. We will employ a data-driven machine learning approach to estimate the Black-Scholes implied volatility and the dividend yield for Ame…
▽ More
Extracting implied information, like volatility and/or dividend, from observed option prices is a challenging task when dealing with American options, because of the computational costs needed to solve the corresponding mathematical problem many thousands of times. We will employ a data-driven machine learning approach to estimate the Black-Scholes implied volatility and the dividend yield for American options in a fast and robust way. To determine the implied volatility, the inverse function is approximated by an artificial neural network on the computational domain of interest, which decouples the offline (training) and online (prediction) phases and thus eliminates the need for an iterative process. For the implied dividend yield, we formulate the inverse problem as a calibration problem and determine simultaneously the implied volatility and dividend yield. For this, a generic and robust calibration framework, the Calibration Neural Network (CaNN), is introduced to estimate multiple parameters. It is shown that machine learning can be used as an efficient numerical technique to extract implied information from American options.
△ Less
Submitted 31 January, 2020;
originally announced January 2020.
-
Do Chinese Internet Users Exist Heterogeneity in Search Behavior?
Authors:
Ren-jie Han,
Shi-yuan Liu,
Qian Li
Abstract:
Investor attention is an important concept in behavioral finance. Many articles have conducted cross-disciplinary research leading by this concept. In this paper, we use data extraction technology to collect a large number of Baidu Index keyword search volume data. After analyzing the data, we draw a conclusion that has not been paid attention to in all the past research. We find heterogeneity in…
▽ More
Investor attention is an important concept in behavioral finance. Many articles have conducted cross-disciplinary research leading by this concept. In this paper, we use data extraction technology to collect a large number of Baidu Index keyword search volume data. After analyzing the data, we draw a conclusion that has not been paid attention to in all the past research. We find heterogeneity in searching by internet users in China. Firstly, in terms of search behavior, internet users are more inclined to use the PC end to obtain information when facing areas which need to be taken seriously by them. Secondly, attention is heterogeneous while searching. When Internet users search for information in mobile end, their attention is divergent, and search for seemingly unrelated keywords at the same time which limits their attention to information.
△ Less
Submitted 2 November, 2019;
originally announced November 2019.
-
A neural network-based framework for financial model calibration
Authors:
Shuaiqiang Liu,
Anastasia Borovykh,
Lech A. Grzelak,
Cornelis W. Oosterlee
Abstract:
A data-driven approach called CaNN (Calibration Neural Network) is proposed to calibrate financial asset price models using an Artificial Neural Network (ANN). Determining optimal values of the model parameters is formulated as training hidden neurons within a machine learning framework, based on available financial option prices. The framework consists of two parts: a forward pass in which we tra…
▽ More
A data-driven approach called CaNN (Calibration Neural Network) is proposed to calibrate financial asset price models using an Artificial Neural Network (ANN). Determining optimal values of the model parameters is formulated as training hidden neurons within a machine learning framework, based on available financial option prices. The framework consists of two parts: a forward pass in which we train the weights of the ANN off-line, valuing options under many different asset model parameter settings; and a backward pass, in which we evaluate the trained ANN-solver on-line, aiming to find the weights of the neurons in the input layer. The rapid on-line learning of implied volatility by ANNs, in combination with the use of an adapted parallel global optimization method, tackles the computation bottleneck and provides a fast and reliable technique for calibrating model parameters while avoiding, as much as possible, getting stuck in local minima. Numerical experiments confirm that this machine-learning framework can be employed to calibrate parameters of high-dimensional stochastic volatility models efficiently and accurately.
△ Less
Submitted 23 April, 2019;
originally announced April 2019.
-
Pricing options and computing implied volatilities using neural networks
Authors:
Shuaiqiang Liu,
Cornelis W. Oosterlee,
Sander M. Bohte
Abstract:
This paper proposes a data-driven approach, by means of an Artificial Neural Network (ANN), to value financial options and to calculate implied volatilities with the aim of accelerating the corresponding numerical methods. With ANNs being universal function approximators, this method trains an optimized ANN on a data set generated by a sophisticated financial model, and runs the trained ANN as an…
▽ More
This paper proposes a data-driven approach, by means of an Artificial Neural Network (ANN), to value financial options and to calculate implied volatilities with the aim of accelerating the corresponding numerical methods. With ANNs being universal function approximators, this method trains an optimized ANN on a data set generated by a sophisticated financial model, and runs the trained ANN as an agent of the original solver in a fast and efficient way. We test this approach on three different types of solvers, including the analytic solution for the Black-Scholes equation, the COS method for the Heston stochastic volatility model and Brent's iterative root-finding method for the calculation of implied volatilities. The numerical results show that the ANN solver can reduce the computing time significantly.
△ Less
Submitted 23 April, 2019; v1 submitted 25 January, 2019;
originally announced January 2019.