-
Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications
Authors:
Yuhan Wang,
Zhen Xu,
Yue Yao,
Jinsong Liu,
Jiating Lin
Abstract:
With the development of the financial industry, credit default prediction, as an important task in financial risk management, has received increasing attention. Traditional credit default prediction methods mostly rely on machine learning models, such as decision trees and random forests, but these methods have certain limitations in processing complex data and capturing potential risk patterns. To this end, this paper proposes a deep learning model based on the combination of convolutional neural networks (CNN) and Transformer for credit user default prediction. The model combines the advantages of CNN in local feature extraction with the ability of Transformer in global dependency modeling, effectively improving the accuracy and robustness of credit default prediction. Through experiments on public credit default datasets, the results show that the CNN+Transformer model outperforms traditional machine learning models, such as random forests and XGBoost, in multiple evaluation indicators such as accuracy, AUC, and KS value, demonstrating its powerful ability in complex financial data modeling. Further experimental analysis shows that appropriate optimizer selection and learning rate adjustment play a vital role in improving model performance. In addition, the ablation experiment of the model verifies the advantages of the combination of CNN and Transformer and proves the complementarity of the two in credit default prediction. This study provides a new idea for credit default prediction and provides strong support for risk assessment and intelligent decision-making in the financial field. Future research can further improve the prediction effect and generalization ability by introducing more unstructured data and improving the model architecture.
Submitted 24 December, 2024;
originally announced December 2024.
-
Security Issuance, Institutional Investors and Quid Pro Quo
Authors:
Gaurab Aryal,
Zhaohui Chen,
Yuchi Yao,
Chris Yung
Abstract:
Securities issuance through intermediaries is subject to agency problems and informational frictions. We examine these effects using SPAC data. We identify ``premium'' investors whose participation is linked to lower liquidation risk, higher returns, and lower redemption rates, consistent with both informational rents and agency frictions. In contrast, ``non-premium'' investors engage in non-agency quid pro quo relationships. Specifically, they receive high returns from an intermediary (quid) in exchange for a tacit agreement to participate in weaker future deals (quo). These relationships serve as insurance for issuers and intermediaries, enabling more issuers to access markets.
Submitted 26 July, 2024; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Combination of window-sliding and prediction range method based on LSTM model for predicting cryptocurrency
Authors:
Yifan Yao,
Lina Wang
Abstract:
The present study aims to model the cryptocurrency price trend based on financial theory, using an LSTM model with multiple combinations of window length and prediction horizon. A random walk model is also applied with different parameter settings.
Submitted 8 February, 2021;
originally announced February 2021.
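To make the window-length and prediction-horizon combination in the abstract above concrete, here is a minimal Python sketch of how sliding-window training pairs might be built, together with a naive random-walk baseline that forecasts the last observed value. This is not the authors' code: the names `make_windows` and `random_walk_forecast`, and the choice of a single-value target, are assumptions made for illustration.

```python
def make_windows(series, window, horizon):
    """Split a 1-D series into (input window, target) pairs.

    Each input holds `window` consecutive values; the target is the
    value `horizon` steps after the last value of that window.
    (Hypothetical helper, not taken from the paper.)
    """
    values = [float(v) for v in series]
    n_pairs = len(values) - window - horizon + 1
    if window < 1 or horizon < 1 or n_pairs < 1:
        raise ValueError("series too short for this window/horizon")
    inputs = [values[i:i + window] for i in range(n_pairs)]
    targets = [values[i + window + horizon - 1] for i in range(n_pairs)]
    return inputs, targets


def random_walk_forecast(inputs):
    """Naive baseline: predict that the price stays at its last observed value."""
    return [w[-1] for w in inputs]


if __name__ == "__main__":
    prices = [100, 101, 103, 102, 105, 107, 106, 108]
    # Try several window/horizon combinations, as the abstract describes.
    for window, horizon in [(2, 1), (3, 1), (3, 2)]:
        X, y = make_windows(prices, window, horizon)
        baseline = random_walk_forecast(X)
        mae = sum(abs(a - b) for a, b in zip(y, baseline)) / len(y)
        print(window, horizon, len(X), round(mae, 3))
```

The windows in `X` would be the inputs to a sequence model such as an LSTM, while the baseline error gives a reference point for each window/horizon setting.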
-
Practical Option Valuations of Futures Contracts with Negative Underlying Prices
Authors:
Anatoliy Swishchuk,
Ana Roldan-Contreras,
Elham Soufiani,
Guillermo Martinez,
Mohsen Seifi,
Nishant Agrawal,
Yao Yao
Abstract:
Here we propose two alternatives to Black 76 to value European option future contracts in which the underlying market prices can be negative or mean reverting. The two proposed models are Ornstein-Uhlenbeck (OU) and continuous time GARCH (generalized autoregressive conditionally heteroscedastic). We then analyse the values and compare them with Black 76, the most commonly used model, when the underlying market prices are positive.
Submitted 25 September, 2020;
originally announced September 2020.
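For context on the Black 76 baseline named in the abstract above, the following is a minimal Python sketch of the standard Black 76 formulas for European calls and puts on a futures price. It is not the authors' implementation, and the function and parameter names are illustrative. The formula takes the logarithm of F/K, so it is undefined when the futures price F is zero or negative, which is the situation the abstract says motivates the alternative models.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black76(futures, strike, sigma, maturity, rate, kind="call"):
    """Black 76 price of a European option on a futures contract.

    futures  : current futures price F (must be positive)
    strike   : strike price K (must be positive)
    sigma    : annualised lognormal volatility
    maturity : time to expiry in years
    rate     : continuously compounded risk-free rate
    """
    if futures <= 0 or strike <= 0:
        # log(F / K) is undefined here; the lognormal assumption breaks down.
        raise ValueError("Black 76 requires positive futures and strike prices")
    if sigma <= 0 or maturity <= 0:
        raise ValueError("sigma and maturity must be positive")
    vol_sqrt_t = sigma * math.sqrt(maturity)
    d1 = (math.log(futures / strike) + 0.5 * sigma ** 2 * maturity) / vol_sqrt_t
    d2 = d1 - vol_sqrt_t
    discount = math.exp(-rate * maturity)
    if kind == "call":
        return discount * (futures * norm_cdf(d1) - strike * norm_cdf(d2))
    if kind == "put":
        return discount * (strike * norm_cdf(-d2) - futures * norm_cdf(-d1))
    raise ValueError("kind must be 'call' or 'put'")


if __name__ == "__main__":
    print(black76(100.0, 100.0, 0.2, 1.0, 0.0, "call"))
    try:
        black76(-5.0, 10.0, 0.2, 1.0, 0.0)
    except ValueError as err:
        print("negative price rejected:", err)
```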
-
Pricing Options Under Rough Volatility with Backward SPDEs
Authors:
Christian Bayer,
Jinniao Qiu,
Yao Yao
Abstract:
In this paper, we study the option pricing problems for rough volatility models. As the framework is non-Markovian, the value function for a European option is not deterministic; rather, it is random and satisfies a backward stochastic partial differential equation (BSPDE). The existence and uniqueness of a weak solution are proved for general nonlinear BSPDEs with unbounded random leading coefficients, whose connections with certain forward-backward stochastic differential equations are derived as well. These BSPDEs are then used to approximate American option prices. A deep learning-based method is also investigated for the numerical approximations to such BSPDEs and associated non-Markovian pricing problems. Finally, the examples of rough Bergomi type are numerically computed for both European and American options.
Submitted 3 August, 2020;
originally announced August 2020.
-
Improving Stock Market Prediction via Heterogeneous Information Fusion
Authors:
Xi Zhang,
Yunjia Zhang,
Senzhang Wang,
Yuntao Yao,
Binxing Fang,
Philip S. Yu
Abstract:
Traditional stock market prediction approaches commonly utilize the historical price-related data of the stocks to forecast their future trends. As the Web information grows, recently some works try to explore financial news to improve the prediction. Effective indicators, e.g., the events related to the stocks and the people's sentiments towards the market and stocks, have been proved to play important roles in the stocks' volatility, and are extracted to feed into the prediction models for improving the prediction accuracy. However, a major limitation of previous methods is that the indicators are obtained from only a single source whose reliability might be low, or from several data sources but their interactions and correlations among the multi-sourced data are largely ignored.
In this work, we extract the events from Web news and the users' sentiments from social media, and investigate their joint impacts on the stock price movements via a coupled matrix and tensor factorization framework. Specifically, a tensor is first constructed to fuse heterogeneous data and capture the intrinsic relations among the events and the investors' sentiments. Due to the sparsity of the tensor, two auxiliary matrices, the stock quantitative feature matrix and the stock correlation matrix, are constructed and incorporated to assist the tensor decomposition. The intuition is that stocks that are highly correlated with each other tend to be affected by the same event. Thus, instead of conducting each stock prediction task separately and independently, we predict multiple correlated stocks simultaneously through their commonalities, which are enabled via sharing the collaboratively factorized low rank matrices between matrices and the tensor. Evaluations on the China A-share stock data and the HK stock data in the year 2015 demonstrate the effectiveness of the proposed model.
Submitted 2 January, 2018;
originally announced January 2018.