-
The Lepto-Variance of Stock Returns
Authors:
Vassilis Polimenis
Abstract:
The Regression Tree (RT) sorts the samples using a specific feature and finds the split point that produces the maximum variance reduction from a node to its children. Our key observation is that the best factor to use (in terms of MSE drop) is always the target itself, as this most clearly separates the target. Thus using the target as the splitting factor provides an upper bound on MSE drop (or…
▽ More
The Regression Tree (RT) sorts the samples using a specific feature and finds the split point that produces the maximum variance reduction from a node to its children. Our key observation is that the best factor to use (in terms of MSE drop) is always the target itself, as this most clearly separates the target. Thus using the target as the splitting factor provides an upper bound on MSE drop (or lower bound on the residual children MSE). Based on this observation, we define the k-bit lepto-variance $λk^2$ of a target variable (or equivalently the lepto-variance at a specific depth k) as the variance that cannot be removed by any regression tree of a depth equal to k. As the upper bound performance for any feature, we believe $λk^2$ to be an interesting statistical concept related to the underlying structure of the sample as it quantifies the resolving power of the RT for the sample. The max variance that may be explained using RTs of depth up to k is called the sample k-bit macro-variance. At any depth, total sample variance is thus decomposed into lepto-variance $λ^2$ and macro-variance $μ^2$. We demonstrate the concept, by performing 1- and 2-bit RT based lepto-structure analysis for daily IBM stock returns.
△ Less
Submitted 16 October, 2022; v1 submitted 29 June, 2022;
originally announced July 2022.
-
Uncovering a factor-based expected return conditioning structure with Regression Trees jointly for many stocks
Authors:
Vassilis Polimenis
Abstract:
Given the success and almost universal acceptance of the simple linear regression three-factor model, it is interesting to analyze the informational content of the three factors in explaining stock returns when the analysis is allowed to consider non-linear dependencies between factors and stock returns. In order to better understand factor-based conditioning information with respect to expected s…
▽ More
Given the success and almost universal acceptance of the simple linear regression three-factor model, it is interesting to analyze the informational content of the three factors in explaining stock returns when the analysis is allowed to consider non-linear dependencies between factors and stock returns. In order to better understand factor-based conditioning information with respect to expected stock returns within a regression tree setting, the analysis of stock returns is demonstrated using daily stock return data for 5 major US corporations. The first finding is that in all cases (solo and joint) the most informative factor is always the market excess return factor. Further, three major issues are discussed: a) the balance of a depth=1 tree as it relates to properties of the stock return distribution, b) the mechanism behind depth=1 tree balance in a joint regression tree and c) the dominant stock in a joint regression tree. It is shown that high skew values alone cannot explain the imbalance of the resulting tree split as stocks with pronounced skew may produce balanced tree splits.
△ Less
Submitted 16 July, 2020;
originally announced July 2020.
-
Trading on the Floor after Sweeping the Book
Authors:
Vassilis Polimenis
Abstract:
Informed traders need to trade fast in order to profit from their private information before it becomes public. Fast electronic markets provide such liquidity. Slow markets provide execution in an auction based trading floor. Hybrid markets combine both execution venues. In its main result, the paper shows that to compensate for their slow and risky executions, trading floors need to be at least t…
▽ More
Informed traders need to trade fast in order to profit from their private information before it becomes public. Fast electronic markets provide such liquidity. Slow markets provide execution in an auction based trading floor. Hybrid markets combine both execution venues. In its main result, the paper shows that to compensate for their slow and risky executions, trading floors need to be at least twice as deep as the sweeping facility. Furthermore, when a stand-alone trading floor is enhanced with the addition of a sweeping facility, overall informed trading will decline because it is easier for informed traders to extract the full value of their private info.
△ Less
Submitted 17 January, 2020;
originally announced January 2020.
-
Non-Stationary Dividend-Price Ratios
Authors:
Vassilis Polimenis,
Ioannis Neokosmidis
Abstract:
Dividend yields have been widely used in previous research to relate stock market valuations to cash flow fundamentals. However, this approach relies on the assumption that dividend yields are stationary. Due to the failure to reject the hypothesis of a unit root in the classical dividend-price ratio for the US stock market, Polimenis and Neokosmidis (2016) proposed the use of a modified dividend…
▽ More
Dividend yields have been widely used in previous research to relate stock market valuations to cash flow fundamentals. However, this approach relies on the assumption that dividend yields are stationary. Due to the failure to reject the hypothesis of a unit root in the classical dividend-price ratio for the US stock market, Polimenis and Neokosmidis (2016) proposed the use of a modified dividend price ratio (mdp) as the deviation between d and p from their long run equilibrium, and showed that mdp provides substantially improved forecasting results over the classical dp ratio. Here, we extend that paper by performing multivariate regressions based on the Campbell-Shiller approximation, by utilizing a dynamic econometric procedure to estimate the modified dp, and by testing the modified ratios against reinvested dividend-yields. By comparing the performance of mdp and dp in the period after 1965, we are not only able to enhance the robustness of the findings, but also to debunk a possible false explanation that the enhanced mdp performance in predicting future returns comes from a capacity to predict the risk-free return component. Depending on whether one uses the recursive or population methodology to measure the performance of mdp, the Out-of-Sample performance gain is between 30% to 50%.
△ Less
Submitted 16 February, 2019;
originally announced February 2019.