-
Measurement of carbon finance level and exploration of its influencing factors
Authors:
Peng Zhang,
Yuwei Zhang,
Nuo Xu
Abstract:
Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming. Considering the complexity and urgency of carbon market, it is necessary to construct an effective evaluation index system. This paper selected carbon finance index as a composite indicator. Taking Beijing, Shanghai, and Guangdong a…
▽ More
Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming. Considering the complexity and urgency of carbon market, it is necessary to construct an effective evaluation index system. This paper selected carbon finance index as a composite indicator. Taking Beijing, Shanghai, and Guangdong as examples, we adopted the classic method of multiple criteria decision analysis (MCDA) to analyze the composite indicator. Potential impact factors were screened extensively and calculated through normalization, weighting by coefficient of variation and different aggregation methods. Under the measurement of Shannon-Spearman Measure, the method with the least loss of information was used to obtain the carbon finance index (CFI) of the pilot areas. Through panel model analysis, we found that company size, the number of patents per 10,000 people and the proportion of new energy generation were the factors with significant influence. Based on the research, corresponding suggestions were put forward for different market entities. Hopefully, this research will contribute to the steady development of the national carbon market.
△ Less
Submitted 31 May, 2022;
originally announced June 2022.
-
Generalization error minimization: a new approach to model evaluation and selection with an application to penalized regression
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
We study model evaluation and model selection from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. We believe that GA is one way formally to address concerns about the external validity of a model. The GA of a model estimated on a sample can be measured by its empirical out-of-sample errors, called the generalizati…
▽ More
We study model evaluation and model selection from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. We believe that GA is one way formally to address concerns about the external validity of a model. The GA of a model estimated on a sample can be measured by its empirical out-of-sample errors, called the generalization errors (GE). We derive upper bounds for the GE, which depend on sample sizes, model complexity and the distribution of the loss function. The upper bounds can be used to evaluate the GA of a model, ex ante. We propose using generalization error minimization (GEM) as a framework for model selection. Using GEM, we are able to unify a big class of penalized regression estimators, including lasso, ridge and bridge, under the same set of assumptions. We establish finite-sample and asymptotic properties (including $\mathcal{L}_2$-consistency) of the GEM estimator for both the $n \geqslant p$ and the $n < p$ cases. We also derive the $\mathcal{L}_2$-distance between the penalized and corresponding unpenalized regression estimates. In practice, GEM can be implemented by validation or cross-validation. We show that the GE bounds can be used for selecting the optimal number of folds in $K$-fold cross-validation. We propose a variant of $R^2$, the $GR^2$, as a measure of GA, which considers both both in-sample and out-of-sample goodness of fit. Simulations are used to demonstrate our key results.
△ Less
Submitted 18 October, 2016;
originally announced October 2016.
-
Finite-sample and asymptotic analysis of generalization ability with an application to penalized regression
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
In this paper, we study the performance of extremum estimators from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. By adapting the classical concentration inequalities, we derive upper bounds on the empirical out-of-sample prediction errors as a function of the in-sample errors, in-sample data size, heaviness in t…
▽ More
In this paper, we study the performance of extremum estimators from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. By adapting the classical concentration inequalities, we derive upper bounds on the empirical out-of-sample prediction errors as a function of the in-sample errors, in-sample data size, heaviness in the tails of the error distribution, and model complexity. We show that the error bounds may be used for tuning key estimation hyper-parameters, such as the number of folds $K$ in cross-validation. We also show how $K$ affects the bias-variance trade-off for cross-validation. We demonstrate that the $\mathcal{L}_2$-norm difference between penalized and the corresponding un-penalized regression estimates is directly explained by the GA of the estimates and the GA of empirical moment conditions. Lastly, we prove that all penalized regression estimates are $L_2$-consistent for both the $n \geqslant p$ and the $n < p$ cases. Simulations are used to demonstrate key results.
Keywords: generalization ability, upper bound of generalization error, penalized regression, cross-validation, bias-variance trade-off, $\mathcal{L}_2$ difference between penalized and unpenalized regression, lasso, high-dimensional data.
△ Less
Submitted 13 September, 2016; v1 submitted 12 September, 2016;
originally announced September 2016.
-
Model selection consistency from the perspective of generalization ability and VC theory with an application to Lasso
Authors:
Ning Xu,
Jian Hong,
Timothy C. G. Fisher
Abstract:
Model selection is difficult to analyse yet theoretically and empirically important, especially for high-dimensional data analysis. Recently the least absolute shrinkage and selection operator (Lasso) has been applied in the statistical and econometric literature. Consis- tency of Lasso has been established under various conditions, some of which are difficult to verify in practice. In this paper,…
▽ More
Model selection is difficult to analyse yet theoretically and empirically important, especially for high-dimensional data analysis. Recently the least absolute shrinkage and selection operator (Lasso) has been applied in the statistical and econometric literature. Consis- tency of Lasso has been established under various conditions, some of which are difficult to verify in practice. In this paper, we study model selection from the perspective of generalization ability, under the framework of structural risk minimization (SRM) and Vapnik-Chervonenkis (VC) theory. The approach emphasizes the balance between the in-sample and out-of-sample fit, which can be achieved by using cross-validation to select a penalty on model complexity. We show that an exact relationship exists between the generalization ability of a model and model selection consistency. By implementing SRM and the VC inequality, we show that Lasso is L2-consistent for model selection under assumptions similar to those imposed on OLS. Furthermore, we derive a probabilistic bound for the distance between the penalized extremum estimator and the extremum estimator without penalty, which is dominated by overfitting. We also propose a new measurement of overfitting, GR2, based on generalization ability, that converges to zero if model selection is consistent. Using simulations, we demonstrate that the proposed CV-Lasso algorithm performs well in terms of model selection and overfitting control.
△ Less
Submitted 1 June, 2016;
originally announced June 2016.