Search | arXiv e-print repository

Convex Combination of Ordinary Least Squares and Two-stage Least Squares Estimators

Authors: Cedric E. Ginestet, Richard Emsley, Sabine Landau

Abstract: In the presence of confounders, the ordinary least squares (OLS) estimator is known to be biased. This problem can be remedied by using the two-stage least squares (TSLS) estimator, based on the availability of valid instrumental variables (IVs). This reduction in bias, however, is offset by an increase in variance. Under standard assumptions, the OLS has indeed a larger bias than the TSLS estimator; and moreover, one can prove that the sample variance of the OLS estimator is no greater than the one of the TSLS. Therefore, it is natural to ask whether one could combine the desirable properties of the OLS and TSLS estimators. Such a trade-off can be achieved through a convex combination of these two estimators, thereby producing our proposed convex least squares (CLS) estimator. The relative contribution of the OLS and TSLS estimators is here chosen to minimize a sample estimate of the mean squared error (MSE) of their convex combination. This proportion parameter is proved to be unique, whenever the OLS and TSLS differ in MSEs. Remarkably, we show that this proportion parameter can be estimated from the data, and that the resulting CLS estimator is consistent. We also show how the CLS framework can incorporate other asymptotically unbiased estimators, such as the jackknife IV estimator (JIVE). The finite-sample properties of the CLS estimator are investigated using Monte Carlo simulations, in which we independently vary the amount of confounding and the strength of the instrument. Overall, the CLS estimator is found to outperform the TSLS estimator in terms of MSE. The method is also applied to a classic data set from econometrics, which models the financial return to education.

Submitted 13 April, 2015; originally announced April 2015.

Comments: 33 pages. 8 figures, 1 table. To be presented at UK-CIM (Causal Inference Meeting) in Bristol, in April 2015
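
As a rough illustration of the convex-combination idea in the abstract above (not the authors' estimator), the sketch below treats the scalar case and takes the two MSEs and the cross term E[(OLS - beta)(TSLS - beta)] as given inputs. The function names and the numbers are hypothetical; the paper estimates these quantities from the data, which is not attempted here.

```python
def optimal_weight(mse_ols, mse_tsls, cross):
    """Weight pi on OLS minimizing the MSE of pi*OLS + (1 - pi)*TSLS.

    Scalar case only. `cross` stands for E[(OLS - beta) * (TSLS - beta)].
    All three inputs are assumed known here (an illustrative assumption).
    """
    # The denominator equals E[(OLS - TSLS)^2], so it is non-negative.
    denom = mse_ols + mse_tsls - 2.0 * cross
    if denom <= 0.0:
        # Degenerate case: the two estimators coincide, so any weight
        # gives the same MSE. Fall back to TSLS by convention.
        return 0.0
    pi = (mse_tsls - cross) / denom
    # The objective is a convex quadratic in pi, so clipping the
    # unconstrained minimizer to [0, 1] gives the constrained one.
    return min(1.0, max(0.0, pi))


def combined_mse(pi, mse_ols, mse_tsls, cross):
    """MSE of the convex combination pi*OLS + (1 - pi)*TSLS."""
    return (pi ** 2) * mse_ols + ((1.0 - pi) ** 2) * mse_tsls \
        + 2.0 * pi * (1.0 - pi) * cross


# Made-up inputs: OLS with MSE 1.0, TSLS with MSE 4.0, zero cross term.
pi_star = optimal_weight(1.0, 4.0, 0.0)
mse_star = combined_mse(pi_star, 1.0, 4.0, 0.0)
```

With these made-up inputs the combination puts weight 0.8 on OLS and attains an MSE of 0.8, below both component MSEs, which is the kind of trade-off the abstract describes.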

arXiv:1204.3183 [pdf, other]

Strong Consistency of Frechet Sample Mean Sets for Graph-Valued Random Variables

Authors: Cedric E. Ginestet

Abstract: The Frechet mean or barycenter generalizes the idea of averaging in spaces where pairwise addition is not well-defined. In general metric spaces, the Frechet sample mean is not a consistent estimator of the theoretical Frechet mean. For graph-valued random variables, for instance, the Frechet sample mean may fail to converge to a unique value. Hence, it becomes necessary to consider the convergence of sequences of sets of graphs. We show that a specific type of almost sure convergence for the Frechet sample mean previously introduced by Ziezold (1977) is, in fact, equivalent to the Kuratowski outer limit of a sequence of Frechet sample means. Equipped with this outer limit, we provide a new proof of the strong consistency of the Frechet sample mean for graph-valued random variables in separable (pseudo-)metric space. Our proof strategy exploits the fact that the metric of interest is bounded, since we are considering graphs over a finite number of vertices. In this setting, we describe two strong laws of large numbers for both the restricted and unrestricted Frechet sample means of all orders, thereby generalizing a previous result, due to Sverdrup-Thygeson (1981).

Submitted 15 May, 2013; v1 submitted 14 April, 2012; originally announced April 2012.

Comments: 21 pages, 3 figures
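
To make the notion of a Frechet sample mean set concrete, here is a minimal sketch for simple graphs on a fixed vertex set. The Hamming distance on edge sets is an assumption chosen for illustration (the abstract only requires a bounded metric), and all function names are hypothetical. The "unrestricted" mean minimizes over every graph on the vertex set, while the "restricted" mean minimizes over the sample itself.

```python
from itertools import combinations


def all_graphs(n_vertices):
    """Every simple undirected graph on n_vertices, as a frozenset of edges."""
    edges = list(combinations(range(n_vertices), 2))
    return [frozenset(subset)
            for k in range(len(edges) + 1)
            for subset in combinations(edges, k)]


def hamming(g, h):
    """Number of edges present in exactly one of the two graphs."""
    return len(g ^ h)


def frechet_mean_set(sample, candidates, order=2):
    """Set of candidates minimizing the sum of distances**order to the sample."""
    cost = {y: sum(hamming(y, x) ** order for x in sample) for y in candidates}
    best = min(cost.values())
    return {y for y, c in cost.items() if c == best}


# Two observed graphs on three vertices: the empty graph and a single edge.
sample = [frozenset(), frozenset({(0, 1)})]
unrestricted = frechet_mean_set(sample, all_graphs(3))
restricted = frechet_mean_set(sample, sample)
```

In this toy sample both observed graphs attain the minimal cost, so the mean is a set of two graphs rather than a single point, which is why the abstract works with convergence of sets.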

arXiv:1204.2194 [pdf, ps, other]

Weighted Frechet Means as Convex Combinations in Metric Spaces: Properties and Generalized Median Inequalities

Authors: Cedric E. Ginestet, Andrew Simmons, Eric D. Kolaczyk

Abstract: In this short note, we study the properties of the weighted Frechet mean as a convex combination operator on an arbitrary metric space, (Y,d). We show that this binary operator is commutative, non-associative, idempotent, invariant to multiplication by a constant weight and possesses an identity element. We also treat the properties of the weighted cumulative Frechet mean. These tools allow us to derive several types of median inequalities for abstract metric spaces that hold for both negative and positive Alexandrov spaces. In particular, we show through an example that these bounds cannot be improved upon in general metric spaces. For weighted Frechet means, however, such inequalities can solely be derived for weights equal or greater than one. This latter limitation highlights the inherent difficulties associated with working with abstract-valued random variables.

Submitted 12 June, 2012; v1 submitted 10 April, 2012; originally announced April 2012.

Comments: 7 pages, 1 figure. Submitted to Probability and Statistics Letters

arXiv:1105.6322 [pdf, other]

Classification Loss Function for Parameter Ensembles in Bayesian Hierarchical Models

Authors: Cedric E. Ginestet, Nicky G. Best, Sylvia Richardson

Abstract: Parameter ensembles or sets of point estimates constitute one of the cornerstones of modern statistical practice. This is especially the case in Bayesian hierarchical models, where different decision-theoretic frameworks can be deployed to summarize such parameter ensembles. The estimation of these parameter ensembles may thus substantially vary depending on which inferential goals are prioritised by the modeller. In this note, we consider the problem of classifying the elements of a parameter ensemble above or below a given threshold. Two threshold classification losses (TCLs) --weighted and unweighted-- are formulated. The weighted TCL can be used to emphasize the estimation of false positives over false negatives or the converse. We prove that the weighted and unweighted TCLs are optimized by the ensembles of unit-specific posterior quantiles and posterior medians, respectively. In addition, we relate these classification loss functions on parameter ensembles to the concepts of posterior sensitivity and specificity. Finally, we find some relationships between the unweighted TCL and the absolute value loss, which explain why both functions are minimized by posterior medians.

Submitted 9 June, 2011; v1 submitted 31 May, 2011; originally announced May 2011.

Comments: Submitted to Probability and Statistics Letters
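
The threshold classification idea can be sketched for a single unit as follows. The loss form used here (weight p on false positives, 1 - p on false negatives, averaged over posterior draws) is an assumption made for illustration and may differ in detail from the paper's definition; the function name and the draws are hypothetical.

```python
from statistics import mean, median


def expected_tcl(estimate, draws, threshold, p=0.5):
    """Posterior expected weighted threshold classification loss for one unit.

    A false positive occurs when the estimate lies above the threshold but
    the parameter draw does not; a false negative is the reverse. The
    posterior expectation is approximated by averaging over `draws`.
    """
    total = 0.0
    for theta in draws:
        false_pos = estimate > threshold and theta <= threshold
        false_neg = estimate <= threshold and theta > threshold
        total += p * false_pos + (1.0 - p) * false_neg
    return total / len(draws)


# Made-up, right-skewed posterior draws for one unit, with threshold 1.0.
draws = [0.1, 0.2, 0.3, 0.4, 5.0]
threshold = 1.0

loss_median = expected_tcl(median(draws), draws, threshold)  # median below threshold
loss_mean = expected_tcl(mean(draws), draws, threshold)      # mean above threshold
```

For these skewed draws the posterior mean falls above the threshold while most of the posterior mass lies below it, so the median incurs the smaller expected unweighted loss, in line with the optimality result stated in the abstract.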

arXiv:1105.5004 [pdf, other]

Bayesian Decision-theoretic Methods for Parameter Ensembles with Application to Epidemiology

Authors: Cedric E. Ginestet

Abstract: Parameter ensembles or sets of random effects constitute one of the cornerstones of modern statistical practice. This is especially the case in Bayesian hierarchical models, where several decision theoretic frameworks can be deployed. The estimation of these parameter ensembles may substantially vary depending on which inferential goals are prioritised by the modeller. Since one may wish to satisfy a range of desiderata, it is therefore of interest to investigate whether some sets of point estimates can simultaneously meet several inferential objectives. In this thesis, we will be especially concerned with identifying ensembles of point estimates that produce good approximations of (i) the true empirical quantiles and empirical quartile ratio (QR) and (ii) provide an accurate classification of the ensemble's elements above and below a given threshold. For this purpose, we review various decision-theoretic frameworks, which have been proposed in the literature in relation to the optimisation of different aspects of the empirical distribution of a parameter ensemble. This includes the constrained Bayes (CB), weighted-rank squared error loss (WRSEL), and triple-goal (GR) ensembles of point estimates. In addition, we also consider the set of maximum likelihood estimates (MLEs) and the ensemble of posterior means --the latter being optimal under the summed squared error loss (SSEL). Firstly, we test the performance of these different sets of point estimates as plug-in estimators for the empirical quantiles and empirical QR under a range of synthetic scenarios encompassing both spatial and non-spatial simulated data sets. Performance evaluation is here conducted using the posterior regret. Secondly, two threshold classification losses (TCLs) --weighted and unweighted-- are formulated and formally optimised. The performance of these decision-theoretic tools is also evaluated on real data sets.

Submitted 18 March, 2014; v1 submitted 25 May, 2011; originally announced May 2011.

Comments: Imperial College London PhD thesis

Showing 1–5 of 5 results for author: Ginestet, C E