Skip to main content

Showing 1–6 of 6 results for author: Dowty, J G

Searching in archive cs. Search in all archives.
.
  1. arXiv:1701.08895  [pdf, ps, other

    math.ST cs.IT math.DG math.PR

    Chentsov's theorem for exponential families

    Authors: James G. Dowty

    Abstract: Chentsov's theorem characterizes the Fisher information metric on statistical models as essentially the only Riemannian metric that is invariant under sufficient statistics. This implies that each statistical model is naturally equipped with a geometry, so Chentsov's theorem explains why many statistical properties can be described in geometric terms. However, despite being one of the foundational… ▽ More

    Submitted 22 May, 2017; v1 submitted 30 January, 2017; originally announced January 2017.

    Comments: Minor wording changes

  2. arXiv:1510.00112  [pdf, other

    cs.IT stat.ME stat.ML

    Higher-order asymptotics for the parametric complexity

    Authors: James G. Dowty

    Abstract: The parametric complexity is the key quantity in the minimum description length (MDL) approach to statistical model selection. Rissanen and others have shown that the parametric complexity of a statistical model approaches a simple function of the Fisher information volume of the model as the sample size $n$ goes to infinity. This paper derives higher-order asymptotic expansions for the parametric… ▽ More

    Submitted 29 October, 2015; v1 submitted 1 October, 2015; originally announced October 2015.

    Comments: Version 3: Fixed a minor error in the introduction

  3. arXiv:1408.0881  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Volumes of logistic regression models with applications to model selection

    Authors: James G. Dowty

    Abstract: Logistic regression models with $n$ observations and $q$ linearly-independent covariates are shown to have Fisher information volumes which are bounded below by $π^q$ and above by ${n \choose q} π^q$. This is proved with a novel generalization of the classical theorems of Pythagoras and de Gua, which is of independent interest. The finding that the volume is always finite is new, and it implies th… ▽ More

    Submitted 16 October, 2014; v1 submitted 5 August, 2014; originally announced August 2014.

    Comments: Improved the section on volume jumps and added a new volume bound (Theorem 13) for models with generic design matrices

  4. arXiv:1403.2201  [pdf, ps, other

    cs.IT

    SMML estimators for linear regression and tessellations of hyperbolic space

    Authors: James G. Dowty

    Abstract: The strict minimum message length (SMML) principle links data compression with inductive inference. The corresponding estimators have many useful properties but they can be hard to calculate. We investigate SMML estimators for linear regression models and we show that they have close connections to hyperbolic geometry. When equipped with the Fisher information metric, the linear regression model w… ▽ More

    Submitted 18 March, 2014; v1 submitted 10 March, 2014; originally announced March 2014.

  5. arXiv:1302.0581  [pdf, ps, other

    cs.IT math.ST stat.ML

    SMML estimators for exponential families with continuous sufficient statistics

    Authors: James G. Dowty

    Abstract: The minimum message length principle is an information theoretic criterion that links data compression with statistical inference. This paper studies the strict minimum message length (SMML) estimator for $d$-dimensional exponential families with continuous sufficient statistics, for all $d \ge 1$. The partition of an SMML estimator is shown to consist of convex polytopes (i.e. convex polygons whe… ▽ More

    Submitted 20 March, 2014; v1 submitted 3 February, 2013; originally announced February 2013.

    Comments: Revised to include new insights and results

  6. arXiv:1212.4906  [pdf, ps, other

    cs.IT math.ST stat.ML

    SMML estimators for 1-dimensional continuous data

    Authors: James G. Dowty

    Abstract: A method is given for calculating the strict minimum message length (SMML) estimator for 1-dimensional exponential families with continuous sufficient statistics. A set of $n$ equations are found that the $n$ cut-points of the SMML estimator must satisfy. These equations can be solved using Newton's method and this approach is used to produce new results and to replicate results that C. S. Wallace… ▽ More

    Submitted 19 December, 2012; originally announced December 2012.

    Comments: 10 pages, 2 tables and 1 figure