Skip to main content

Showing 1–26 of 26 results for author: Harremoes, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.17554  [pdf, other

    cs.IT math.PR

    Information Theory for Expectation Measures

    Authors: Peter Harremoës

    Abstract: Shannon based his information theory on the notion of probability measures as it we developed by Kolmogorov. In this paper we study some fundamental problems in information theory based on expectation measures. In the theory of expectation measures it is natural to study data sets where no randomness is present and it is also natural to study information theory for point processes as well as sampl… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: 6 pages 2, figures, conference

    MSC Class: 94A17; 62B10

  2. arXiv:2306.16646  [pdf, ps, other

    cs.IT math.ST

    Reverse Information Projections and Optimal E-statistics

    Authors: Tyron Lardy, Peter Grünwald, Peter Harremoës

    Abstract: Information projections have found important applications in probability theory, statistics, and related areas. In the field of hypothesis testing in particular, the reverse information projection (RIPr) has recently been shown to lead to growth-rate optimal (GRO) e-statistics for testing simple alternatives against composite null hypotheses. However, the RIPr as well as the GRO criterion are unde… ▽ More

    Submitted 30 July, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: A five-page abstract of this paper, containing a subset of the theorems but no proofs, was presented at ISIT 2023, Taipei

    MSC Class: 62B10 (primary); 94A17 (secondary)

  3. arXiv:2202.02668  [pdf, other

    cs.IT math.PR

    Unnormalized Measures in Information Theory

    Authors: Peter Harremoës

    Abstract: Information theory is built on probability measures and by definition a probability measure has total mass 1. Probability measures are used to model uncertainty, and one may ask how important it is that the total mass is one. We claim that the main reason to normalize measures is that probability measures are related to codes via Kraft's inequality. Using a minimum description length approach to s… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: 6 pages, 3 figures

    MSC Class: 94A17

  4. arXiv:2201.03707  [pdf, other

    stat.AP cs.IT

    Rate Distortion Theory for Descriptive Statistics

    Authors: Peter Harremoës

    Abstract: Rate distortion theory was developed for optimizing lossy compression of data, but it also has a lot of applications in statistics. In this paper we will see how rate distortion theory can be used to analyze a complicated data set involving orientations of early Islamic mosques. The analysis involves testing, identification of outliers, choice of compression rate, calculation of optimal reconstruc… ▽ More

    Submitted 16 February, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: 6 pages, 4 figures

    MSC Class: 94-10; 94A34

  5. arXiv:2002.03002  [pdf, other

    math.PR cs.IT

    Bounds on the Information Divergence for Hypergeometric Distributions

    Authors: Peter Harremoës, František Matúš

    Abstract: The hypergeometric distributions have many important applications, but they have not had sufficient attention in information theory. Hypergeometric distributions can be approximated by binomial distributions or Poisson distributions. In this paper we present upper and lower bounds on information divergence. These bounds are important for statistical testing and a better understanding of the notion… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 21 pages, 2 figures

    MSC Class: 60E15 94A17

  6. arXiv:1805.02234  [pdf, ps, other

    math.ST cs.IT

    Statistical Inference and Exact Saddle Point Approximations

    Authors: Peter Harremoës

    Abstract: Statistical inference may follow a frequentist approach or it may follow a Bayesian approach or it may use the minimum description length principle (MDL). Our goal is to identify situations in which these different approaches to statistical inference coincide. It is proved that for exponential families MDL and Bayesian inference coincide if and only if the renormalized saddle point approximation f… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: 5 pages

    MSC Class: 62B10;

  7. arXiv:1701.06688  [pdf, ps, other

    cs.IT quant-ph

    Quantum Information on Spectral Sets

    Authors: Peter Harremoës

    Abstract: For convex optimization problems Bregman divergences appear as regret functions. Such regret functions can be defined on any convex set but if a sufficiency condition is added the regret function must be proportional to information divergence and the convex set must be spectral. Spectral set are sets where different orthogonal decompositions of a state into pure states have unique mixing coefficie… ▽ More

    Submitted 10 February, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

    Comments: 13 pages, 2 figures. arXiv admin note: text overlap with arXiv:1701.01010

    MSC Class: 81P16; 94B75

  8. arXiv:1701.01010  [pdf, other

    cs.IT cond-mat.stat-mech math.OC

    Divergence and Sufficiency for Convex Optimization

    Authors: Peter Harremoës

    Abstract: Logarithmic score and information divergence appear in information theory, statistics, statistical mechanics, and portfolio theory. We demonstrate that all these topics involve some kind of optimization that leads directly to regret functions and such regret functions are often given by a Bregman divergence. If the regret function also fulfills a sufficiency condition it must be proportional to in… ▽ More

    Submitted 10 April, 2017; v1 submitted 4 January, 2017; originally announced January 2017.

    Comments: 39 pages, 3 figures

    MSC Class: 94A17

  9. arXiv:1607.02259  [pdf, ps, other

    math-ph cs.IT

    Maximum Entropy and Sufficiency

    Authors: Peter Harremoës

    Abstract: The notion of Bregman divergence and sufficiency will be defined on general convex state spaces. It is demonstrated that only spectral sets can have a Bregman divergence that satisfies a sufficiency condition. Positive elements with trace 1 in a Jordan algebra are examples of spectral sets, and the most important example is the set of density matrices with complex entries. It is conjectured that i… ▽ More

    Submitted 3 September, 2016; v1 submitted 8 July, 2016; originally announced July 2016.

    MSC Class: 81P16; 94A17

  10. arXiv:1601.07593  [pdf, ps, other

    cs.IT q-fin.PM

    Sufficiency on the Stock Market

    Authors: Peter Harremoës

    Abstract: It is well-known that there are a number of relations between theoretical finance theory and information theory. Some of these relations are exact and some are approximate. In this paper we will explore some of these relations and determine under which conditions the relations are exact. It turns out that portfolio theory always leads to Bregman divergences. The Bregman divergence is only proporti… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    MSC Class: 91B25

  11. arXiv:1502.04336  [pdf, ps, other

    cs.IT

    Lattices with non-Shannon Inequalities

    Authors: Peter Harremoës

    Abstract: We study the existence or absence of non-Shannon inequalities for variables that are related by functional dependencies. Although the power-set on four variables is the smallest Boolean lattice with non-Shannon inequalities there exist lattices with many more variables without non-Shannon inequalities. We search for conditions that ensures that no non-Shannon inequalities exist. It is demonstrated… ▽ More

    Submitted 15 February, 2015; originally announced February 2015.

    Comments: Ten pages. Submitted to ISIT 2015. The appendix will not appear in the proceedings

  12. arXiv:1402.0092  [pdf, other

    math.ST cs.IT

    Mutual information of Contingency Tables and Related Inequalities

    Authors: Peter Harremoës

    Abstract: For testing independence it is very popular to use either the $χ^{2}$-statistic or $G^{2}$-statistics (mutual information). Asymptotically both are $χ^{2}$-distributed so an obvious question is which of the two statistics that has a distribution that is closest to the $χ^{2}$-distribution. Surprisingly the distribution of mutual information is much better approximated by a $χ^{2}$-distribution tha… ▽ More

    Submitted 1 February, 2014; originally announced February 2014.

    Comments: A version without the appendix has been submitted to a conference

  13. arXiv:1305.4324  [pdf, ps, other

    cs.LG stat.ML

    Horizon-Independent Optimal Prediction with Log-Loss in Exponential Families

    Authors: Peter Bartlett, Peter Grunwald, Peter Harremoes, Fares Hedayati, Wojciech Kotlowski

    Abstract: We study online learning under logarithmic loss with regular parametric models. Hedayati and Bartlett (2012b) showed that a Bayesian prediction strategy with Jeffreys prior and sequential normalized maximum likelihood (SNML) coincide and are optimal if and only if the latter is exchangeable, and if and only if the optimal strategy can be calculated without knowing the time horizon in advance. They… ▽ More

    Submitted 19 May, 2013; originally announced May 2013.

    Comments: 23 pages

  14. arXiv:1301.6465  [pdf, ps, other

    cs.IT math.ST

    Extendable MDL

    Authors: Peter Harremoës

    Abstract: In this paper we show that combination of the minimum description length principle and a exchange-ability condition leads directly to the use of Jeffreys prior. This approach works in most cases even when Jeffreys prior cannot be normalized. Kraft's inequality links codes and distributions but a closer look at this inequality demonstrates that this link only makes sense when sequences are consider… ▽ More

    Submitted 19 May, 2013; v1 submitted 28 January, 2013; originally announced January 2013.

    Comments: 9 pages

    MSC Class: 62B10; 94A15

  15. arXiv:1206.6544  [pdf, ps, other

    cs.IT

    Minimum KL-divergence on complements of $L_1$ balls

    Authors: Daniel Berend, Peter Harremoës, Aryeh Kontorovich

    Abstract: Pinsker's widely used inequality upper-bounds the total variation distance $||P-Q||_1$ in terms of the Kullback-Leibler divergence $D(P||Q)$. Although in general a bound in the reverse direction is impossible, in many applications the quantity of interest is actually $D^*(P,\eps)$ --- defined, for an arbitrary fixed $P$, as the infimum of $D(P||Q)$ over all distributions $Q$ that are $\eps$-far aw… ▽ More

    Submitted 20 February, 2014; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: A previous version had the title "A Reverse Pinsker Inequality"

    MSC Class: 60F10; 94A15

  16. arXiv:1206.2459  [pdf, other

    cs.IT math.ST stat.ML

    Rényi Divergence and Kullback-Leibler Divergence

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like Kullback-Leibler divergence is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as Kullback-Leibler divergence, and depends on a parameter that is called its order. In particular, the Rényi divergence of order 1 equals the Kullback-Leibler… ▽ More

    Submitted 24 April, 2014; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: To appear in IEEE Transactions on Information Theory

  17. arXiv:1202.1125  [pdf, ps, other

    math.ST cs.IT

    Information Divergence is more chi squared distributed than the chi squared statistics

    Authors: Peter Harremoës, Gábor Tusnády

    Abstract: For testing goodness of fit it is very popular to use either the chi square statistic or G statistics (information divergence). Asymptotically both are chi square distributed so an obvious question is which of the two statistics that has a distribution that is closest to the chi square distribution. Surprisingly, when there is only one degree of freedom it seems like the distribution of informatio… ▽ More

    Submitted 17 June, 2012; v1 submitted 6 February, 2012; originally announced February 2012.

    Comments: 5 pages, accepted for presentation at ISIT 2012

    MSC Class: 62E15

  18. arXiv:1102.2536  [pdf, ps, other

    cs.IT math.PR

    Lower bounds on Information Divergence

    Authors: Peter Harremoës, Christophe Vignat

    Abstract: In this paper we establish lower bounds on information divergence from a distribution to certain important classes of distributions as Gaussian, exponential, Gamma, Poisson, geometric, and binomial. These lower bounds are tight and for several convergence theorems where a rate of convergence can be computed, this rate is determined by the lower bounds proved in this paper. General techniques for g… ▽ More

    Submitted 12 February, 2011; originally announced February 2011.

    Comments: Submitted for the conference ISIT 2011

    MSC Class: 94A15

  19. arXiv:1007.0097  [pdf, ps, other

    cs.IT math.ST

    On Pairs of $f$-divergences and their Joint Range

    Authors: Peter Harremoës, Igor Vajda

    Abstract: We compare two f-divergences and prove that their joint range is the convex hull of the joint range for distributions supported on only two points. Some applications of this result are given.

    Submitted 1 July, 2010; originally announced July 2010.

    Comments: 7 pages, 4 figures

    MSC Class: 94A17; 26Dxx

  20. arXiv:1001.4448  [pdf, ps, other

    cs.IT

    Rényi Divergence and Majorization

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like information divergence (also called Kullback-Leibler divergence or relative entropy) is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as information divergence. We review the most important properties of Rényi divergence, including its r… ▽ More

    Submitted 27 May, 2010; v1 submitted 25 January, 2010; originally announced January 2010.

    MSC Class: 94A17

  21. arXiv:1001.4432  [pdf, ps, other

    cs.IT math.ST

    Joint Range of f-divergences

    Authors: Peter Harremoës, Igor Vajda

    Abstract: We provide a general method for evaluation of the joint range of f-divergences for two different functions f. Via topological arguments we prove that the joint range for general distributions equals the convex hull of the joint range achieved by the distributions on a two-element set. The joint range technique provides important inequalities between different f-divergences with various application… ▽ More

    Submitted 27 May, 2010; v1 submitted 25 January, 2010; originally announced January 2010.

    Comments: Accepted for presentation at ISIT 2010

  22. Thinning, Entropy and the Law of Thin Numbers

    Authors: Peter Harremoes, Oliver Johnson, Ioannis Kontoyiannis

    Abstract: Renyi's "thinning" operation on a discrete random variable is a natural discrete analog of the scaling operation for continuous random variables. The properties of thinning are investigated in an information-theoretic context, especially in connection with information-theoretic inequalities related to Poisson approximation results. The classical Binomial-to-Poisson convergence (sometimes referre… ▽ More

    Submitted 3 June, 2009; originally announced June 2009.

    Journal ref: IEEE Transactions on Information Theory, Vol 56/9, 2010, pages 4228-4244

  23. arXiv:0904.2477  [pdf, other

    cs.IT math.PR

    Joint Range of Rényi Entropies

    Authors: Peter Harremoës

    Abstract: The exact range of the joined values of several Rényi entropies is determined. The method is based on topology with special emphasis on the orientation of the objects studied. Like in the case when only two orders of Rényi entropies are studied one can parametrize upper and lower bounds but an explicit formula for a tight upper or lower bound cannot be given.

    Submitted 16 April, 2009; originally announced April 2009.

    MSC Class: 94A17; 62B10

  24. arXiv:0903.5426  [pdf, ps, other

    cs.IT math.ST

    Testing Goodness-of-Fit via Rate Distortion

    Authors: Peter Harremoes

    Abstract: A framework is developed using techniques from rate distortion theory in statistical testing. The idea is first to do optimal compression according to a certain distortion function and then use information divergence from the compressed empirical distribution to the compressed null hypothesis as statistic. Only very special cases have been studied in more detail, but they indicate that the appro… ▽ More

    Submitted 31 March, 2009; originally announced March 2009.

    MSC Class: 94A34; 62G10

  25. arXiv:0903.5399  [pdf, ps, other

    cs.IT

    Regret and Jeffreys Integrals in Exp. Families

    Authors: Peter Grunwald, Peter Harremoes

    Abstract: The problem of whether minimax redundancy, minimax regret and Jeffreys integrals are finite or infinite are discussed.

    Submitted 31 March, 2009; originally announced March 2009.

  26. arXiv:0901.0015  [pdf, other

    cs.IT math.PR

    Maximum Entropy on Compact Groups

    Authors: Peter Harremoes

    Abstract: On a compact group the Haar probability measure plays the role of uniform distribution. The entropy and rate distortion theory for this uniform distribution is studied. New results and simplified proofs on convergence of convolutions on compact groups are presented and they can be formulated as entropy increases to its maximum. Information theoretic techniques and Markov chains play a crucial ro… ▽ More

    Submitted 29 March, 2009; v1 submitted 30 December, 2008; originally announced January 2009.

    Journal ref: Entropy 2009, 11(2), 222-237