Skip to main content

Showing 1–2 of 2 results for author: Reshef, Y A

Searching in archive math. Search in all archives.
.
  1. arXiv:1505.02212  [pdf, other

    math.ST cs.LG q-bio.QM stat.ME stat.ML

    Equitability, interval estimation, and statistical power

    Authors: Yakir A. Reshef, David N. Reshef, Pardis C. Sabeti, Michael M. Mitzenmacher

    Abstract: For analysis of a high-dimensional dataset, a common approach is to test a null hypothesis of statistical independence on all variable pairs using a non-parametric measure of dependence. However, because this approach attempts to identify any non-trivial relationship no matter how weak, it often identifies too many relationships to be useful. What is needed is a way of identifying a smaller set of… ▽ More

    Submitted 12 May, 2015; v1 submitted 8 May, 2015; originally announced May 2015.

    Comments: Yakir A. Reshef and David N. Reshef are co-first authors, Pardis C. Sabeti and Michael M. Mitzenmacher are co-last authors. This paper, together with arXiv:1505.02212, subsumes arXiv:1408.4908

  2. arXiv:1408.4908  [pdf, other

    stat.ME cs.IT math.ST q-bio.QM stat.ML

    Theoretical Foundations of Equitability and the Maximal Information Coefficient

    Authors: Yakir A. Reshef, David N. Reshef, Pardis C. Sabeti, Michael Mitzenmacher

    Abstract: The maximal information coefficient (MIC) is a tool for finding the strongest pairwise relationships in a data set with many variables (Reshef et al., 2011). MIC is useful because it gives similar scores to equally noisy relationships of different types. This property, called {\em equitability}, is important for analyzing high-dimensional data sets. Here we formalize the theory behind both equit… ▽ More

    Submitted 12 May, 2015; v1 submitted 21 August, 2014; originally announced August 2014.

    Comments: 46 pages, 3 figures, 2 tables. This paper has been subsumed by arXiv:1505.02213 and arXiv:1505.02212. Please cite those papers instead