Skip to main content

Showing 1–4 of 4 results for author: Ukkonen, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:1707.07576  [pdf, ps, other

    stat.ML cs.LG

    Interpreting Classifiers through Attribute Interactions in Datasets

    Authors: Andreas Henelius, Kai Puolamäki, Antti Ukkonen

    Abstract: In this work we present the novel ASTRID method for investigating which attribute interactions classifiers exploit when making predictions. Attribute interactions in classification tasks mean that two or more attributes together provide stronger evidence for a particular class label. Knowledge of such interactions makes models more interpretable by revealing associations between attributes. This h… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: presented at 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017), Sydney, NSW, Australia

  2. arXiv:1701.05763  [pdf, other

    stat.AP stat.ML

    Multivariate Confidence Intervals

    Authors: Jussi Korpela, Emilia Oikarinen, Kai Puolamäki, Antti Ukkonen

    Abstract: Confidence intervals are a popular way to visualize and analyze data distributions. Unlike p-values, they can convey information both about statistical significance as well as effect size. However, very little work exists on applying confidence intervals to multivariate data. In this paper we define confidence intervals for multivariate data that extend the one-dimensional definition in a natural… ▽ More

    Submitted 20 January, 2017; originally announced January 2017.

    Comments: A short version of this paper appeared in the 2017 SIAM International Conference on Data Mining, SDM'17. This extended version contains proofs of theorems in the appendix

    MSC Class: 62G15; 62H99; 62M10; ACM Class: G.3

  3. arXiv:1612.07597  [pdf, other

    stat.ML cs.LG

    Finding Statistically Significant Attribute Interactions

    Authors: Andreas Henelius, Antti Ukkonen, Kai Puolamäki

    Abstract: In many data exploration tasks it is meaningful to identify groups of attribute interactions that are specific to a variable of interest. For instance, in a dataset where the attributes are medical markers and the variable of interest (class variable) is binary indicating presence/absence of disease, we would like to know which medical markers interact with respect to the binary class label. These… ▽ More

    Submitted 16 March, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: 9 pages, 4 tables, 1 figure

  4. arXiv:1612.00086  [pdf, other

    cs.LG stat.ML

    Semi-supervised Kernel Metric Learning Using Relative Comparisons

    Authors: Ehsan Amid, Aristides Gionis, Antti Ukkonen

    Abstract: We consider the problem of metric learning subject to a set of constraints on relative-distance comparisons between the data items. Such constraints are meant to reflect side-information that is not expressed directly in the feature vectors of the data items. The relative-distance constraints used in this work are particularly effective in expressing structures at finer level of detail than must-l… ▽ More

    Submitted 3 December, 2016; v1 submitted 30 November, 2016; originally announced December 2016.