Skip to main content

Showing 1–45 of 45 results for author: Nielsen, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.10775  [pdf, other

    cs.LG cs.AI stat.ML

    A Rate-Distortion View of Uncertainty Quantification

    Authors: Ifigeneia Apostolopoulou, Benjamin Eysenbach, Frank Nielsen, Artur Dubrawski

    Abstract: In supervised learning, understanding an input's proximity to the training data can help a model decide whether it has sufficient evidence for reaching a reliable prediction. While powerful probabilistic models such as Gaussian Processes naturally have this property, deep neural networks often lack it. In this paper, we introduce Distance Aware Bottleneck (DAB), i.e., a new method for enriching de… ▽ More

    Submitted 18 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Journal ref: International Conference on Machine Learning, 2024

  2. A bivariate two-state Markov modulated Poisson process for failure modelling

    Authors: Yoel G. Yera, Rosa E. Lillo, Bo F. Nielsen, Pepa Ramírez-Cobo, Fabrizio Ruggeri

    Abstract: Motivated by a real failure dataset in a two-dimensional context, this paper presents an extension of the Markov modulated Poisson process (MMPP) to two dimensions. The one-dimensional MMPP has been proposed for the modeling of dependent and non-exponential inter-failure times (in contexts as queuing, risk or reliability, among others). The novel two-dimensional MMPP allows for dependence between… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Journal ref: Reliability Engineering and System Safety 208(2021) 107318

  3. arXiv:2311.13459  [pdf, other

    cs.LG stat.ML

    The Tempered Hilbert Simplex Distance and Its Application To Non-linear Embeddings of TEMs

    Authors: Ehsan Amid, Frank Nielsen, Richard Nock, Manfred K. Warmuth

    Abstract: Tempered Exponential Measures (TEMs) are a parametric generalization of the exponential family of distributions maximizing the tempered entropy function among positive measures subject to a probability normalization of their power densities. Calculus on TEMs relies on a deformed algebra of arithmetic operators induced by the deformed logarithms used to define the tempered entropy. In this work, we… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  4. arXiv:2307.10644  [pdf, other

    cs.LG stat.ML

    Fisher-Rao distance and pullback SPD cone distances between multivariate normal distributions

    Authors: Frank Nielsen

    Abstract: Data sets of multivariate normal distributions abound in many scientific areas like diffusion tensor imaging, structure tensor computer vision, radar signal processing, machine learning, just to name a few. In order to process those normal data sets for downstream tasks like filtering, classification or clustering, one needs to define proper notions of dissimilarities between normals and paths joi… ▽ More

    Submitted 9 June, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 38 pages, 11 figures

    Journal ref: 2nd Annual Topology, Algebra, and Geometry in Machine Learning Workshop, ICML TAG-ML, 2023

  5. arXiv:2303.05910  [pdf, ps, other

    stat.ML cs.LG

    Product Jacobi-Theta Boltzmann machines with score matching

    Authors: Andrea Pasquale, Daniel Krefl, Stefano Carrazza, Frank Nielsen

    Abstract: The estimation of probability density functions is a non trivial task that over the last years has been tackled with machine learning techniques. Successful applications can be obtained using models inspired by the Boltzmann machine (BM) architecture. In this manuscript, the product Jacobi-Theta Boltzmann machine (pJTBM) is introduced as a restricted version of the Riemann-Theta Boltzmann machine… ▽ More

    Submitted 12 January, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: 7 pages, 3 figures, ACAT22 proceedings

    Report number: TIF-UNIMI-2023-8

  6. arXiv:2302.09738  [pdf, other

    stat.ML cs.LG

    Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning

    Authors: Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we simplify such difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riem… ▽ More

    Submitted 16 March, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: A long version of the ICML 2023 paper. Updated the main text to emphasize challenges of using existing Riemannian methods to estimate sparse and structured SPD matrices

  7. arXiv:2209.07481  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Variational Representations of Annealing Paths: Bregman Information under Monotonic Embedding

    Authors: Rob Brekelmans, Frank Nielsen

    Abstract: Markov Chain Monte Carlo methods for sampling from complex distributions and estimating normalization constants often simulate samples from a sequence of intermediate distributions along an annealing path, which bridges between a tractable initial distribution and a target density of interest. Prior works have constructed annealing paths using quasi-arithmetic means, and interpreted the resulting… ▽ More

    Submitted 6 February, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Published in Information Geometry (Info. Geo. 2024)

  8. arXiv:2206.08598  [pdf, other

    cs.LG stat.ML

    On the Influence of Enforcing Model Identifiability on Learning dynamics of Gaussian Mixture Models

    Authors: Pascal Mattia Esser, Frank Nielsen

    Abstract: A common way to learn and analyze statistical models is to consider operations in the model parameter space. But what happens if we optimize in the parameter space and there is no one-to-one mapping between the parameter space and the underlying statistical model space? Such cases frequently occur for hierarchical models which include statistical mixtures or stochastic neural networks, and these m… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  9. arXiv:2107.10884  [pdf, other

    stat.ML cs.LG

    Structured second-order methods via natural gradient descent

    Authors: Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invar… ▽ More

    Submitted 19 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Fixed some typos and added a new figure. ICML 2021 workshop paper. A short version of arXiv:2102.07405 with a focus on optimization tasks

  10. arXiv:2107.00745  [pdf, other

    cs.LG cs.AI stat.ML

    q-Paths: Generalizing the Geometric Annealing Path using Power Means

    Authors: Vaden Masrani, Rob Brekelmans, Thang Bui, Frank Nielsen, Aram Galstyan, Greg Ver Steeg, Frank Wood

    Abstract: Many common machine learning methods involve the geometric annealing path, a sequence of intermediate densities between two distributions of interest constructed using the geometric average. While alternatives such as the moment-averaging path have demonstrated performance gains in some settings, their practical applicability remains limited by exponential family endpoint assumptions and a lack of… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.07823

  11. arXiv:2102.07405  [pdf, other

    stat.ML cs.LG

    Tractable structured natural gradient descent using local parameterizations

    Authors: Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

    Abstract: Natural-gradient descent (NGD) on structured parameter spaces (e.g., low-rank covariances) is computationally challenging due to difficult Fisher-matrix computations. We address this issue by using \emph{local-parameter coordinates} to obtain a flexible and efficient NGD method that works well for a wide-variety of structured parameterizations. We show four applications where our method (1) genera… ▽ More

    Submitted 17 January, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: An extended version of the ICML 2021 paper. Note: A workshop (short) paper with a focus on optimization tasks can be found at arXiv:2107.10884

  12. arXiv:2012.15480  [pdf, other

    cs.LG cs.IT stat.ML

    Likelihood Ratio Exponential Families

    Authors: Rob Brekelmans, Frank Nielsen, Alireza Makhzani, Aram Galstyan, Greg Ver Steeg

    Abstract: The exponential family is well known in machine learning and statistical physics as the maximum entropy distribution subject to a set of observed constraints, while the geometric mixture path is common in MCMC methods such as annealed importance sampling. Linking these two ideas, recent work has interpreted the geometric mixture path as an exponential family of distributions to analyze the thermod… ▽ More

    Submitted 15 January, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: NeurIPS Workshop on Deep Learning through Information Geometry

  13. arXiv:2007.10677  [pdf, other

    stat.AP physics.soc-ph

    Clustering patterns connecting COVID-19 dynamics and Human mobility using optimal transport

    Authors: Frank Nielsen, Gautier Marti, Sumanta Ray, Saumyadipta Pyne

    Abstract: Social distancing and stay-at-home are among the few measures that are known to be effective in checking the spread of a pandemic such as COVID-19 in a given population. The patterns of dependency between such measures and their effects on disease incidence may vary dynamically and across different populations. We described a new computational framework to measure and compare the temporal relation… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: 16 pages

    Journal ref: Sankhya B (16th March 2021)

  14. arXiv:2002.08345   

    cs.LG stat.ML

    Schoenberg-Rao distances: Entropy-based and geometry-aware statistical Hilbert distances

    Authors: Gaëtan Hadjeres, Frank Nielsen

    Abstract: Distances between probability distributions that take into account the geometry of their sample space,like the Wasserstein or the Maximum Mean Discrepancy (MMD) distances have received a lot of attention in machine learning as they can, for instance, be used to compare probability distributions with disjoint supports. In this paper, we study a class of statistical Hilbert distances that we term th… ▽ More

    Submitted 28 April, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Most results were already known. The distances described therein do not generalize MMD: it is an MMD with a distance-induced kernel (see [Sejdinovic et al. (2013)]

  15. arXiv:1911.12463  [pdf, other

    cs.LG stat.ML

    Information-Geometric Set Embeddings (IGSE): From Sets to Probability Distributions

    Authors: Ke Sun, Frank Nielsen

    Abstract: This letter introduces an abstract learning problem called the "set embedding": The objective is to map sets into probability distributions so as to lose less information. We relate set union and intersection operations with corresponding interpolations of probability distributions. We also demonstrate a preliminary solution with experimental results on toy set embedding examples.

    Submitted 11 December, 2019; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: To be presented at Sets & Partitions (NeurIPS 2019 workshop)

  16. arXiv:1905.11027  [pdf, other

    cs.LG stat.ML

    A Geometric Modeling of Occam's Razor in Deep Learning

    Authors: Ke Sun, Frank Nielsen

    Abstract: Why do deep neural networks (DNNs) benefit from very high dimensional parameter spaces? Their huge parameter complexities vs stunning performances in practice is all the more intriguing and not explainable using the standard theory of model selection for regular models. In this work, we propose a geometrically flavored information-theoretic approach to study this phenomenon. With the belief that s… ▽ More

    Submitted 26 March, 2025; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: This work first appeared under the former title "Lightlike Neuromanifolds, Occam's Razor and Deep Learning"

  17. arXiv:1901.03732  [pdf, ps, other

    math.PR cs.IT cs.LG stat.ML

    The statistical Minkowski distances: Closed-form formula for Gaussian Mixture Models

    Authors: Frank Nielsen

    Abstract: The traditional Minkowski distances are induced by the corresponding Minkowski norms in real-valued vector spaces. In this work, we propose novel statistical symmetric distances based on the Minkowski's inequality for probability densities belonging to Lebesgue spaces. These statistical Minkowski distances admit closed-form formula for Gaussian mixture models when parameterized by integer exponent… ▽ More

    Submitted 17 January, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Comments: 14 pages

  18. arXiv:1901.03634  [pdf, other

    cs.LG stat.ML

    Variation Network: Learning High-level Attributes for Controlled Input Manipulation

    Authors: Gaëtan Hadjeres, Frank Nielsen

    Abstract: This paper presents the Variation Network (VarNet), a generative model providing means to manipulate the high-level attributes of a given input. The originality of our approach is that VarNet is not only capable of handling pre-defined attributes but can also learn the relevant attributes of the dataset by itself. These two settings can also be easily considered at the same time, which makes this… ▽ More

    Submitted 16 September, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

    Comments: 15 pages, 7 figures

  19. arXiv:1812.08113  [pdf, other

    cs.LG stat.ML

    On The Chain Rule Optimal Transport Distance

    Authors: Frank Nielsen, Ke Sun

    Abstract: We define a novel class of distances between statistical multivariate distributions by modeling an optimal transport problem on their marginals with respect to a ground distance defined on their conditionals. These new distances are metrics whenever the ground distance between the marginals is a metric, generalize both the Wasserstein distances between discrete measures and a recently introduced m… ▽ More

    Submitted 2 November, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 23 pages, 6 figures

  20. arXiv:1810.10770  [pdf, other

    cs.LG stat.ML

    Geometry and clustering with metrics derived from separable Bregman divergences

    Authors: Erika Gomes-Gonçalves, Henryk Gzyl, Frank Nielsen

    Abstract: Separable Bregman divergences induce Riemannian metric spaces that are isometric to the Euclidean space after monotone embeddings. We investigate fixed rate quantization and its codebook Voronoi diagrams, and report on experimental performances of partition-based, hierarchical, and soft clustering algorithms with respect to these Riemann-Bregman distances.

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: 23 pages

  21. The Bregman chord divergence

    Authors: Frank Nielsen, Richard Nock

    Abstract: Distances are fundamental primitives whose choice significantly impacts the performances of algorithms in machine learning and signal processing. However selecting the most appropriate distance for a given task is an endeavor. Instead of testing one by one the entries of an ever-expanding dictionary of {\em ad hoc} distances, one rather prefers to consider parametric classes of distances that are… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 10 pages

    Journal ref: GSI 2019: Geometric Science of Information pp 299-308

  22. arXiv:1810.01118  [pdf, other

    cs.LG cs.CV stat.ML

    Sinkhorn AutoEncoders

    Authors: Giorgio Patrini, Rianne van den Berg, Patrick Forré, Marcello Carioni, Samarth Bhargav, Max Welling, Tim Genewein, Frank Nielsen

    Abstract: Optimal transport offers an alternative to maximum likelihood for learning generative autoencoding models. We show that minimizing the p-Wasserstein distance between the generator and the true data distribution is equivalent to the unconstrained min-min optimization of the p-Wasserstein distance between the encoder aggregated posterior and the prior in latent space, plus a reconstruction error. We… ▽ More

    Submitted 15 July, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: Accepted for oral presentation at UAI19

  23. arXiv:1808.08271  [pdf, other

    cs.LG cs.IT stat.ML

    An elementary introduction to information geometry

    Authors: Frank Nielsen

    Abstract: In this survey, we describe the fundamental differential-geometric structures of information manifolds, state the fundamental theorem of information geometry, and illustrate some use cases of these information manifolds in information sciences. The exposition is self-contained by concisely introducing the necessary concepts of differential geometry, but proofs are omitted for brevity.

    Submitted 6 September, 2020; v1 submitted 16 August, 2018; originally announced August 2018.

    Comments: 56 pages, 16 figures

    Journal ref: Entropy 2020, 22(10), 1100

  24. arXiv:1806.11311  [pdf, other

    cs.LG cs.CV stat.ML

    Guaranteed Deterministic Bounds on the Total Variation Distance between Univariate Mixtures

    Authors: Frank Nielsen, Ke Sun

    Abstract: The total variation distance is a core statistical distance between probability measures that satisfies the metric axioms, with value always falling in $[0,1]$. This distance plays a fundamental role in machine learning and signal processing: It is a member of the broader class of $f$-divergences, and it is related to the probability of error in Bayesian hypothesis testing. Since the total variati… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 11 pages, 2 figures

  25. arXiv:1806.08195  [pdf, other

    stat.ML cs.LG

    Probabilistic PARAFAC2

    Authors: Philip J. H. Jørgensen, Søren F. V. Nielsen, Jesper L. Hinrich, Mikkel N. Schmidt, Kristoffer H. Madsen, Morten Mørup

    Abstract: The PARAFAC2 is a multimodal factor analysis model suitable for analyzing multi-way data when one of the modes has incomparable observation units, for example because of differences in signal sampling or batch sizes. A fully probabilistic treatment of the PARAFAC2 is desirable in order to improve robustness to noise and provide a well founded principle for determining the number of factors, but ch… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: 16 pages (incl. 4 pages of supplemental material), 5 figures

  26. arXiv:1806.00149  [pdf, other

    cs.NE cs.LG stat.ML

    q-Neurons: Neuron Activations based on Stochastic Jackson's Derivative Operators

    Authors: Frank Nielsen, Ke Sun

    Abstract: We propose a new generic type of stochastic neurons, called $q$-neurons, that considers activation functions based on Jackson's $q$-derivatives with stochastic parameters $q$. Our generalization of neural network architectures with $q$-neurons is shown to be both scalable and very easy to implement. We demonstrate experimentally consistently improved performances over state-of-the-art standard act… ▽ More

    Submitted 13 June, 2018; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: 12 pages, 5 figures, 1 table

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems (2020)

  27. arXiv:1803.07225  [pdf, other

    cs.LG stat.ML

    Monte Carlo Information Geometry: The dually flat case

    Authors: Frank Nielsen, Gaëtan Hadjeres

    Abstract: Exponential families and mixture families are parametric probability models that can be geometrically studied as smooth statistical manifolds with respect to any statistical divergence like the Kullback-Leibler (KL) divergence or the Hellinger divergence. When equipping a statistical manifold with the KL divergence, the induced manifold structure is dually flat, and the KL divergence between distr… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: 25 pages

  28. arXiv:1710.04099  [pdf, other

    stat.ML cs.CL cs.LG

    Wembedder: Wikidata entity embedding web service

    Authors: Finn Årup Nielsen

    Abstract: I present a web service for querying an embedding of entities in the Wikidata knowledge graph. The embedding is trained on the Wikidata dump using Gensim's Word2Vec implementation and a simple graph walk. A REST API is implemented. Together with the Wikidata API the web service exposes a multilingual resource for over 600'000 Wikidata items and properties.

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 3 pages, 2 figures

    ACM Class: I.2.4; H.3.5

  29. arXiv:1709.06404  [pdf, other

    cs.AI cs.LG stat.ML

    Interactive Music Generation with Positional Constraints using Anticipation-RNNs

    Authors: Gaëtan Hadjeres, Frank Nielsen

    Abstract: Recurrent Neural Networks (RNNS) are now widely used on sequence generation tasks due to their ability to learn long-range dependencies and to generate sequences of arbitrary length. However, their left-to-right generation procedure only allows a limited control from a potential user which makes them unsuitable for interactive and creative usages such as interactive music generation. This paper in… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Comments: 9 pages, 7 figures

  30. arXiv:1707.04588  [pdf, other

    cs.LG cs.AI stat.ML

    GLSR-VAE: Geodesic Latent Space Regularization for Variational AutoEncoder Architectures

    Authors: Gaëtan Hadjeres, Frank Nielsen, François Pachet

    Abstract: VAEs (Variational AutoEncoders) have proved to be powerful in the context of density modeling and have been used in a variety of contexts for creative purposes. In many settings, the data we model possesses continuous attributes that we would like to take into account at generation time. We propose in this paper GLSR-VAE, a Geodesic Latent Space Regularization for the Variational AutoEncoder archi… ▽ More

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: 11 pages

  31. arXiv:1612.04555  [pdf, ps, other

    stat.AP stat.ML

    Scalable Group Level Probabilistic Sparse Factor Analysis

    Authors: Jesper L. Hinrich, Søren F. V. Nielsen, Nicolai A. B. Riis, Casper T. Eriksen, Jacob Frøsig, Marco D. F. Kristensen, Mikkel N. Schmidt, Kristoffer H. Madsen, Morten Mørup

    Abstract: Many data-driven approaches exist to extract neural representations of functional magnetic resonance imaging (fMRI) data, but most of them lack a proper probabilistic formulation. We propose a group level scalable probabilistic sparse factor analysis (psFA) allowing spatially sparse maps, component pruning using automatic relevance determination (ARD) and subject specific heteroscedastic spatial n… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

    Comments: 10 pages plus 5 pages appendix, Submitted to ICASSP 17

  32. arXiv:1610.09659  [pdf, other

    stat.ML

    Exploring and measuring non-linear correlations: Copulas, Lightspeed Transportation and Clustering

    Authors: Gautier Marti, Sebastien Andler, Frank Nielsen, Philippe Donnat

    Abstract: We propose a methodology to explore and measure the pairwise correlations that exist between variables in a dataset. The methodology leverages copulas for encoding dependence between two variables, state-of-the-art optimal transport for providing a relevant geometry to the copulas, and clustering for summarizing the main dependence patterns found between the variables. Some of the clusters centers… ▽ More

    Submitted 30 October, 2016; originally announced October 2016.

  33. arXiv:1606.05850  [pdf, other

    cs.LG cs.IT stat.ML

    Guaranteed bounds on the Kullback-Leibler divergence of univariate mixtures using piecewise log-sum-exp inequalities

    Authors: Frank Nielsen, Ke Sun

    Abstract: Information-theoretic measures such as the entropy, cross-entropy and the Kullback-Leibler divergence between two mixture models is a core primitive in many signal processing tasks. Since the Kullback-Leibler divergence of mixtures provably does not admit a closed-form formula, it is in practice either estimated using costly Monte-Carlo stochastic integration, approximated, or bounded using variou… ▽ More

    Submitted 16 August, 2016; v1 submitted 19 June, 2016; originally announced June 2016.

    Comments: 20 pages, 3 figures

  34. Optimal Transport vs. Fisher-Rao distance between Copulas for Clustering Multivariate Time Series

    Authors: Gautier Marti, Sébastien Andler, Frank Nielsen, Philippe Donnat

    Abstract: We present a methodology for clustering N objects which are described by multivariate time series, i.e. several sequences of real-valued random variables. This clustering methodology leverages copulas which are distributions encoding the dependence structure between several random variables. To take fully into account the dependence information while clustering, we need a distance between copulas.… ▽ More

    Submitted 14 November, 2016; v1 submitted 28 April, 2016; originally announced April 2016.

    Comments: Accepted at IEEE Workshop on Statistical Signal Processing (SSP 2016)

  35. arXiv:1603.07822  [pdf, other

    q-fin.ST stat.ME

    On clustering financial time series: a need for distances between dependent random variables

    Authors: Gautier Marti, Frank Nielsen, Philippe Donnat, Sébastien Andler

    Abstract: The following working document summarizes our work on the clustering of financial time series. It was written for a workshop on information geometry and its application for image and signal processing. This workshop brought several experts in pure and applied mathematics together with applied researchers from medical imaging, radar signal processing and finance. The authors belong to the latter gr… ▽ More

    Submitted 25 March, 2016; originally announced March 2016.

    Comments: Work presented during a workshop on Information Geometry at the International Centre for Mathematical Sciences, Edinburgh, UK

  36. arXiv:1603.04017  [pdf, other

    stat.ML q-fin.ST

    Clustering Financial Time Series: How Long is Enough?

    Authors: Gautier Marti, Sébastien Andler, Frank Nielsen, Philippe Donnat

    Abstract: Researchers have used from 30 days to several years of daily returns as source data for clustering financial time series based on their correlations. This paper sets up a statistical framework to study the validity of such practices. We first show that clustering correlated random variables from their observed values is statistically consistent. Then, we also give a first empirical answer to the m… ▽ More

    Submitted 14 April, 2016; v1 submitted 13 March, 2016; originally announced March 2016.

    Comments: Accepted at IJCAI 2016

  37. arXiv:1602.02450  [pdf, ps, other

    cs.LG stat.ML

    Loss factorization, weakly supervised learning and label noise robustness

    Authors: Giorgio Patrini, Frank Nielsen, Richard Nock, Marcello Carioni

    Abstract: We prove that the empirical risk of most well-known loss functions factors into a linear term aggregating all labels with a term that is label free, and can further be expressed by sums of the loss. This holds true even for non-smooth, non-convex losses and in any RKHS. The first term is a (kernel) mean operator --the focal quantity of this work-- which we characterize as the sufficient statistic… ▽ More

    Submitted 9 February, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

  38. arXiv:1601.00496  [pdf, other

    stat.AP q-bio.NC stat.ML

    Nonparametric Modeling of Dynamic Functional Connectivity in fMRI Data

    Authors: Søren F. V. Nielsen, Kristoffer H. Madsen, Rasmus Røge, Mikkel N. Schmidt, Morten Mørup

    Abstract: Dynamic functional connectivity (FC) has in recent years become a topic of interest in the neuroimaging community. Several models and methods exist for both functional magnetic resonance imaging (fMRI) and electroencephalography (EEG), and the results point towards the conclusion that FC exhibits dynamic changes. The existing approaches modeling dynamic connectivity have primarily been based on ti… ▽ More

    Submitted 8 June, 2016; v1 submitted 4 January, 2016; originally announced January 2016.

    Comments: 8 pages, 1 figure. Presented at the Machine Learning and Interpretation in Neuroimaging Workshop (MLINI-2015), 2015 (arXiv:1605.04435)

    Report number: MLINI/2015/08

  39. arXiv:1509.08144  [pdf, other

    cs.LG stat.ML

    Optimal Copula Transport for Clustering Multivariate Time Series

    Authors: Gautier Marti, Frank Nielsen, Philippe Donnat

    Abstract: This paper presents a new methodology for clustering multivariate time series leveraging optimal transport between copulas. Copulas are used to encode both (i) intra-dependence of a multivariate time series, and (ii) inter-dependence between two time series. Then, optimal copula transport allows us to define two distances between multivariate time series: (i) one for measuring intra-dependence dis… ▽ More

    Submitted 11 January, 2016; v1 submitted 27 September, 2015; originally announced September 2015.

    Comments: Accepted at ICASSP 2016

  40. arXiv:1506.09163  [pdf, other

    cs.CE stat.ME

    Comment partitionner automatiquement des marches aléatoires ? Avec application à la finance quantitative

    Authors: Gautier Marti, Frank Nielsen, Philippe Very, Philippe Donnat

    Abstract: We present in this paper a novel non-parametric approach useful for clustering Markov processes. We introduce a pre-processing step consisting in mapping multivariate independent and identically distributed samples from random variables to a generic non-parametric representation which factorizes dependency and marginal distribution apart without losing any. An associated metric is defined where th… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.

    Comments: in French

  41. arXiv:1406.6314  [pdf, other

    cs.LG cs.CV cs.IR stat.ML

    Further heuristics for $k$-means: The merge-and-split heuristic and the $(k,l)$-means

    Authors: Frank Nielsen, Richard Nock

    Abstract: Finding the optimal $k$-means clustering is NP-hard in general and many heuristics have been designed for minimizing monotonically the $k$-means objective. We first show how to extend Lloyd's batched relocation heuristic and Hartigan's single-point relocation heuristic to take into account empty-cluster and single-point cluster events, respectively. Those events tend to increasingly occur when… ▽ More

    Submitted 22 June, 2014; originally announced June 2014.

    Comments: 14 pages

  42. arXiv:1303.7286  [pdf, other

    cs.IT cs.LG stat.ML

    On the symmetrical Kullback-Leibler Jeffreys centroids

    Authors: Frank Nielsen

    Abstract: Due to the success of the bag-of-word modeling paradigm, clustering histograms has become an important ingredient of modern information processing. Clustering histograms can be performed using the celebrated $k$-means centroid-based algorithm. From the viewpoint of applications, it is usually required to deal with symmetric distances. In this letter, we consider the Jeffreys divergence that symmet… ▽ More

    Submitted 22 January, 2014; v1 submitted 28 March, 2013; originally announced March 2013.

    Comments: 17 pages, 1 figure, source code in R

    Journal ref: IEEE Signal Processing Letters (Volume:20 , Issue: 7 ), pp. 657-660, 2013

  43. arXiv:1206.2742  [pdf, other

    cs.DL cs.AI stat.AP

    Online open neuroimaging mass meta-analysis

    Authors: Finn Årup Nielsen, Matthew J. Kempton, Steven C. R. Williams

    Abstract: We describe a system for meta-analysis where a wiki stores numerical data in a simple format and a web service performs the numerical computation. We initially apply the system on multiple meta-analyses of structural neuroimaging data results. The described system allows for mass meta-analysis, e.g., meta-analysis across multiple brain regions and multiple mental disorders.

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: 5 pages, 4 figures SePublica 2012, ESWC 2012 Workshop, 28 May 2012, Heraklion, Greece

    MSC Class: 68U35 ACM Class: H.5.4; J.3; G.3

  44. arXiv:1206.2054  [pdf, other

    stat.ME

    Maximum A Posteriori Covariance Estimation Using a Power Inverse Wishart Prior

    Authors: Søren Feodor Nielsen, Jon Sporring

    Abstract: The estimation of the covariance matrix is an initial step in many multivariate statistical methods such as principal components analysis and factor analysis, but in many practical applications the dimensionality of the sample space is large compared to the number of samples, and the usual maximum likelihood estimate is poor. Typically, improvements are obtained by modelling or regularization. Fro… ▽ More

    Submitted 10 June, 2012; originally announced June 2012.

    Comments: 29 pages, 8 figures, 2 tables

  45. $k$-MLE: A fast algorithm for learning statistical mixture models

    Authors: Frank Nielsen

    Abstract: We describe $k$-MLE, a fast and efficient local search algorithm for learning finite statistical mixtures of exponential families such as Gaussian mixture models. Mixture models are traditionally learned using the expectation-maximization (EM) soft clustering technique that monotonically increases the incomplete (expected complete) likelihood. Given prescribed mixture weights, the hard clustering… ▽ More

    Submitted 23 March, 2012; originally announced March 2012.

    Comments: 31 pages, Extend preliminary paper presented at IEEE ICASSP 2012