Showing 1–17 of 17 results for author: Bardenet, R

Searching in archive cs.
  1. arXiv:2503.05906  [pdf, other]

    quant-ph cs.LG stat.CO

    Bypassing orthogonalization in the quantum DPP sampler

    Authors: Michaël Fanuel, Rémi Bardenet

    Abstract: Given an $n\times r$ matrix $X$ of rank $r$, consider the problem of sampling $r$ integers $\mathtt{C}\subset \{1, \dots, n\}$ with probability proportional to the squared determinant of the rows of $X$ indexed by $\mathtt{C}$. The distribution of $\mathtt{C}$ is called a projection determinantal point process (DPP). The vanilla classical algorithm to sample a DPP works in two steps, an orthogonal…

    Submitted 21 March, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

    Comments: 44 pages, 16 figures. Minor corrections and details about the sketching cost

  2. arXiv:2502.07285  [pdf, other]

    stat.ML cs.LG math.PR

    Negative Dependence as a toolbox for machine learning: review and new developments

    Authors: Hoang-Son Tran, Vladimir Petrovic, Remi Bardenet, Subhroshekhar Ghosh

    Abstract: Negative dependence is becoming a key driver in advancing learning capabilities beyond the limits of traditional independence. Recent developments have evidenced support towards negatively dependent systems as a learning paradigm in a broad range of fundamental machine learning challenges including optimization, sampling, dimensionality reduction and sparse signal recovery, often surpassing the pe…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Dedicated to the memory of Prof K.R. Parthasarathy: visionary, guru, and scientist par excellence

  3. arXiv:2411.00611  [pdf, other]

    stat.ML cs.LG math.PR

    Small coresets via negative dependence: DPPs, linear statistics, and concentration

    Authors: Rémi Bardenet, Subhroshekhar Ghosh, Hugo Simon-Onfroy, Hoang-Son Tran

    Abstract: Determinantal point processes (DPPs) are random configurations of points with tunable negative dependence. Because sampling is tractable, DPPs are natural candidates for subsampling tasks, such as minibatch selection or coreset construction. A \emph{coreset} is a subset of a (large) training set, such that minimizing an empirical loss averaged over the coreset is a controlled replacement for the i…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: Accepted at NeurIPS 2024 (Spotlight Paper). Authors are listed in alphabetical order

  4. arXiv:2404.14803  [pdf, ps, other]

    cs.DS math-ph math.PR

    Cycling in the forest with Wilson's algorithm

    Authors: Michaël Fanuel, Rémi Bardenet

    Abstract: We consider a probability measure on cycle-rooted spanning forests (CRSFs) introduced by Kenyon. CRSFs are spanning subgraphs, each connected component of which has a unique cycle; they generalize spanning trees. A generalization of Wilson's celebrated CyclePopping algorithm for uniform spanning trees has been proposed for CRSFs, and several concise proofs have been given that the algorithm sample…

    Submitted 9 July, 2025; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 41 pages, 7 figures. Improved presentation and an extra section about a viewpoint on this algorithm using 'Partial Rejection Sampling'

  5. arXiv:2402.19172  [pdf, other]

    eess.SP cs.SD eess.AS math.PR

    Point Processes and spatial statistics in time-frequency analysis

    Authors: Barbara Pascal, Rémi Bardenet

    Abstract: A finite-energy signal is represented by a square-integrable, complex-valued function $t\mapsto s(t)$ of a real variable $t$, interpreted as time. Similarly, a noisy signal is represented by a random process. Time-frequency analysis, a subfield of signal processing, amounts to describing the temporal evolution of the frequency content of a signal. Loosely speaking, if $s$ is the audio recording of…

    Submitted 15 April, 2025; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: To be published as a chapter of the book "Stochastic Geometry: Percolation, Tesselations, Gaussian Fields and Point Processes"

  6. arXiv:2402.11736  [pdf, other]

    cs.LG math.PR stat.ML

    Monte Carlo with kernel-based Gibbs measures: Guarantees for probabilistic herding

    Authors: Martin Rouault, Rémi Bardenet, Mylène Maïda

    Abstract: Kernel herding belongs to a family of deterministic quadratures that seek to minimize the worst-case integration error over a reproducing kernel Hilbert space (RKHS). In spite of strong experimental support, it has proved difficult to show that this worst-case error decreases at a faster rate than the standard square root of the number of quadrature nodes, at least in the usual case where the R…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 16 pages, 2 figures. Comments are welcome

  7. arXiv:2402.08521  [pdf, other]

    eess.SP cs.SD eess.AS

    Benchmarking multi-component signal processing methods in the time-frequency plane

    Authors: Juan M. Miramont, Rémi Bardenet, Pierre Chainais, Francois Auger

    Abstract: Signal processing in the time-frequency plane has a long history and remains a field of methodological innovation. For instance, detection and denoising based on the zeros of the spectrogram have been proposed since 2015, contrasting with a long history of focusing on larger values of the spectrogram. Yet, unlike neighboring fields like optimization and machine learning, time-frequency signal proc…

    Submitted 13 February, 2024; originally announced February 2024.

  8. arXiv:2305.15851  [pdf, other]

    stat.CO cs.LG quant-ph

    On sampling determinantal and Pfaffian point processes on a quantum computer

    Authors: Rémi Bardenet, Michaël Fanuel, Alexandre Feller

    Abstract: DPPs were introduced by Macchi as a model in quantum optics in the 1970s. Since then, they have been widely used as models and subsampling tools in statistics and computer science. Most applications require sampling from a DPP, and given their quantum origin, it is natural to wonder whether sampling a DPP on a quantum computer is easier than on a classical one. We focus here on DPPs over a finite sta…

    Submitted 22 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 53 pages, 9 figures. Additional results about parity of cardinality of PfPP samples. Minor corrections in Section 5 and slight generalization of Lemma 5.4. Extra example and derivations in appendix

  9. arXiv:2208.14797  [pdf, other]

    cs.SI cs.LG stat.ML

    Sparsification of the regularized magnetic Laplacian with multi-type spanning forests

    Authors: Michaël Fanuel, Rémi Bardenet

    Abstract: In this paper, we consider a ${\rm U}(1)$-connection graph, that is, a graph where each oriented edge is endowed with a unit modulus complex number that is conjugated under orientation flip. A natural replacement for the combinatorial Laplacian is then the magnetic Laplacian, an Hermitian matrix that includes information about the graph's connection. Magnetic Laplacians appear, e.g., in the proble…

    Submitted 20 March, 2024; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: 51 pages, 15 figures. Improved presentation of the theoretical results and simulations of larger scale

  10. arXiv:2112.06007  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG math.OC math.PR

    Determinantal point processes based on orthogonal polynomials for sampling minibatches in SGD

    Authors: Remi Bardenet, Subhro Ghosh, Meixia Lin

    Abstract: Stochastic gradient descent (SGD) is a cornerstone of machine learning. When the number N of data items is large, SGD relies on constructing an unbiased estimator of the gradient of the empirical risk using a small subset of the original dataset, called a minibatch. Default minibatch construction involves uniformly sampling a subset of the desired size, but alternatives have been explored for vari…

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: Accepted at NeurIPS 2021 (Spotlight Paper). Authors are listed in alphabetical order

  11. arXiv:2106.14210  [pdf, other]

    cs.LG stat.ML

    Nonparametric estimation of continuous DPPs with kernel methods

    Authors: Michaël Fanuel, Rémi Bardenet

    Abstract: Determinantal point processes (DPPs) are statistical models for repulsive point patterns. Both sampling and inference are tractable for DPPs, a rare feature among models with negative dependence that explains their popularity in machine learning and spatial statistics. Parametric and nonparametric inference methods have been proposed in the finite case, i.e. when the point patterns live in a finite…

    Submitted 27 November, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: 26 pages, 7 figures. To appear at NeurIPS 2021

  12. arXiv:2007.04287  [pdf, other]

    stat.ML cs.LG

    Learning from DPPs via Sampling: Beyond HKPV and symmetry

    Authors: Rémi Bardenet, Subhroshekhar Ghosh

    Abstract: Determinantal point processes (DPPs) have become a significant tool for recommendation systems, feature selection, or summary extraction, harnessing the intrinsic ability of these probabilistic models to facilitate sample diversity. The ability to sample from DPPs is paramount to the empirical investigation of these models. Most exact samplers are variants of a spectral meta-algorithm due to Hough…

    Submitted 8 July, 2020; originally announced July 2020.

  13. arXiv:2002.09677  [pdf, other]

    stat.ML cs.LG math.NA math.PR

    Kernel interpolation with continuous volume sampling

    Authors: Ayoub Belhadji, Rémi Bardenet, Pierre Chainais

    Abstract: A fundamental task in kernel methods is to pick nodes and weights, so as to approximate a given function from an RKHS by the weighted sum of kernel translates located at the nodes. This is the crux of kernel density estimation, kernel quadrature, or interpolation from discrete samples. Furthermore, RKHSs offer a convenient mathematical and computational framework. We introduce and analyse continuo…

    Submitted 22 February, 2020; originally announced February 2020.

  14. arXiv:1906.07832  [pdf, other]

    stat.ML cs.LG

    Kernel quadrature with DPPs

    Authors: Ayoub Belhadji, Rémi Bardenet, Pierre Chainais

    Abstract: We study quadrature rules for functions from an RKHS, using nodes sampled from a determinantal point process (DPP). DPPs are parametrized by a kernel, and we use a truncated and saturated version of the RKHS kernel. This link between the two kernels, along with DPP machinery, leads to relatively tight bounds on the quadrature error that depend on the spectrum of the RKHS kernel. Finally, we expe…

    Submitted 31 December, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  15. arXiv:1812.09771  [pdf, other]

    stat.ML cs.LG

    A determinantal point process for column subset selection

    Authors: Ayoub Belhadji, Rémi Bardenet, Pierre Chainais

    Abstract: Dimensionality reduction is a first step of many machine learning pipelines. Two popular approaches are principal component analysis, which projects onto a small number of well chosen but non-interpretable directions, and feature selection, which selects a small number of the original features. Feature selection can be abstracted as a numerical linear algebra problem called the column subset selec…

    Submitted 23 December, 2018; originally announced December 2018.

  16. arXiv:1809.07258  [pdf, other]

    cs.LG stat.ML

    DPPy: Sampling DPPs with Python

    Authors: Guillaume Gautier, Guillermo Polito, Rémi Bardenet, Michal Valko

    Abstract: Determinantal point processes (DPPs) are specific probability distributions over clouds of points that are used as models and computational tools across physics, probability, statistics, and more recently machine learning. Sampling from DPPs is a challenge and therefore we present DPPy, a Python toolbox that gathers known exact and approximate sampling algorithms for both finite and continuous DPP…

    Submitted 12 August, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: Code at http://github.com/guilgautier/DPPy/ Documentation at http://dppy.readthedocs.io/

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-7

  17. arXiv:1705.10498  [pdf, other]

    stat.ML cs.LG stat.CO

    Zonotope hit-and-run for efficient sampling from projection DPPs

    Authors: Guillaume Gautier, Rémi Bardenet, Michal Valko

    Abstract: Determinantal point processes (DPPs) are distributions over sets of items that model diversity using kernels. Their applications in machine learning include summary extraction and recommendation systems. Yet, the cost of sampling from a DPP is prohibitive in large-scale applications, which has triggered an effort towards efficient approximate samplers. We build a novel MCMC sampler that combines i…

    Submitted 15 June, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: 12 pages, 12 figures, 2 columns, accepted to ICML 2017

    Journal ref: Proceedings of the 34th International Conference on Machine Learning 70 (2017) 1223-1232