Skip to main content

Showing 1–24 of 24 results for author: Ralaivola, L

.
  1. arXiv:2310.01973  [pdf, other

    cs.LG cs.DC

    Federated Wasserstein Distance

    Authors: Alain Rakotomamonjy, Kimia Nadjahi, Liva Ralaivola

    Abstract: We introduce a principled way of computing the Wasserstein distance between two distributions in a federated manner. Namely, we show how to estimate the Wasserstein distance between two samples stored and kept on different devices/clients whilst a central entity/server orchestrates the computations (again, without having access to the samples). To achieve this feat, we take advantage of the geomet… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 23 pages

  2. arXiv:2301.11447  [pdf, other

    cs.LG cs.AI stat.ML

    Personalised Federated Learning On Heterogeneous Feature Spaces

    Authors: Alain Rakotomamonjy, Maxime Vono, Hamlet Jesse Medina Ruiz, Liva Ralaivola

    Abstract: Most personalised federated learning (FL) approaches assume that raw data of all clients are defined in a common subspace i.e. all clients store their data according to the same schema. For real-world applications, this assumption is restrictive as clients, having their own systems to collect and then store data, may use heterogeneous data representations. We aim at filling this gap. To this end,… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  3. arXiv:2206.03230  [pdf, other

    stat.ML cs.LG

    Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances

    Authors: Ruben Ohana, Kimia Nadjahi, Alain Rakotomamonjy, Liva Ralaivola

    Abstract: The Sliced-Wasserstein distance (SW) is a computationally efficient and theoretically grounded alternative to the Wasserstein distance. Yet, the literature on its statistical properties -- or, more accurately, its generalization properties -- with respect to the distribution of slices, beyond the uniform measure, is scarce. To bring new contributions to this line of research, we leverage the PAC-B… ▽ More

    Submitted 31 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  4. arXiv:2107.01848  [pdf, other

    cs.LG stat.ML

    Differentially Private Sliced Wasserstein Distance

    Authors: Alain Rakotomamonjy, Liva Ralaivola

    Abstract: Developing machine learning methods that are privacy preserving is today a central topic of research, with huge practical impacts. Among the numerous ways to address privacy-preserving learning, we here take the perspective of computing the divergences between distributions under the Differential Privacy (DP) framework -- being able to compute divergences between distributions is pivotal for many… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Journal ref: International Conference of Machine Learning, Jul 2021, Virtual, France

  5. arXiv:2106.03645  [pdf, other

    cs.LG cs.CR

    Photonic Differential Privacy with Direct Feedback Alignment

    Authors: Ruben Ohana, Hamlet J. Medina Ruiz, Julien Launay, Alessandro Cappelli, Iacopo Poli, Liva Ralaivola, Alain Rakotomamonjy

    Abstract: Optical Processing Units (OPUs) -- low-power photonic chips dedicated to large scale random projections -- have been used in previous work to train deep neural networks using Direct Feedback Alignment (DFA), an effective alternative to backpropagation. Here, we demonstrate how to leverage the intrinsic noise of optical random projections to build a differentially private DFA mechanism, making OPUs… ▽ More

    Submitted 25 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Journal ref: NeurIPS 2021

  6. arXiv:2007.00935  [pdf, other

    cs.LG quant-ph stat.ML

    Partial Trace Regression and Low-Rank Kraus Decomposition

    Authors: Hachem Kadri, Stéphane Ayache, Riikka Huusari, Alain Rakotomamonjy, Liva Ralaivola

    Abstract: The trace regression model, a direct extension of the well-studied linear regression model, allows one to map matrices to real-valued outputs. We here introduce an even more general model, namely the partial-trace regression model, a family of linear mappings from matrix-valued inputs to matrix-valued outputs; this model subsumes the trace regression model and thus the linear regression model. Bor… ▽ More

    Submitted 25 August, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Journal ref: International Conference on Machine Learning, Jul 2020, Vienne (Online), Austria

  7. arXiv:2002.06395  [pdf, ps, other

    cs.LG cs.AI quant-ph stat.ML

    Quantum Bandits

    Authors: Balthazar Casalé, Giuseppe Di Molfetta, Hachem Kadri, Liva Ralaivola

    Abstract: We consider the quantum version of the bandit problem known as {\em best arm identification} (BAI). We first propose a quantum modeling of the BAI problem, which assumes that both the learning agent and the environment are quantum; we then propose an algorithm based on quantum amplitude amplification to solve BAI. We formally analyze the behavior of the algorithm on all instances of the problem an… ▽ More

    Submitted 22 September, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

    Comments: All your comments are very welcome!

    Journal ref: Quantum Machine Intelligence 2, 1-7 (2020)

  8. arXiv:1908.08713  [pdf, other

    cs.LG stat.ML

    QuicK-means: Acceleration of K-means by learning a fast transform

    Authors: Luc Giffon, Valentin Emiya, Liva Ralaivola, Hachem Kadri

    Abstract: K-means -- and the celebrated Lloyd algorithm -- is more than the clustering method it was originally designed to be. It has indeed proven pivotal to help increase the speed of many machine learning and data analysis techniques such as indexing, nearest-neighbor search and prediction, data compression; its beneficial use has been shown to carry over to the acceleration of kernel machines (when usi… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

  9. arXiv:1905.10443  [pdf, other

    math.OC eess.SP math.CT

    Recovery and convergence rate of the Frank-Wolfe Algorithm for the m-EXACT-SPARSE Problem

    Authors: Farah Cherfaoui, Valentin Emiya, Liva Ralaivola, Sandrine Anthoine

    Abstract: We study the properties of the Frank-Wolfe algorithm to solve the m-EXACT-SPARSE reconstruction problem, where a signal y must be expressed as a sparse linear combination of a predefined set of atoms, called dictionary. We prove that when the signal is sparse enough with respect to the coherence of the dictionary, then the iterative process implemented by the Frank-Wolfe algorithm only recruits at… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  10. arXiv:1812.07201  [pdf, ps, other

    cs.LG stat.ML

    Frank-Wolfe Algorithm for the Exact Sparse Problem

    Authors: Farah Cherfaoui, Valentin Emiya, Liva Ralaivola, Sandrine Anthoine

    Abstract: In this paper, we study the properties of the Frank-Wolfe algorithm to solve the \ExactSparse reconstruction problem. We prove that when the dictionary is quasi-incoherent, at each iteration, the Frank-Wolfe algorithm picks up an atom indexed by the support. We also prove that when the dictionary is quasi-incoherent, there exists an iteration beyond which the algorithm converges exponentially fast… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Journal ref: iTWIST: international Traveling Workshop on Interactions between low-complexity data models and Sensing Techniques, Nov 2018, Marseille, France. https://sites.google.com/view/itwist18

  11. arXiv:1811.01404  [pdf, ps, other

    math.PR

    Dependency-dependent Bounds for Sums of Dependent Random Variables

    Authors: Christoph H. Lampert, Liva Ralaivola, Alexander Zimin

    Abstract: We consider the problem of bounding large deviations for non-i.i.d. random variables that are allowed to have arbitrary dependencies. Previous works typically assumed a specific dependence structure, namely the existence of independent components. Bounds that depend on the degree of dependence between the observations have only been studied in the theory of mixing processes, where variables are ti… ▽ More

    Submitted 4 November, 2018; originally announced November 2018.

  12. arXiv:1508.06477  [pdf, other

    cs.LG

    Greedy methods, randomization approaches and multi-arm bandit algorithms for efficient sparsity-constrained optimization

    Authors: A Rakotomamonjy, S Koço, Liva Ralaivola

    Abstract: Several sparsity-constrained algorithms such as Orthogonal Matching Pursuit or the Frank-Wolfe algorithm with sparsity constraints work by iteratively selecting a novel atom to add to the current non-zero set of variables. This selection step is usually performed by computing the gradient and then by looking for the gradient component with maximal absolute entry. This step can be computationally e… ▽ More

    Submitted 22 August, 2016; v1 submitted 26 August, 2015; originally announced August 2015.

  13. arXiv:1508.02986  [pdf, other

    cs.LG

    From Cutting Planes Algorithms to Compression Schemes and Active Learning

    Authors: Liva Ralaivola, Ugo Louche

    Abstract: Cutting-plane methods are well-studied localization(and optimization) algorithms. We show that they provide a natural framework to perform machinelearning ---and not just to solve optimization problems posed by machinelearning--- in addition to their intended optimization use. In particular, theyallow one to learn sparse classifiers and provide good compression schemes.Moreover, we show that very… ▽ More

    Submitted 12 August, 2015; originally announced August 2015.

    Comments: IJCNN 2015, Jul 2015, Killarney, Ireland. 2015, \<http://www.ijcnn.org/\&gt

  14. Unconfused ultraconservative multiclass algorithms

    Authors: Ugo Louche, Liva Ralaivola

    Abstract: We tackle the problem of learning linear classifiers from noisy datasets in a multiclass setting. The two-class version of this problem was studied a few years ago where the proposed approaches to combat the noise revolve around a Per-ceptron learning scheme fed with peculiar examples computed through a weighted average of points from the noisy training set. We propose to build upon these approach… ▽ More

    Submitted 24 June, 2015; originally announced June 2015.

    Report number: MLJ-2015

    Journal ref: Machine Learning, Springer Verlag (Germany), 2015, Machine learning, 99 (2), pp.351

  15. arXiv:1501.03001  [pdf, other

    stat.ML cs.LG

    On Generalizing the C-Bound to the Multiclass and Multi-label Settings

    Authors: Francois Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: The C-bound, introduced in Lacasse et al., gives a tight upper bound on the risk of a binary majority vote classifier. In this work, we present a first step towards extending this work to more complex outputs, by providing generalizations of the C-bound to the multiclass and multi-label settings.

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Representation and Learning Methods for Complex Outputs, Dec 2014, Montr{é}al, Canada

  16. Dynamic Screening: Accelerating First-Order Algorithms for the Lasso and Group-Lasso

    Authors: Antoine Bonnefoy, Valentin Emiya, Liva Ralaivola, Rémi Gribonval

    Abstract: Recent computational strategies based on screening tests have been proposed to accelerate algorithms addressing penalized sparse regression problems such as the Lasso. Such approaches build upon the idea that it is worth dedicating some small computational effort to locate inactive atoms and remove them from the dictionary in a preprocessing stage so that the regression algorithm working with a sm… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

  17. arXiv:1408.1336  [pdf, other

    stat.ML

    On the Generalization of the C-Bound to Structured Output Ensemble Methods

    Authors: François Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: This paper generalizes an important result from the PAC-Bayesian literature for binary classification to the case of ensemble methods for structured outputs. We prove a generic version of the \Cbound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. This bound may advantageously $(i)$ be app… ▽ More

    Submitted 15 June, 2015; v1 submitted 6 August, 2014; originally announced August 2014.

  18. arXiv:1406.6020  [pdf, ps, other

    cs.LG

    Stationary Mixing Bandits

    Authors: Julien Audiffren, Liva Ralaivola

    Abstract: We study the bandit problem where arms are associated with stationary phi-mixing processes and where rewards are therefore dependent: the question that arises from this setting is that of recovering some independence by ignoring the value of some rewards. As we shall see, the bandit problem we tackle requires us to address the exploration/exploitation/independence trade-off. To do so, we provide a… ▽ More

    Submitted 23 June, 2014; originally announced June 2014.

  19. arXiv:1403.5115  [pdf, other

    cs.LG

    Unconfused Ultraconservative Multiclass Algorithms

    Authors: Ugo Louche, Liva Ralaivola

    Abstract: We tackle the problem of learning linear classifiers from noisy datasets in a multiclass setting. The two-class version of this problem was studied a few years ago by, e.g. Bylander (1994) and Blum et al. (1996): in these contributions, the proposed approaches to fight the noise revolve around a Perceptron learning scheme fed with peculiar examples computed through a weighted average of points fro… ▽ More

    Submitted 20 March, 2014; originally announced March 2014.

    Comments: ACML, Australia (2013)

  20. arXiv:1202.6228  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Generalization Bound on Confusion Matrix for Multi-Class Classification

    Authors: Emilie Morvant, Sokol Koço, Liva Ralaivola

    Abstract: In this work, we propose a PAC-Bayes bound for the generalization risk of the Gibbs classifier in the multi-class classification framework. The novelty of our work is the critical use of the confusion matrix of a classifier as an error measure; this puts our contribution in the line of work aiming at dealing with performance measure that are richer than mere scalar criterion such as the misclassif… ▽ More

    Submitted 22 October, 2013; v1 submitted 28 February, 2012; originally announced February 2012.

    Comments: Arxiv: https://arxiv.boxedpaper.com/abs/1202.6228, Accepted at ICML 2012

  21. arXiv:1202.6221  [pdf, ps, other

    cs.LG

    Confusion Matrix Stability Bounds for Multiclass Classification

    Authors: Pierre Machart, Liva Ralaivola

    Abstract: In this paper, we provide new theoretical results on the generalization properties of learning algorithms for multiclass classification problems. The originality of our work is that we propose to use the confusion matrix of a classifier as a measure of its quality; our contribution is in the line of work which attempts to set up and study the statistical properties of new evaluation measures such… ▽ More

    Submitted 24 May, 2012; v1 submitted 28 February, 2012; originally announced February 2012.

  22. arXiv:1201.2416  [pdf, other

    cs.LG

    Stochastic Low-Rank Kernel Learning for Regression

    Authors: Pierre Machart, Thomas Peel, Liva Ralaivola, Sandrine Anthoine, Hervé Glotin

    Abstract: We present a novel approach to learn a kernel-based regression function. It is based on the useof conical combinations of data-based parameterized kernels and on a new stochastic convex optimization procedure of which we establish convergence guarantees. The overall learning procedure has the nice properties that a) the learned conical combination is automatically designed to perform the regressio… ▽ More

    Submitted 11 January, 2012; originally announced January 2012.

    Comments: International Conference on Machine Learning (ICML'11), Bellevue (Washington) : United States (2011)

  23. arXiv:0909.1933  [pdf, ps, other

    cs.LG math.ST stat.ML

    Chromatic PAC-Bayes Bounds for Non-IID Data: Applications to Ranking and Stationary $β$-Mixing Processes

    Authors: Liva Ralaivola, Marie Szafranski, Guillaume Stempfel

    Abstract: Pac-Bayes bounds are among the most accurate generalization bounds for classifiers learned from independently and identically distributed (IID) data, and it is particularly so for margin classifiers: there have been recent contributions showing how practical these bounds can be either to perform model selection (Ambroladze et al., 2007) or even to directly guide the learning of linear classifiers… ▽ More

    Submitted 4 June, 2010; v1 submitted 10 September, 2009; originally announced September 2009.

    Comments: Long version of the AISTATS 09 paper: http://jmlr.csail.mit.edu/proceedings/papers/v5/ralaivola09a/ralaivola09a.pdf

  24. arXiv:q-bio/0603006  [pdf, ps, other

    q-bio.QM

    The pharmacophore kernel for virtual screening with support vector machines

    Authors: Pierre Mahé, Liva Ralaivola, Véronique Stoven, Jean-Philippe Vert

    Abstract: We introduce a family of positive definite kernels specifically optimized for the manipulation of 3D structures of molecules with kernel methods. The kernels are based on the comparison of the three-points pharmacophores present in the 3D structures of molecul es, a set of molecular features known to be particularly relevant for virtual screening applications. We present a computationally demand… ▽ More

    Submitted 3 March, 2006; originally announced March 2006.