Showing 1–43 of 43 results for author: Mairal, J

Searching in archive stat.
  1. arXiv:2503.17117  [pdf, other]

    astro-ph.IM astro-ph.EP cs.CV cs.LG stat.AP

    A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations

    Authors: Théo Bodrito, Olivier Flasseur, Julien Mairal, Jean Ponce, Maud Langlois, Anne-Marie Lagrange

    Abstract: The search for exoplanets is an active field in astronomy, with direct imaging as one of the most challenging methods due to faint exoplanet signals buried within stronger residual starlight. Successful detection requires advanced image processing to separate the exoplanet signal from this nuisance component. This paper presents a novel statistical model that captures nuisance fluctuations using a…

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025

  2. arXiv:2403.20233  [pdf, other]

    stat.ML cs.LG

    Functional Bilevel Optimization for Machine Learning

    Authors: Ieva Petrulionyte, Julien Mairal, Michael Arbel

    Abstract: In this paper, we introduce a new functional point of view on bilevel optimization problems for machine learning, where the inner objective is minimized over a function space. These types of problems are most often solved by using methods developed in the parametric setting, where the inner objective is strongly convex with respect to the parameters of the prediction function. The functional point…

    Submitted 6 December, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2202.13733  [pdf, other]

    stat.ML cs.LG math.OC

    On the Benefits of Large Learning Rates for Kernel Methods

    Authors: Gaspard Beugnot, Julien Mairal, Alessandro Rudi

    Abstract: This paper studies an intriguing phenomenon related to the good generalization performance of estimators obtained by using large learning rates within gradient descent algorithms. First observed in the deep learning literature, we show that a phenomenon can be precisely characterized in the context of kernel methods, even though the resulting optimization problem is convex. Specifically, we consid…

    Submitted 3 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: Accepted paper at Conference COLT 2022. To be published to Proceedings of Machine Learning Research (PMLR)

  4. arXiv:2106.08855  [pdf, other]

    cs.LG stat.ML

    Beyond Tikhonov: Faster Learning with Self-Concordant Losses via Iterative Regularization

    Authors: Gaspard Beugnot, Julien Mairal, Alessandro Rudi

    Abstract: The theory of spectral filtering is a remarkable tool to understand the statistical properties of learning with kernels. For least squares, it allows to derive various regularization schemes that yield faster convergence rates of the excess risk than with Tikhonov regularization. This is typically achieved by leveraging classical assumptions called source and capacity conditions, which characteriz…

    Submitted 10 November, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: To be published in NeurIPS 2021

  5. arXiv:2006.12065  [pdf, other]

    cs.LG stat.ML

    A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

    Authors: Grégoire Mialon, Dexiong Chen, Alexandre d'Aspremont, Julien Mairal

    Abstract: We address the problem of learning on sets of features, motivated by the need of performing pooling operations in long biological sequences of varying sizes, with long-range dependencies, and possibly few labeled data. To address this challenging task, we introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal…

    Submitted 9 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: ICLR 2021

  6. arXiv:2004.11722  [pdf, other]

    stat.ML cs.LG

    Counterfactual Learning of Stochastic Policies with Continuous Actions

    Authors: Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal

    Abstract: Counterfactual reasoning from logged data has become increasingly important for many applications such as web advertising or healthcare. In this paper, we address the problem of learning stochastic policies with continuous actions from the viewpoint of counterfactual risk minimization (CRM). While the CRM framework is appealing and well studied for discrete actions, the continuous action case rais…

    Submitted 21 February, 2025; v1 submitted 22 April, 2020; originally announced April 2020.

  7. arXiv:2003.05189  [pdf, other]

    stat.ML cs.LG

    Convolutional Kernel Networks for Graph-Structured Data

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: We introduce a family of multilayer graph kernels and establish new links between graph convolutional neural networks and kernel methods. Our approach generalizes convolutional kernel networks to graph-structured data, by representing graphs as a sequence of kernel feature maps, where each node carries information about local graph substructures. On the one hand, the kernel point of view offers an…

    Submitted 29 June, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Report number: hal-02151135

    Journal ref: International Conference on Machine Learning (ICML), Jul 2020

  8. arXiv:1912.08165  [pdf, other]

    stat.ML cs.LG

    Cyanure: An Open-Source Toolbox for Empirical Risk Minimization for Python, C++, and soon more

    Authors: Julien Mairal

    Abstract: Cyanure is an open-source C++ software package with a Python interface. The goal of Cyanure is to provide state-of-the-art solvers for learning linear models, based on stochastic variance-reduced stochastic optimization with acceleration mechanisms. Cyanure can handle a large variety of loss functions (logistic, square, squared hinge, multinomial logistic) and regularization functions (l_2, l_1, e…

    Submitted 20 December, 2019; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: http://julien.mairal.org/cyanure/welcome.html

  9. arXiv:1912.02566  [pdf, other]

    cs.LG stat.ML

    Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

    Authors: Grégoire Mialon, Alexandre d'Aspremont, Julien Mairal

    Abstract: We design simple screening tests to automatically discard data samples in empirical risk minimization without losing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, and propose a general method to design screening tests for classification or regressi…

    Submitted 12 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: AISTATS 2020

  10. arXiv:1906.03200  [pdf, other]

    stat.ML cs.LG

    Recurrent Kernel Networks

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links between recurrent neural networks (RNNs) and substring kernels have recently been drawn, by formally showing that RNNs with specific activation functions were points…

    Submitted 17 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Report number: hal-02151135

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  11. arXiv:1906.01164  [pdf, other]

    math.OC cs.LG stat.ML

    A Generic Acceleration Framework for Stochastic Composite Optimization

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we introduce various mechanisms to obtain accelerated first-order stochastic optimization algorithms when the objective function is convex or strongly convex. Specifically, we extend the Catalyst approach originally designed for deterministic objectives to the stochastic setting. Given an optimization method with mild convergence guarantees for strongly convex problems, the challeng…

    Submitted 9 October, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  12. arXiv:1905.12173  [pdf, other]

    stat.ML cs.LG

    On the Inductive Bias of Neural Tangent Kernels

    Authors: Alberto Bietti, Julien Mairal

    Abstract: State-of-the-art neural networks are heavily over-parameterized, making the optimization algorithm a crucial ingredient for learning predictive models with good generalization properties. A recent line of work has shown that in a certain over-parameterized regime, the learning dynamics of gradient descent are governed by a certain kernel obtained at initialization, called the neural tangent kernel…

    Submitted 31 October, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019

  13. arXiv:1905.02374  [pdf, other]

    stat.ML cs.LG math.OC

    Estimate Sequences for Variance-Reduced Stochastic Composite Optimization

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. This point of view covers the stochastic gradient descent method, variants of the approaches SAGA, SVRG, and has several advantages: (i) we provide a generic proof of convergence for the aforementioned methods; (ii)…

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: short version of preprint arXiv:1901.08788

    Journal ref: International Conference on Machine Learning (ICML), Jun 2019, Long Beach, United States

  14. arXiv:1901.08788  [pdf, other]

    stat.ML cs.LG math.OC

    Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. More precisely, we interpret a large class of stochastic optimization methods as procedures that iteratively minimize a surrogate of the objective, which covers the stochastic gradient descent method and variants of…

    Submitted 4 September, 2020; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Journal of Machine Learning Research, Microtome Publishing, In press

  15. arXiv:1810.00363  [pdf, other]

    stat.ML cs.LG

    A Kernel Perspective for Regularizing Deep Neural Networks

    Authors: Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

    Abstract: We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various practical strategies. Specifically, this perspective (i) provides a common umbrella for many existing regularization principles, including spectral norm and gradient…

    Submitted 13 May, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: ICML

  16. arXiv:1809.06035  [pdf, other]

    stat.ML cs.CV cs.LG q-bio.QM

    Extracting representations of cognition across neuroimaging studies improves brain decoding

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gaël Varoquaux

    Abstract: Cognitive brain imaging is accumulating datasets about the neural substrate of many different mental processes. Yet, most studies are based on few subjects and have low statistical power. Analyzing data across studies could bring more statistical power; yet the current brain-imaging analytic framework cannot be used at scale as it requires casting all cognitive tasks in a unified theoretical frame…

    Submitted 19 May, 2021; v1 submitted 17 September, 2018; originally announced September 2018.

    Journal ref: PLoS Computational Biology, Public Library of Science, 2021

  17. arXiv:1805.11155  [pdf, other]

    stat.ML cs.CV cs.LG

    Unsupervised Learning of Artistic Styles with Archetypal Style Analysis

    Authors: Daan Wynen, Cordelia Schmid, Julien Mairal

    Abstract: In this paper, we introduce an unsupervised learning approach to automatically discover, summarize, and manipulate artistic styles from large collections of paintings. Our method is based on archetypal analysis, which is an unsupervised learning technique akin to sparse coding with a geometric interpretation. When applied to deep image representations from a collection of artworks, it learns a dic…

    Submitted 2 October, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted at NIPS 2018, Montréal, Canada

  18. arXiv:1712.05654  [pdf, other]

    stat.ML math.OC

    Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice

    Authors: Hongzhou Lin, Julien Mairal, Zaid Harchaoui

    Abstract: We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective function, and consists of approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. One of the keys to achieve acceleration…

    Submitted 19 June, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: link to publisher website: http://jmlr.org/papers/volume18/17-748/17-748.pdf

    Journal ref: Journal of Machine Learning Research (JMLR), 18(212):1--54, 2018

  19. arXiv:1710.11438  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Learning Neural Representations of Human Cognition across Many fMRI Studies

    Authors: Arthur Mensch, Julien Mairal, Danilo Bzdok, Bertrand Thirion, Gaël Varoquaux

    Abstract: Cognitive neuroscience is enjoying rapid increase in extensive public brain-imaging datasets. It opens the door to large-scale statistical models. Finding a unified perspective for all available data calls for scalable and automated solutions to an old challenge: how to aggregate heterogeneous information on brain function into a universal cognitive system that relates mental operations/cognitive…

    Submitted 10 November, 2017; v1 submitted 31 October, 2017; originally announced October 2017.

    Comments: Advances in Neural Information Processing Systems, Dec 2017, Long Beach, United States. 2017

    Journal ref: Advances in Neural Information Processing Systems, 2017

  20. arXiv:1706.03078  [pdf, other]

    stat.ML cs.LG

    Group Invariance, Stability to Deformations, and Complexity of Deep Convolutional Representations

    Authors: Alberto Bietti, Julien Mairal

    Abstract: The success of deep convolutional architectures is often attributed in part to their ability to learn multiscale and invariant representations of natural signals. However, a precise study of these properties and how they affect learning guarantees is still missing. In this paper, we consider deep convolutional representations of signals; we study their invariance to translations and to more genera…

    Submitted 10 October, 2018; v1 submitted 9 June, 2017; originally announced June 2017.

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-49

  21. arXiv:1703.10993  [pdf, other]

    stat.ML math.OC

    Catalyst Acceleration for Gradient-Based Non-Convex Optimization

    Authors: Courtney Paquette, Hongzhou Lin, Dmitriy Drusvyatskiy, Julien Mairal, Zaid Harchaoui

    Abstract: We introduce a generic scheme to solve nonconvex optimization problems using gradient-based algorithms originally designed for minimizing convex functions. Even though these methods may originally require convexity to operate, the proposed approach allows one to use them on weakly convex objectives, which covers a large class of non-convex functions typically appearing in machine learning and sign…

    Submitted 31 December, 2018; v1 submitted 31 March, 2017; originally announced March 2017.

  22. arXiv:1701.05363  [pdf, other]

    stat.ML cs.LG math.OC q-bio.NC

    Stochastic Subsampling for Factorizing Huge Matrices

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gael Varoquaux

    Abstract: We present a matrix-factorization algorithm that scales to input matrices with both huge number of rows and columns. Learned factors may be sparse or dense and/or non-negative, which makes our algorithm suitable for dictionary learning, sparse component analysis, and non-negative matrix factorization. Our algorithm streams matrix columns while subsampling them to iteratively learn the matrix facto…

    Submitted 30 October, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, to appear

    Journal ref: IEEE Transactions on Signal Processing, 2018, 66 (1), pp 113-128

  23. arXiv:1611.10041  [pdf, other]

    math.OC cs.LG stat.ML

    Subsampled online matrix factorization with convergence guarantees

    Authors: Arthur Mensch, Julien Mairal, Gaël Varoquaux, Bertrand Thirion

    Abstract: We present a matrix factorization algorithm that scales to input matrices that are large in both dimensions (i.e., that contain more than 1TB of data). The algorithm streams the matrix columns while subsampling them, resulting in low complexity per iteration and reasonable memory footprint. In contrast to previous online matrix factorization methods, our approach relies on low-dimensional statistic…

    Submitted 30 November, 2016; originally announced November 2016.

    Journal ref: 9th NIPS Workshop on Optimization for Machine Learning, Dec 2016, Barcelona, Spain

  24. arXiv:1610.00970  [pdf, other]

    stat.ML cs.LG math.OC

    Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite-Sum Structure

    Authors: Alberto Bietti, Julien Mairal

    Abstract: Stochastic optimization algorithms with variance reduction have proven successful for minimizing large finite sums of functions. Unfortunately, these techniques are unable to deal with stochastic perturbations of input data, induced for example by data augmentation. In such cases, the objective is no longer a finite sum, and the main candidate for optimization is the stochastic gradient descent me…

    Submitted 15 November, 2017; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: Advances in Neural Information Processing Systems (NIPS), Dec 2017, Long Beach, CA, United States

  25. arXiv:1610.00960  [pdf, other]

    stat.ML math.OC

    An Inexact Variable Metric Proximal Point Algorithm for Generic Quasi-Newton Acceleration

    Authors: Hongzhou Lin, Julien Mairal, Zaid Harchaoui

    Abstract: We propose an inexact variable-metric proximal point algorithm to accelerate gradient-based optimization algorithms. The proposed scheme, called QNing can be notably applied to incremental first-order methods such as the stochastic variance-reduced gradient descent algorithm (SVRG) and other randomized incremental optimization algorithms. QNing is also compatible with composite objectives, meaning…

    Submitted 29 January, 2019; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: to appear in SIAM Journal on Optimization

  26. arXiv:1605.06265  [pdf, other]

    stat.ML cs.CV cs.LG

    End-to-End Kernel Learning with Supervised Convolutional Kernel Networks

    Authors: Julien Mairal

    Abstract: In this paper, we introduce a new image representation based on a multilayer kernel machine. Unlike traditional kernel methods where data representation is decoupled from the prediction task, we learn how to shape the kernel with supervision. We proceed by first proposing improvements of the recently-introduced convolutional kernel networks (CKNs) in the context of unsupervised learning; then, we…

    Submitted 25 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: to appear in Advances in Neural Information Processing Systems (NIPS)

  27. arXiv:1605.00937  [pdf, other]

    stat.ML cs.LG q-bio.QM

    Dictionary Learning for Massive Matrix Factorization

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gaël Varoquaux

    Abstract: Sparse matrix factorization is a popular tool to obtain interpretable data decompositions, which are also effective to perform data completion or denoising. Its applicability to large datasets has been addressed with online and randomized methods, that reduce the complexity in one of the matrix dimension, but not in both of them. In this paper, we tackle very large matrices in both dimensions. We…

    Submitted 26 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Journal ref: Proceedings of the International Conference on Machine Learning, 2016, pp 1737-1746

  28. arXiv:1602.02263  [pdf, other]

    math.OC cs.IT cs.LG stat.ML

    DOLPHIn - Dictionary Learning for Phase Retrieval

    Authors: Andreas M. Tillmann, Yonina C. Eldar, Julien Mairal

    Abstract: We propose a new algorithm to learn a dictionary for reconstructing and sparsely encoding signals from measurements without phase. Specifically, we consider the task of estimating a two-dimensional image from squared-magnitude measurements of a complex-valued linear transformation of the original image. Several recent phase retrieval algorithms exploit underlying sparsity of the unknown signal in…

    Submitted 3 August, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

  29. arXiv:1406.3332  [pdf, ps, other]

    cs.CV cs.LG stat.ML

    Convolutional Kernel Networks

    Authors: Julien Mairal, Piotr Koniusz, Zaid Harchaoui, Cordelia Schmid

    Abstract: An important goal in visual recognition is to devise image representations that are invariant to particular transformations. In this paper, we address this goal with a new type of convolutional neural network (CNN) whose invariance is encoded by a reproducing kernel. Unlike traditional approaches where neural networks are learned either to represent data or for solving a classification task, our n…

    Submitted 14 November, 2014; v1 submitted 12 June, 2014; originally announced June 2014.

    Comments: appears in Advances in Neural Information Processing Systems (NIPS), Dec 2014, Montreal, Canada, http://nips.cc

  30. arXiv:1405.6472  [pdf, other]

    cs.CV cs.LG stat.ML

    Fast and Robust Archetypal Analysis for Representation Learning

    Authors: Yuansi Chen, Julien Mairal, Zaid Harchaoui

    Abstract: We revisit a pioneer unsupervised learning technique called archetypal analysis, which is related to successful data analysis methods such as sparse coding and non-negative matrix factorization. Since it was proposed, archetypal analysis did not gain a lot of popularity even though it produces more interpretable models than other alternatives. Because no efficient implementation has ever been made…

    Submitted 26 May, 2014; originally announced May 2014.

    Journal ref: CVPR 2014 - IEEE Conference on Computer Vision & Pattern Recognition (2014)

  31. arXiv:1402.4419  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning

    Authors: Julien Mairal

    Abstract: Majorization-minimization algorithms consist of successively minimizing a sequence of upper bounds of the objective function. These upper bounds are tight at the current estimate, and each iteration monotonically drives the objective function downhill. Such a simple principle is widely applicable and has been very popular in various scientific fields, especially in signal processing and statistics…

    Submitted 1 February, 2015; v1 submitted 18 February, 2014; originally announced February 2014.

    Comments: to appear in SIAM Journal on Optimization; final author's version

  32. arXiv:1306.4650  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization

    Authors: Julien Mairal

    Abstract: Majorization-minimization algorithms consist of iteratively minimizing a majorizing surrogate of an objective function. Because of its simplicity and its wide applicability, this principle has been very popular in statistics and in signal processing. In this paper, we intend to make this principle scalable. We introduce a stochastic majorization-minimization scheme which is able to deal with large…

    Submitted 10 September, 2013; v1 submitted 19 June, 2013; originally announced June 2013.

    Comments: accepted for publication for Neural Information Processing Systems (NIPS) 2013. This is the 9-pages version followed by 16 pages of appendices. The title has changed compared to the first technical report

  33. arXiv:1305.3120  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Optimization with First-Order Surrogate Functions

    Authors: Julien Mairal

    Abstract: In this paper, we study optimization methods consisting of iteratively minimizing surrogates of an objective function. By proposing several algorithmic variants and simple convergence analyses, we make two main contributions. First, we provide a unified viewpoint for several first-order optimization techniques such as accelerated proximal gradient, block coordinate descent, or Frank-Wolfe algorith…

    Submitted 14 May, 2013; originally announced May 2013.

    Comments: to appear in the proceedings of ICML 2013; the arxiv paper contains the 9 pages main text followed by 26 pages of supplemental material. International Conference on Machine Learning (ICML 2013) (2013)

  34. arXiv:1205.0079  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Complexity Analysis of the Lasso Regularization Path

    Authors: Julien Mairal, Bin Yu

    Abstract: The regularization path of the Lasso can be shown to be piecewise linear, making it possible to "follow" and explicitly compute the entire path. We analyze in this paper this popular strategy, and prove that its worst case complexity is exponential in the number of variables. We then oppose this pessimistic result to an (optimistic) approximate analysis: We show that an approximate path with at mo…

    Submitted 19 May, 2012; v1 submitted 30 April, 2012; originally announced May 2012.

    Comments: To appear in the proceedings of 29th International Conference on Machine Learning (ICML 2012)

  35. arXiv:1204.4539  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows

    Authors: Julien Mairal, Bin Yu

    Abstract: We consider supervised learning problems where the features are embedded in a graph, such as gene expressions in a gene network. In this context, it is of much interest to automatically select a subgraph with few connected components; by exploiting prior knowledge, one can indeed improve the prediction performance or obtain results that are easier to interpret. Regularization or penalty functions…

    Submitted 29 August, 2013; v1 submitted 20 April, 2012; originally announced April 2012.

    Comments: 37 pages; to appear in the Journal of Machine Learning Research (JMLR)

    Journal ref: Journal of Machine Learning Research 14(Aug) (2013) 2449-2485

  36. arXiv:1110.2855  [pdf, other]

    cs.LG cs.CV stat.ML

    Sparse Image Representation with Epitomes

    Authors: Louise Benoît, Julien Mairal, Francis Bach, Jean Ponce

    Abstract: Sparse coding, which is the decomposition of a vector using only a few basis elements, is widely used in machine learning and image processing. The basis set, also called dictionary, is learned to adapt to specific data. This approach has proven to be very effective in many image processing tasks. Traditionally, the dictionary is an unstructured "flat" set of atoms. In this paper, we study structu…

    Submitted 13 October, 2011; originally announced October 2011.

    Comments: Computer Vision and Pattern Recognition, Colorado Springs : United States (2011)

    Journal ref: Computer Vision and Pattern Recognition, Colorado Springs, United States (2011)

  37. arXiv:1109.2397  [pdf, ps, other]

    cs.LG stat.ML

    Structured sparsity through convex optimization

    Authors: Francis Bach, Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski

    Abstract: Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. While naturally cast as a combinatorial optimization problem, variable or feature selection admits a convex relaxation through the regularization by the $\ell_1$-norm. In this paper, we consider situations where we are not only interested in sparsity, but where some structural prior knowledge…

    Submitted 20 April, 2012; v1 submitted 12 September, 2011; originally announced September 2011.

    Comments: Statistical Science (2012) To appear

  38. arXiv:1108.0775  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Optimization with Sparsity-Inducing Penalties

    Authors: Francis Bach, Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski

    Abstract: Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel selection. It turns out that many of the related estimation problems can be cast as convex optimization problems by regularizing the empirical risk with appropr…

    Submitted 22 November, 2011; v1 submitted 3 August, 2011; originally announced August 2011.

  39. arXiv:1104.1872  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Convex and Network Flow Optimization for Structured Sparsity

    Authors: Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

    Abstract: We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of l_2- or l_infinity-norms over groups of variables. Whereas much effort has been put in developing fast optimization techniques when the groups are disjoint or embedded in a hierarchy, we address here the case of general overlapping groups. To this end, we present two different strategi…

    Submitted 16 September, 2011; v1 submitted 11 April, 2011; originally announced April 2011.

    Comments: to appear in the Journal of Machine Learning Research (JMLR)

    Journal ref: Journal of Machine Learning Research 12 (2011) 2681-2720

  40. Task-Driven Dictionary Learning

    Authors: Julien Mairal, Francis Bach, Jean Ponce

    Abstract: Modeling data with linear combinations of a few elements from a learned dictionary has been the focus of much recent research in machine learning, neuroscience and signal processing. For signals such as natural images that admit such sparse representations, it is now well established that these models are well suited to restoration tasks. In this context, learning the dictionary amounts to solving…

    Submitted 9 September, 2013; v1 submitted 27 September, 2010; originally announced September 2010.

    Comments: final draft post-refereeing

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 4 (2012) 30

  41. arXiv:1009.2139  [pdf, ps, other]

    stat.ML

    Proximal Methods for Hierarchical Sparse Coding

    Authors: Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski, Francis Bach

    Abstract: Sparse coding consists in representing signals as sparse linear combinations of atoms selected from a dictionary. We consider an extension of this framework where the atoms are further assumed to be embedded in a tree. This is achieved using a recently introduced tree-structured sparse regularization norm, which has proven useful in several applications. This norm leads to regularized problems tha…

    Submitted 5 July, 2011; v1 submitted 11 September, 2010; originally announced September 2010.

    Journal ref: Journal of Machine Learning Research, 12 (2011) 2297-2334

  42. arXiv:1008.5209  [pdf, ps, other]

    cs.LG stat.ML

    Network Flow Algorithms for Structured Sparsity

    Authors: Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

    Abstract: We consider a class of learning problems that involve a structured sparsity-inducing norm defined as the sum of $\ell_\infty$-norms over groups of variables. Whereas a lot of effort has been put in developing fast optimization methods when the groups are disjoint or embedded in a specific hierarchical structure, we address here the case of general overlapping groups. To this end, we show that the…

    Submitted 30 August, 2010; originally announced August 2010.

    Comments: accepted for publication in Adv. Neural Information Processing Systems, 2010

    Report number: RR-7372

  43. arXiv:0908.0050  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Online Learning for Matrix Factorization and Sparse Coding

    Authors: Julien Mairal, Francis Bach, Jean Ponce, Guillermo Sapiro

    Abstract: Sparse coding--that is, modelling data vectors as sparse linear combinations of basis elements--is widely used in machine learning, neuroscience, signal processing, and statistics. This paper focuses on the large-scale matrix factorization problem that consists of learning the basis set, adapting it to specific data. Variations of this problem include dictionary learning in signal processing, no…

    Submitted 11 February, 2010; v1 submitted 1 August, 2009; originally announced August 2009.

    Comments: revised version

    Journal ref: Journal of Machine Learning Research 11 (2010) 19--60