Showing 1–7 of 7 results for author: Sebbouh, O

Searching in archive math.
  1. arXiv:2311.05788  [pdf, other]

    cs.LG math.OC stat.ML

    Structured Transforms Across Spaces with Cost-Regularized Optimal Transport

    Authors: Othmane Sebbouh, Marco Cuturi, Gabriel Peyré

    Abstract: Matching a source to a target probability measure is often solved by instantiating a linear optimal transport (OT) problem, parameterized by a ground cost function that quantifies discrepancy between points. When these measures live in the same metric space, the ground cost often defaults to its distance. When instantiated across two different spaces, however, choosing that cost in the absence of…

    Submitted 23 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

  2. arXiv:2111.13162  [pdf, other]

    cs.LG math.OC stat.ML

    Randomized Stochastic Gradient Descent Ascent

    Authors: Othmane Sebbouh, Marco Cuturi, Gabriel Peyré

    Abstract: An increasing number of machine learning problems, such as robust or adversarial variants of existing algorithms, require minimizing a loss function that is itself defined as a maximum. Carrying a loop of stochastic gradient ascent (SGA) steps on the (inner) maximization problem, followed by an SGD step on the (outer) minimization, is known as Epoch Stochastic Gradient Descent Ascent (ESG…

    Submitted 25 November, 2021; originally announced November 2021.

  3. arXiv:2006.11573  [pdf, other]

    cs.LG math.OC stat.ML

    Unified Analysis of Stochastic Gradient Methods for Composite Convex and Smooth Optimization

    Authors: Ahmed Khaled, Othmane Sebbouh, Nicolas Loizou, Robert M. Gower, Peter Richtárik

    Abstract: We present a unified theorem for the convergence analysis of stochastic gradient algorithms for minimizing a smooth and convex loss plus a convex regularizer. We do this by extending the unified analysis of Gorbunov, Hanzely & Richtárik (2020) and dropping the requirement that the loss function be strongly convex. Instead, we only rely on convexity of the loss function. Our unified analysis appli…

    Submitted 20 June, 2020; originally announced June 2020.

  4. arXiv:2006.10311  [pdf, other]

    math.OC cs.LG stat.ML

    SGD for Structured Nonconvex Functions: Learning Rates, Minibatching and Interpolation

    Authors: Robert M. Gower, Othmane Sebbouh, Nicolas Loizou

    Abstract: Stochastic Gradient Descent (SGD) is being used routinely for optimizing non-convex functions. Yet, the standard convergence theory for SGD in the smooth non-convex setting gives a slow sublinear convergence to a stationary point. In this work, we provide several convergence theorems for SGD showing convergence to a global minimum for non-convex problems satisfying some extra structural assumption…

    Submitted 22 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  5. arXiv:2006.07867  [pdf, other]

    cs.LG math.OC stat.ML

    Almost sure convergence rates for Stochastic Gradient Descent and Stochastic Heavy Ball

    Authors: Othmane Sebbouh, Robert M. Gower, Aaron Defazio

    Abstract: We study stochastic gradient descent (SGD) and the stochastic heavy ball method (SHB, otherwise known as the momentum method) for the general stochastic approximation problem. For SGD, in the convex and smooth setting, we provide the first almost sure asymptotic convergence rates for a weighted average of the iterates. More precisely, we show that the convergence rate of the funct…

    Submitted 5 February, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

  6. arXiv:1908.02725  [pdf, other]

    math.OC cs.LG

    Towards closing the gap between the theory and practice of SVRG

    Authors: Othmane Sebbouh, Nidham Gazagnadou, Samy Jelassi, Francis Bach, Robert M. Gower

    Abstract: Among the very first variance reduced stochastic methods for solving the empirical risk minimization problem was the SVRG method (Johnson & Zhang 2013). SVRG is an inner-outer loop based method, where in the outer loop a reference full gradient is evaluated, after which $m \in \mathbb{N}$ steps of an inner loop are executed where the reference gradient is used to build a variance reduced estimate…

    Submitted 2 July, 2021; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: 39 pages, 23 figures

    MSC Class: 90C15; 90C25; 68W20

  7. arXiv:1907.02710  [pdf, ps, other]

    math.OC

    Nesterov's acceleration and Polyak's heavy ball method in continuous time: convergence rate analysis under geometric conditions and perturbations

    Authors: Othmane Sebbouh, Charles Dossal, Aude Rondepierre

    Abstract: In this article, a family of second-order ODEs associated with inertial gradient descent is studied. These ODEs are widely used to build trajectories converging to a minimizer $x^*$ of a function $F$, possibly convex. This family includes the continuous version of the Nesterov inertial scheme and the continuous heavy ball method. Several damping parameters, not necessarily vanishing, and a perturbati…

    Submitted 5 July, 2019; originally announced July 2019.