Showing 1–6 of 6 results for author: Wilson, A C

  1. arXiv:2408.13150  [pdf, other]

    math.OC cs.LG

    Adaptive Backtracking Line Search

    Authors: Joao V. Cavalcanti, Laurent Lessard, Ashia C. Wilson

    Abstract: Backtracking line search is foundational in numerical optimization. The basic idea is to adjust the step-size of an algorithm by a constant factor until some chosen criterion (e.g. Armijo, Descent Lemma) is satisfied. We propose a novel way to adjust step-sizes, replacing the constant factor used in regular backtracking with one that takes into account the degree to which the chosen criterion is v…

    Submitted 26 May, 2025; v1 submitted 23 August, 2024; originally announced August 2024.

  2. arXiv:1701.03863  [pdf, other]

    math.OC math.NA

    Breaking Locality Accelerates Block Gauss-Seidel

    Authors: Stephen Tu, Shivaram Venkataraman, Ashia C. Wilson, Alex Gittens, Michael I. Jordan, Benjamin Recht

    Abstract: Recent work by Nesterov and Stich showed that momentum can be used to accelerate the rate of convergence for block Gauss-Seidel in the setting where a fixed partitioning of the coordinates is chosen ahead of time. We show that this setting is too restrictive, constructing instances where breaking locality by running non-accelerated Gauss-Seidel with randomly sampled coordinates substantially outpe…

    Submitted 24 September, 2017; v1 submitted 13 January, 2017; originally announced January 2017.

    Comments: Presented at the 34th International Conference on Machine Learning (ICML 2017)

  3. arXiv:1611.02635  [pdf, ps, other]

    math.OC cs.DS

    A Lyapunov Analysis of Momentum Methods in Optimization

    Authors: Ashia C. Wilson, Benjamin Recht, Michael I. Jordan

    Abstract: Momentum methods play a significant role in optimization. Examples include Nesterov's accelerated gradient method and the conditional gradient algorithm. Several momentum methods are provably optimal under standard oracle models, and all use a technique called estimate sequences to analyze their convergence properties. The technique of estimate sequences has long been considered difficult to under…

    Submitted 12 March, 2018; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: Major revision. Cleaned up presentation and added results

  4. arXiv:1603.04245  [pdf, ps, other]

    math.OC cs.LG stat.ML

    A Variational Perspective on Accelerated Methods in Optimization

    Authors: Andre Wibisono, Ashia C. Wilson, Michael I. Jordan

    Abstract: Accelerated gradient methods play a central role in optimization, achieving optimal rates in many settings. While many generalizations and extensions of Nesterov's original acceleration method have been proposed, it is not yet clear what is the natural scope of the acceleration concept. In this paper, we study accelerated methods from a continuous-time perspective. We show that there is a Lagrangi…

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: 38 pages. Subsumes an earlier working draft arXiv:1509.03616

  5. arXiv:1509.03616  [pdf, other]

    math.OC

    On Accelerated Methods in Optimization

    Authors: Andre Wibisono, Ashia C. Wilson

    Abstract: In convex optimization, there is an acceleration phenomenon in which we can boost the convergence rate of certain gradient-based algorithms. We can observe this phenomenon in Nesterov's accelerated gradient descent, accelerated mirror descent, and accelerated cubic-regularized Newton's method, among others. In this paper, we show that the family of higher-order gradient methods in discrete t…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: 42 pages, 2 figures

  6. arXiv:1410.6843  [pdf, ps, other]

    math.ST stat.ME

    Posteriors, conjugacy, and exponential families for completely random measures

    Authors: Tamara Broderick, Ashia C. Wilson, Michael I. Jordan

    Abstract: We demonstrate how to calculate posteriors for general CRM-based priors and likelihoods for Bayesian nonparametric models. We further show how to represent Bayesian nonparametric priors as a sequence of finite draws using a size-biasing approach---and how to represent full Bayesian nonparametric models via finite marginals. Motivated by conjugate priors based on exponential family representations…

    Submitted 22 April, 2016; v1 submitted 24 October, 2014; originally announced October 2014.

    Comments: 42 pages