Skip to main content

Showing 1–5 of 5 results for author: Nishihara, R

Searching in archive math. Search in all archives.
.
  1. arXiv:1511.06051  [pdf, other

    stat.ML cs.DC cs.LG cs.NE math.OC

    SparkNet: Training Deep Networks in Spark

    Authors: Philipp Moritz, Robert Nishihara, Ion Stoica, Michael I. Jordan

    Abstract: Training deep networks is a time-consuming process, with networks for object recognition often requiring multiple days to train. For this reason, leveraging the resources of a cluster to speed up training is an important area of work. However, widely-popular batch-processing computational frameworks like MapReduce and Spark were not designed to support the asynchronous and communication-intensive… ▽ More

    Submitted 28 February, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: 12 pages, 7 figures

  2. arXiv:1508.02933  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    No Regret Bound for Extreme Bandits

    Authors: Robert Nishihara, David Lopez-Paz, Léon Bottou

    Abstract: Algorithms for hyperparameter optimization abound, all of which work well under different and often unverifiable assumptions. Motivated by the general challenge of sequentially choosing which algorithm to use, we study the more specific task of choosing among distributions to use for random hyperparameter optimization. This work is naturally framed in the extreme bandit setting, which deals with s… ▽ More

    Submitted 11 April, 2016; v1 submitted 12 August, 2015; originally announced August 2015.

    Comments: 11 pages, International Conference on Artificial Intelligence and Statistics, 2016

  3. arXiv:1508.02087  [pdf, other

    math.OC cs.LG math.NA stat.CO stat.ML

    A Linearly-Convergent Stochastic L-BFGS Algorithm

    Authors: Philipp Moritz, Robert Nishihara, Michael I. Jordan

    Abstract: We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a recent approach to variance reduction for stochastic gradient descent from Johnson and Zhang (2013). We demonstrate experimentally that our algorithm performs wel… ▽ More

    Submitted 13 April, 2016; v1 submitted 9 August, 2015; originally announced August 2015.

    Comments: 10 pages, 3 figures in International Conference on Artificial Intelligence and Statistics, 2016

  4. arXiv:1502.02009  [pdf, other

    math.OC math.NA

    A General Analysis of the Convergence of ADMM

    Authors: Robert Nishihara, Laurent Lessard, Benjamin Recht, Andrew Packard, Michael I. Jordan

    Abstract: We provide a new proof of the linear convergence of the alternating direction method of multipliers (ADMM) when one of the objective terms is strongly convex. Our proof is based on a framework for analyzing optimization algorithms introduced in Lessard et al. (2014), reducing algorithm convergence to verifying the stability of a dynamical system. This approach generalizes a number of existing resu… ▽ More

    Submitted 18 May, 2015; v1 submitted 6 February, 2015; originally announced February 2015.

    Comments: 10 pages, 6 figures

    Journal ref: International Conference on Machine Learning 32, 2015

  5. arXiv:1406.6474  [pdf, ps, other

    math.OC cs.DM cs.DS cs.LG math.NA

    On the Convergence Rate of Decomposable Submodular Function Minimization

    Authors: Robert Nishihara, Stefanie Jegelka, Michael I. Jordan

    Abstract: Submodular functions describe a variety of discrete problems in machine learning, signal processing, and computer vision. However, minimizing submodular functions poses a number of algorithmic challenges. Recent work introduced an easy-to-use, parallelizable algorithm for minimizing submodular functions that decompose as the sum of "simple" submodular functions. Empirically, this algorithm perform… ▽ More

    Submitted 5 November, 2014; v1 submitted 25 June, 2014; originally announced June 2014.

    Comments: 17 pages, 3 figures

    Journal ref: Neural Information Processing Systems 27, 2014