Showing 1–12 of 12 results for author: Ghadimi, S

Searching in archive stat.
  1. arXiv:2307.05384  [pdf, other]

    math.OC cs.DS cs.LG stat.ML

    Stochastic Nested Compositional Bi-level Optimization for Robust Feature Learning

    Authors: Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We develop and analyze stochastic approximation algorithms for solving nested compositional bi-level optimization problems. These problems involve a nested composition of $T$ potentially non-convex smooth functions in the upper-level, and a smooth and strongly convex function in the lower-level. Our proposed algorithm does not rely on matrix inversions or mini-batches and can achieve an $\epsilon$-statio…

    Submitted 11 July, 2023; originally announced July 2023.

  2. arXiv:2302.09766  [pdf, other]

    math.OC cs.DC cs.LG stat.ML

    A One-Sample Decentralized Proximal Algorithm for Non-Convex Stochastic Composite Optimization

    Authors: Tesi Xiao, Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We focus on decentralized stochastic non-convex optimization, where $n$ agents work together to optimize a composite objective function which is a sum of a smooth term and a non-smooth convex term. To solve this problem, we propose two single-time scale algorithms: Prox-DASA and Prox-DASA-GT. These algorithms can find $\epsilon$-stationary points in $\mathcal{O}(n^{-1}\epsilon^{-2})$ iterations using constant b…

    Submitted 22 June, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: UAI 2023

  3. arXiv:2206.11346  [pdf, other]

    math.OC cs.LG stat.ML

    Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We study stochastic optimization algorithms for constrained nonconvex stochastic optimization problems with Markovian data. In particular, we focus on the case when the transition kernel of the Markov chain is state-dependent. Such stochastic optimization problems arise in various machine learning problems including strategic classification and reinforcement learning. For this problem, we study bo…

    Submitted 8 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 2 figures

  4. arXiv:2202.04296  [pdf, ps, other]

    math.OC math.ST stat.ML

    A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization

    Authors: Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We propose a projection-free conditional gradient-type algorithm for smooth stochastic multi-level composition optimization, where the objective function is a nested composition of $T$ functions and the constraint set is a closed convex set. Our algorithm assumes access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle satisfying certain standard un…

    Submitted 9 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: To appear in NeurIPS 2022

  5. arXiv:2009.13016  [pdf, ps, other]

    stat.ML cs.LG math.OC math.ST

    Escaping Saddle-Points Faster under Interpolation-like Conditions

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

    Abstract: In this paper, we show that under over-parametrization several standard stochastic optimization algorithms escape saddle-points and converge to local-minimizers much faster. One of the fundamental aspects of over-parametrized models is that they are capable of interpolating the training data. We show that, under interpolation-like assumptions satisfied by the stochastic gradients in an over-parame…

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: To appear in NeurIPS, 2020

  6. arXiv:2008.10526  [pdf, other]

    math.OC cs.DS cs.LG math.ST stat.ML

    Stochastic Multi-level Composition Optimization Algorithms with Level-Independent Convergence Rates

    Authors: Krishnakumar Balasubramanian, Saeed Ghadimi, Anthony Nguyen

    Abstract: In this paper, we study smooth stochastic multi-level composition optimization problems, where the objective function is a nested composition of $T$ functions. We assume access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle. For solving this class of problems, we propose two algorithms using moving-average stochastic estimates, and analyze their…

    Submitted 14 February, 2022; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Fixed some typos

  7. arXiv:2006.08167  [pdf, other]

    math.OC cs.LG stat.ML

    Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions

    Authors: Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We analyze stochastic conditional gradient methods for constrained optimization problems arising in over-parametrized machine learning. We show that one could leverage the interpolation-like conditions satisfied by such models to obtain improved oracle complexities. Specifically, when the objective function is convex, we show that the conditional gradient method requires $\mathcal{O}(\epsilon^{-2})$ call…

    Submitted 26 January, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

  8. arXiv:1907.13616  [pdf, ps, other]

    stat.ML cs.DS cs.LG math.OC math.ST

    Multi-Point Bandit Algorithms for Nonstationary Online Nonconvex Optimization

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

    Abstract: Bandit algorithms have been predominantly analyzed in the convex setting with function-value based stationary regret as the performance measure. In this paper, motivated by online reinforcement learning problems, we propose and analyze bandit algorithms for both general and structured nonconvex problems with nonstationary (or dynamic) regret as the performance measure, in both stochastic and non-s…

    Submitted 11 September, 2019; v1 submitted 31 July, 2019; originally announced July 2019.

  9. arXiv:1902.01373  [pdf, ps, other]

    math.ST math.OC stat.ML

    Stochastic Zeroth-order Discretizations of Langevin Diffusions for Bayesian Inference

    Authors: Abhishek Roy, Lingqing Shen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: Discretizations of Langevin diffusions provide a powerful method for sampling and Bayesian inference. However, such discretizations require evaluation of the gradient of the potential function. In several real-world scenarios, obtaining gradient evaluations might either be computationally expensive, or simply impossible. In this work, we propose and analyze stochastic zeroth-order sampling algorit…

    Submitted 17 January, 2021; v1 submitted 4 February, 2019; originally announced February 2019.

  10. arXiv:1809.06474  [pdf, ps, other]

    math.OC cs.DS cs.LG math.ST stat.ML

    Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points

    Authors: Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: In this paper, we propose and analyze zeroth-order stochastic approximation algorithms for nonconvex and convex optimization, with a focus on addressing constrained optimization, high-dimensional setting and saddle-point avoiding. To handle constrained optimization, we first propose generalizations of the conditional gradient algorithm achieving rates similar to the standard stochastic gradient al…

    Submitted 13 January, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

  11. arXiv:1508.07384  [pdf, ps, other]

    math.OC stat.ML

    Generalized Uniformly Optimal Methods for Nonlinear Programming

    Authors: Saeed Ghadimi, Guanghui Lan, Hongchao Zhang

    Abstract: In this paper, we present a generic framework to extend existing uniformly optimal convex programming algorithms to solve more general nonlinear, possibly nonconvex, optimization problems. The basic idea is to incorporate a local search step (gradient descent or Quasi-Newton iteration) into these uniformly optimal convex programming methods, and then enforce a monotone decreasing property of the f…

    Submitted 12 September, 2015; v1 submitted 28 August, 2015; originally announced August 2015.

  12. arXiv:1309.5549  [pdf, ps, other]

    math.OC cs.CC stat.ML

    Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming

    Authors: Saeed Ghadimi, Guanghui Lan

    Abstract: In this paper, we introduce a new stochastic approximation (SA) type algorithm, namely the randomized stochastic gradient (RSG) method, for solving an important class of nonlinear (possibly nonconvex) stochastic programming (SP) problems. We establish the complexity of this method for computing an approximate stationary point of a nonlinear programming problem. We also show that this method posses…

    Submitted 21 September, 2013; originally announced September 2013.