
Showing 1–15 of 15 results for author: Metel, M R

  1. arXiv:2506.06990  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Modified K-means Algorithm with Local Optimality Guarantees

    Authors: Mingyi Li, Michael R. Metel, Akiko Takeda

    Abstract: The K-means algorithm is one of the most widely studied clustering algorithms in machine learning. While extensive research has focused on its ability to achieve a globally optimal solution, there still lacks a rigorous analysis of its local optimality guarantees. In this paper, we first present conditions under which the K-means algorithm converges to a locally optimal solution. Based on this, we…

    Submitted 11 June, 2025; v1 submitted 8 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  2. arXiv:2303.15464  [pdf, other]

    cs.LG cs.AI math.ST stat.ML

    Mathematical Challenges in Deep Learning

    Authors: Vahid Partovi Nia, Guojun Zhang, Ivan Kobyzev, Michael R. Metel, Xinlin Li, Ke Sun, Sobhan Hemati, Masoud Asgharian, Linglong Kong, Wulong Liu, Boxing Chen

    Abstract: Deep models are dominating the artificial intelligence (AI) industry since the ImageNet challenge in 2012. The size of deep models is increasing ever since, which brings new challenges to this field with applications in cell phones, personal computers, autonomous cars, and wireless base stations. Here we list a set of problems, ranging from training, inference, generalization bound, and optimizati…

    Submitted 24 March, 2023; originally announced March 2023.

  3. arXiv:2211.04655  [pdf, ps, other]

    math.OC cs.LG

    Variants of SGD for Lipschitz Continuous Loss Functions in Low-Precision Environments

    Authors: Michael R. Metel

    Abstract: Motivated by neural network training in low-precision arithmetic environments, this work studies the convergence of variants of SGD using adaptive step sizes with computational error. Considering a general stochastic Lipschitz continuous loss function, an asymptotic convergence result to a Clarke stationary point is proven as well as the non-asymptotic convergence to an approximate stationary poin…

    Submitted 24 April, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

  4. arXiv:2202.06141  [pdf, ps, other]

    math.OC

    Sparse Training with Lipschitz Continuous Loss Functions and a Weighted Group L0-norm Constraint

    Authors: Michael R. Metel

    Abstract: This paper is motivated by structured sparsity for deep neural network training. We study a weighted group L0-norm constraint, and present the projection and normal cone of this set. Using randomized smoothing, we develop zeroth and first-order algorithms for minimizing a Lipschitz continuous function constrained by any closed set which can be projected onto. Non-asymptotic convergence guarantees…

    Submitted 20 December, 2022; v1 submitted 12 February, 2022; originally announced February 2022.

  5. arXiv:2009.12769  [pdf, ps, other]

    math.OC

    Primal-dual subgradient method for constrained convex optimization problems

    Authors: Michael R. Metel, Akiko Takeda

    Abstract: This paper considers a general convex constrained problem setting where functions are not assumed to be differentiable nor Lipschitz continuous. Our motivation is in finding a simple first-order method for solving a wide range of convex optimization problems with minimal requirements. We study the method of weighted dual averages (Nesterov, 2009) in this setting and prove that it is an optimal met…

    Submitted 18 March, 2021; v1 submitted 27 September, 2020; originally announced September 2020.

  6. arXiv:2003.07606  [pdf, ps, other]

    math.OC

    Perturbed Iterate SGD for Lipschitz Continuous Loss Functions

    Authors: Michael R. Metel, Akiko Takeda

    Abstract: This paper presents an extension of stochastic gradient descent for the minimization of Lipschitz continuous loss functions. Our motivation is for use in non-smooth non-convex stochastic optimization problems, which are frequently encountered in applications such as machine learning. Using the Clarke $ε$-subdifferential, we prove the non-asymptotic convergence to an approximate stationary point in…

    Submitted 3 October, 2022; v1 submitted 17 March, 2020; originally announced March 2020.

  7. arXiv:1905.10188  [pdf, ps, other]

    math.OC

    Stochastic Proximal Methods for Non-Smooth Non-Convex Constrained Sparse Optimization

    Authors: Michael R. Metel, Akiko Takeda

    Abstract: This paper focuses on stochastic proximal gradient methods for optimizing a smooth non-convex loss function with a non-smooth non-convex regularizer and convex constraints. To the best of our knowledge we present the first non-asymptotic convergence results for this class of problem. We present two simple stochastic proximal gradient algorithms, for general stochastic and finite-sum optimization p…

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: text overlap with arXiv:1901.08369

  8. arXiv:1901.08369  [pdf, ps, other]

    math.OC

    Simple Stochastic Gradient Methods for Non-Smooth Non-Convex Regularized Optimization

    Authors: Michael R. Metel, Akiko Takeda

    Abstract: Our work focuses on stochastic gradient methods for optimizing a smooth non-convex loss function with a non-smooth non-convex regularizer. Research on this class of problem is quite limited, and until recently no non-asymptotic convergence results have been reported. We present two simple stochastic gradient algorithms, for finite-sum and general stochastic optimization problems, which have superi…

    Submitted 14 May, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

  9. arXiv:1811.11970  [pdf, ps, other]

    math.OC

    Charging station optimization for balanced electric car sharing

    Authors: Antoine Deza, Kai Huang, Michael R. Metel

    Abstract: This work focuses on finding optimal locations for charging stations for one-way electric car sharing programs. The relocation of vehicles by a service staff is generally required in vehicle sharing programs in order to correct imbalances in the network. We seek to limit the need for vehicle relocation by strategically locating charging stations given estimates of traffic flow. A mixed-integer lin…

    Submitted 29 November, 2018; originally announced November 2018.

  10. arXiv:1708.00555  [pdf, ps, other]

    math.OC

    Mini-batch stochastic gradient descent with dynamic sample sizes

    Authors: Michael R. Metel

    Abstract: We focus on solving constrained convex optimization problems using mini-batch stochastic gradient descent. Dynamic sample size rules are presented which ensure a descent direction with high probability. Empirical results from two applications show superior convergence compared to fixed sample implementations.

    Submitted 1 August, 2017; originally announced August 2017.

  11. arXiv:1701.02814  [pdf, ps, other]

    math.OC

    Kelly betting on horse races with uncertainty in probability estimates

    Authors: Michael R. Metel

    Abstract: We investigate the problem of gambling with uncertainty in outcome probabilities. Stochastic optimization models are proposed for optimal investing on events with mutually exclusive outcomes when probabilities are estimated using multinomial logistic regression. Special attention is given to the case of there being two outcomes, and the general case of many outcomes. An empirical study using simul…

    Submitted 1 August, 2017; v1 submitted 10 January, 2017; originally announced January 2017.

  12. arXiv:1510.06493  [pdf, ps, other]

    math.OC

    Imperfect demand estimation for new product production planning

    Authors: Antoine Deza, Kai Huang, Michael R. Metel

    Abstract: We are interested in the effect of consumer demand estimation error for new products in the context of production planning. An inventory model is proposed, whereby demand is influenced by price and advertising. The effect of parameter misspecification of the demand model is empirically examined in relation to profit and service level feasibility. Faced with an uncertain consumer reaction to price…

    Submitted 22 October, 2015; originally announced October 2015.

  13. arXiv:1510.05790  [pdf, ps, other]

    q-fin.PM math.OC

    Risk management under Omega measure

    Authors: Michael R. Metel, Traian A. Pirvu, Julian Wong

    Abstract: We prove that the Omega measure, which considers all moments when assessing portfolio performance, is equivalent to the widely used Sharpe ratio under jointly elliptic distributions of returns. Portfolio optimization of the Sharpe ratio is then explored, with an active-set algorithm presented for markets prohibiting short sales. When asymmetric returns are considered we show that the Omega measure…

    Submitted 11 April, 2017; v1 submitted 20 October, 2015; originally announced October 2015.

  14. arXiv:1503.06535  [pdf, ps, other]

    math.OC

    Managing losses in exotic horse race wagering

    Authors: Antoine Deza, Kai Huang, Michael R. Metel

    Abstract: We consider a specialized form of risk management for betting opportunities with low payout frequency, presented in particular for exotic horse race wagering. An optimization problem is developed which limits losing streaks with high probability to the given time horizon of a gambler, which is formulated as a globally solvable mixed integer non-linear program. A case study is conducted using one s…

    Submitted 1 August, 2017; v1 submitted 23 March, 2015; originally announced March 2015.

  15. arXiv:1407.7924  [pdf, ps, other]

    cs.CE math.OC

    Chance Constrained Optimization for Targeted Internet Advertising

    Authors: Antoine Deza, Kai Huang, Michael R. Metel

    Abstract: We introduce a chance constrained optimization model for the fulfillment of guaranteed display Internet advertising campaigns. The proposed formulation for the allocation of display inventory takes into account the uncertainty of the supply of Internet viewers. We discuss and present theoretical and computational features of the model via Monte Carlo sampling and convex approximations. Theoretical…

    Submitted 29 July, 2014; originally announced July 2014.