Skip to main content

Showing 1–1 of 1 results for author: Si, D

Searching in archive math. Search in all archives.
.
  1. arXiv:2306.09850  [pdf, other

    cs.LG math.OC stat.ML

    Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima

    Authors: Dongkuk Si, Chulhee Yun

    Abstract: Sharpness-Aware Minimization (SAM) is an optimizer that takes a descent step based on the gradient at a perturbation $y_t = x_t + ρ\frac{\nabla f(x_t)}{\lVert \nabla f(x_t) \rVert}$ of the current point $x_t$. Existing studies prove convergence of SAM for smooth functions, but they do so by assuming decaying perturbation size $ρ$ and/or no gradient normalization in $y_t$, which is detached from pr… ▽ More

    Submitted 27 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 39 pages. v3 NeurIPS 2023 camera ready version