
Showing 1–1 of 1 results for author: Kreisler, I

  1. arXiv:2305.13064

    cs.LG math.OC stat.ML

    Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond

    Authors: Itai Kreisler, Mor Shpigel Nacson, Daniel Soudry, Yair Carmon

    Abstract: Recent research shows that when Gradient Descent (GD) is applied to neural networks, the loss almost never decreases monotonically. Instead, the loss oscillates as gradient descent converges to its "Edge of Stability" (EoS). Here, we find a quantity that does decrease monotonically throughout GD training: the sharpness attained by the gradient flow solution (GFS), the solution that would be obtai…

    Submitted 22 May, 2023; originally announced May 2023.