Showing 1–2 of 2 results for author: Legović, T

  1. arXiv:2102.00853  [pdf, other]

    cs.LG cs.NE

    Painless step size adaptation for SGD

    Authors: Ilona Kulikovskikh, Tarzan Legović

    Abstract: Convergence and generalization are two crucial aspects of performance in neural networks. When analyzed separately, these properties may lead to contradictory results. Optimizing a convergence rate yields fast training, but does not guarantee the best generalization error. To avoid the conflict, recent studies suggest adopting a moderately large step size for optimizers, but the added value on the…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: This work has been submitted to the IEEE for possible publication

  2. arXiv:2008.03501  [pdf, other]

    cs.LG cs.NE stat.ML

    Why to "grow" and "harvest" deep learning models?

    Authors: Ilona Kulikovskikh, Tarzan Legović

    Abstract: Current expectations from training deep learning models with gradient-based methods include: 1) transparency; 2) high convergence rates; 3) high inductive biases. While the state-of-the-art methods with adaptive learning rate schedules are fast, they still fail to meet the other two requirements. We suggest reconsidering neural network models in terms of single-species population dynamics where adapta…

    Submitted 8 August, 2020; originally announced August 2020.