Skip to main content

Showing 1–3 of 3 results for author: Dadi, L T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04731  [pdf, other

    math.OC cs.LG

    Efficient Continual Finite-Sum Minimization

    Authors: Ioannis Mavrothalassitis, Stratis Skoulakis, Leello Tadesse Dadi, Volkan Cevher

    Abstract: Given a sequence of functions $f_1,\ldots,f_n$ with $f_i:\mathcal{D}\mapsto \mathbb{R}$, finite-sum minimization seeks a point ${x}^\star \in \mathcal{D}$ minimizing $\sum_{j=1}^n f_j(x)/n$. In this work, we propose a key twist into the finite-sum minimization, dubbed as continual finite-sum minimization, that asks for a sequence of points ${x}_1^\star,\ldots,{x}_n^\star \in \mathcal{D}$ such that… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted in ICLR 2024, 35 pages

  2. arXiv:2211.01851  [pdf, other

    math.OC cs.LG

    Adaptive Stochastic Variance Reduction for Non-convex Finite-Sum Minimization

    Authors: Ali Kavis, Stratis Skoulakis, Kimon Antonakopoulos, Leello Tadesse Dadi, Volkan Cevher

    Abstract: We propose an adaptive variance-reduction method, called AdaSpider, for minimization of $L$-smooth, non-convex functions with a finite-sum structure. In essence, AdaSpider combines an AdaGrad-inspired [Duchi et al., 2011, McMahan & Streeter, 2010], but a fairly distinct, adaptive step-size schedule with the recursive stochastic path integrated estimator proposed in [Fang et al., 2018]. To our know… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 23 pages, 2 figures, accepted at NeurIPS 2022

  3. arXiv:2202.13473  [pdf, other

    cs.LG cs.CV

    The Spectral Bias of Polynomial Neural Networks

    Authors: Moulik Choraria, Leello Tadesse Dadi, Grigorios Chrysos, Julien Mairal, Volkan Cevher

    Abstract: Polynomial neural networks (PNNs) have been recently shown to be particularly effective at image generation and face recognition, where high-frequency information is critical. Previous studies have revealed that neural networks demonstrate a $\textit{spectral bias}$ towards low-frequency functions, which yields faster learning of low-frequency components during training. Inspired by such studies,… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted at the International Conference on Learning Representations(ICLR) 2022