Skip to main content

Showing 1–6 of 6 results for author: Tsilivis, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.22069  [pdf, other

    cs.LG stat.ML

    Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

    Authors: Nikolaos Tsilivis, Gal Vardi, Julia Kempe

    Abstract: We study the implicit bias of the general family of steepest descent algorithms with infinitesimal learning rate in deep homogeneous neural networks. We show that: (a) an algorithm-dependent geometric margin starts increasing once the networks reach perfect training accuracy, and (b) any limit point of the training trajectory corresponds to a KKT point of the corresponding margin-maximization prob… ▽ More

    Submitted 2 April, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: The earlier conference version (ICLR 2025) of this paper showed a bias towards KKT points of the max-margin problem only in the case of 'smooth' norms. The current version (submitted to JMLR) proves that this holds true for any norm. It also includes new experiments on the implicit bias of the Shampoo algorithm

  2. arXiv:2410.16073  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds

    Authors: Matteo Vilucchio, Nikolaos Tsilivis, Bruno Loureiro, Julia Kempe

    Abstract: Regularization, whether explicit in terms of a penalty in the loss or implicit in the choice of algorithm, is a cornerstone of modern machine learning. Indeed, controlling the complexity of the model class is particularly important when data is scarce, noisy or contaminated, as it translates a statistical belief on the underlying structure of the data. This work investigates the question of how to… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  3. arXiv:2406.04981  [pdf, other

    cs.LG stat.ML

    The Price of Implicit Bias in Adversarially Robust Generalization

    Authors: Nikolaos Tsilivis, Natalie Frank, Nathan Srebro, Julia Kempe

    Abstract: We study the implicit bias of optimization in robust empirical risk minimization (robust ERM) and its connection with robust generalization. In classification settings under adversarial perturbations with linear models, we study what type of regularization should ideally be applied for a given perturbation set to improve (robust) generalization. We then show that the implicit bias of optimization… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2404.19640  [pdf, other

    cs.LG cs.AI cs.CV stat.ME stat.ML

    Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks

    Authors: Yunzhen Feng, Tim G. J. Rudner, Nikolaos Tsilivis, Julia Kempe

    Abstract: Adversarial examples have been shown to cause neural networks to fail on a wide range of vision and language tasks, but recent work has claimed that Bayesian neural networks (BNNs) are inherently robust to adversarial perturbations. In this work, we examine this claim. To study the adversarial robustness of BNNs, we investigate whether it is possible to successfully break state-of-the-art BNN infe… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  5. arXiv:2307.02693  [pdf, other

    cs.LG stat.ML

    Kernels, Data & Physics

    Authors: Francesco Cagnetta, Deborah Oliveira, Mahalakshmi Sabanayagam, Nikolaos Tsilivis, Julia Kempe

    Abstract: Lecture notes from the course given by Professor Julia Kempe at the summer school "Statistical physics of Machine Learning" in Les Houches. The notes discuss the so-called NTK approach to problems in machine learning, which consists of gaining an understanding of generally unsolvable problems by finding a tractable kernel formulation. The notes are mainly focused on practical applications such as… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: These are notes from the lecture of Julia Kempe given at the summer school "Statistical Physics \& Machine Learning", that took place in Les Houches School of Physics in France from 4th to 29th July 2022

  6. arXiv:2011.04468  [pdf, other

    math.OC cs.LG math.RA stat.ML

    Sparse Approximate Solutions to Max-Plus Equations with Application to Multivariate Convex Regression

    Authors: Nikos Tsilivis, Anastasios Tsiamis, Petros Maragos

    Abstract: In this work, we study the problem of finding approximate, with minimum support set, solutions to matrix max-plus equations, which we call sparse approximate solutions. We show how one can obtain such solutions efficiently and in polynomial time for any $\ell_p$ approximation error. Based on these results, we propose a novel method for piecewise-linear fitting of convex multivariate functions, wit… ▽ More

    Submitted 21 December, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: 20 pages, 5 figures, 5 tables. Introduction revision and typos correction