Showing 1–7 of 7 results for author: Shahverdi, V

Searching in archive cs.
  1. arXiv:2505.11846  [pdf, ps, other]

    cs.LG math.AG

    Learning on a Razor's Edge: the Singularity Bias of Polynomial Neural Networks

    Authors: Vahid Shahverdi, Giovanni Luca Marchetti, Kathlén Kohn

    Abstract: Deep neural networks often infer sparse representations, converging to a subnetwork during the learning process. In this work, we theoretically analyze subnetworks and their bias through the lens of algebraic geometry. We consider fully-connected networks with polynomial activation functions, and focus on the geometry of the function space they parametrize, often referred to as neuromanifold. Firs…

    Submitted 17 May, 2025; originally announced May 2025.

  2. arXiv:2501.18915  [pdf, ps, other]

    cs.LG math.AG

    Algebra Unveils Deep Learning -- An Invitation to Neuroalgebraic Geometry

    Authors: Giovanni Luca Marchetti, Vahid Shahverdi, Stefano Mereta, Matthew Trager, Kathlén Kohn

    Abstract: In this position paper, we promote the study of function spaces parameterized by machine learning models through the lens of algebraic geometry. To this end, we focus on algebraic models, such as neural networks with polynomial activations, whose associated function spaces are semi-algebraic varieties. We outline a dictionary between algebro-geometric invariants of these varieties, such as dimensi…

    Submitted 30 May, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Published at ICML 2025

  3. arXiv:2410.00722  [pdf, other]

    cs.LG math.AG

    On the Geometry and Optimization of Polynomial Convolutional Networks

    Authors: Vahid Shahverdi, Giovanni Luca Marchetti, Kathlén Kohn

    Abstract: We study convolutional neural networks with monomial activation functions. Specifically, we prove that their parameterization map is regular and is an isomorphism almost everywhere, up to rescaling the filters. By leveraging on tools from algebraic geometry, we explore the geometric properties of the image in function space of this map - typically referred to as neuromanifold. In particular, we co…

    Submitted 3 March, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted at AISTATS 2025

  4. arXiv:2409.04868  [pdf, other]

    cs.IT

    Moment Constraints and Phase Recovery for Multireference Alignment

    Authors: Vahid Shahverdi, Emanuel Ström, Joakim Andén

    Abstract: Multireference alignment (MRA) refers to the problem of recovering a signal from noisy samples subject to random circular shifts. Expectation maximization (EM) and variational approaches use statistical modeling to achieve high accuracy at the cost of solving computationally expensive optimization problems. The method of moments, instead, achieves fast reconstructions by utilizing the power spectr…

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: 24 pages, 10 figures

    MSC Class: 94A12; 92C55; 62F12; 68U10; 90C30; 58C25; 58E05

  5. arXiv:2401.16613  [pdf, ps, other]

    math.AG cs.LG

    Algebraic Complexity and Neurovariety of Linear Convolutional Networks

    Authors: Vahid Shahverdi

    Abstract: In this paper, we study linear convolutional networks with one-dimensional filters and arbitrary strides. The neuromanifold of such a network is a semialgebraic set, represented by a space of polynomials admitting specific factorizations. Introducing a recursive algorithm, we generate polynomial equations whose common zero locus corresponds to the Zariski closure of the corresponding neuromanifold…

    Submitted 29 January, 2024; originally announced January 2024.

    MSC Class: 68T07; 14E99; 14J99; 14P10; 90C23

  6. arXiv:2309.13736  [pdf, other]

    cs.LG math.AG

    Geometry of Linear Neural Networks: Equivariance and Invariance under Permutation Groups

    Authors: Kathlén Kohn, Anna-Laura Sattelberger, Vahid Shahverdi

    Abstract: The set of functions parameterized by a linear fully-connected neural network is a determinantal variety. We investigate the subvariety of functions that are equivariant or invariant under the action of a permutation group. Examples of such group actions are translations or $90^\circ$ rotations on images. We describe such equivariant or invariant subvarieties as direct products of determinantal va…

    Submitted 10 January, 2025; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 42 pages, 8 figures, 1 table; comments welcome!

  7. arXiv:2304.05752  [pdf, other]

    cs.LG math.AG

    Function Space and Critical Points of Linear Convolutional Networks

    Authors: Kathlén Kohn, Guido Montúfar, Vahid Shahverdi, Matthew Trager

    Abstract: We study the geometry of linear networks with one-dimensional convolutional layers. The function spaces of these networks can be identified with semi-algebraic families of polynomials admitting sparse factorizations. We analyze the impact of the network's architecture on the function space's dimension, boundary, and singular points. We also describe the critical points of the network's parameteriz…

    Submitted 26 January, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 35 pages, 1 figure, 2 tables

    MSC Class: 68T07; 14B05; 14E99; 14J99; 14N05; 14P10; 90C23