Showing 1–1 of 1 results for author: Henry, N W

  1. arXiv:2408.17221  [pdf, other]

    cs.LG math.AG

    Geometry of Lightning Self-Attention: Identifiability and Dimension

    Authors: Nathan W. Henry, Giovanni Luca Marchetti, Kathlén Kohn

    Abstract: We consider function spaces defined by self-attention networks without normalization, and theoretically analyze their geometry. Since these networks are polynomial, we rely on tools from algebraic geometry. In particular, we study the identifiability of deep attention by providing a description of the generic fibers of the parametrization for an arbitrary number of layers and, as a consequence, co…

    Submitted 19 February, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Accepted at ICLR 2025