Showing 1–1 of 1 results for author: Maulen-Soto, R

Searching in archive cs.

  1. arXiv:2505.13112 [pdf, other]

    stat.ML cs.LG

    Attention-based clustering

    Authors: Rodrigo Maulen-Soto, Claire Boyer, Pierre Marion

    Abstract: Transformers have emerged as a powerful neural network architecture capable of tackling a wide range of learning tasks. In this work, we provide a theoretical analysis of their ability to automatically extract structure from data in an unsupervised setting. In particular, we demonstrate their suitability for clustering when the input data is generated from a Gaussian mixture model. To this end, we…

    Submitted 3 July, 2025; v1 submitted 19 May, 2025; originally announced May 2025.