Skip to main content

Showing 1–1 of 1 results for author: Mongaras, G

.
  1. arXiv:2409.18747  [pdf, other

    cs.LG

    Cottention: Linear Transformers With Cosine Attention

    Authors: Gabriel Mongaras, Trevor Dohm, Eric C. Larson

    Abstract: Attention mechanisms, particularly softmax attention, have been instrumental in the success of transformer-based models such as GPT. However, the quadratic memory complexity of softmax attention with respect to sequence length poses significant challenges for processing longer sequences. We introduce Cottention, a novel attention mechanism that replaces the softmax operation with cosine similarity… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: 12 pages, 5 figures