
Showing 1–1 of 1 results for author: Veneroso, J

  1. arXiv:2505.11471  [pdf, ps, other]

    cs.IR

    CRISP: Clustering Multi-Vector Representations for Denoising and Pruning

    Authors: João Veneroso, Rajesh Jayaram, Jinmeng Rao, Gustavo Hernández Ábrego, Majid Hadian, Daniel Cer

    Abstract: Multi-vector models, such as ColBERT, are a significant advancement in neural information retrieval (IR), delivering state-of-the-art performance by representing queries and documents by multiple contextualized token-level embeddings. However, this increased representation size introduces considerable storage and computational overheads which have hindered widespread adoption in practice. A common…

    Submitted 16 May, 2025; originally announced May 2025.