Skip to main content

Showing 1–2 of 2 results for author: Scheerer, J L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.17788  [pdf, other

    cs.IR

    WARP: An Efficient Engine for Multi-Vector Retrieval

    Authors: Jan Luca Scheerer, Matei Zaharia, Christopher Potts, Gustavo Alonso, Omar Khattab

    Abstract: Multi-vector retrieval methods such as ColBERT and its recent variant, the ConteXtualized Token Retriever (XTR), offer high accuracy but face efficiency challenges at scale. To address this, we present WARP, a retrieval engine that substantially improves the efficiency of retrievers trained with the XTR objective through three key innovations: (1) WARP$_\text{SELECT}$ for dynamic similarity imputa… ▽ More

    Submitted 30 April, 2025; v1 submitted 29 January, 2025; originally announced January 2025.

    Comments: Accepted at SIGIR 2025

  2. arXiv:2408.07494  [pdf, other

    cs.DB cs.LG

    QirK: Question Answering via Intermediate Representation on Knowledge Graphs

    Authors: Jan Luca Scheerer, Anton Lykov, Moe Kayali, Ilias Fountalis, Dan Olteanu, Nikolaos Vasiloglou, Dan Suciu

    Abstract: We demonstrate QirK, a system for answering natural language questions on Knowledge Graphs (KG). QirK can answer structurally complex questions that are still beyond the reach of emerging Large Language Models (LLMs). It does so using a unique combination of database technology, LLMs, and semantic search over vector embeddings. The glue for these components is an intermediate representation (IR).… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.