Skip to main content

Showing 1–6 of 6 results for author: Pacaci, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03437  [pdf, ps, other

    cs.IR

    Quake: Adaptive Indexing for Vector Search

    Authors: Jason Mohoney, Devesh Sarda, Mengze Tang, Shihabur Rahman Chowdhury, Anil Pacaci, Ihab F. Ilyas, Theodoros Rekatsinas, Shivaram Venkataraman

    Abstract: Vector search, the task of finding the k-nearest neighbors of a query vector against a database of high-dimensional vectors, underpins many machine learning applications, including retrieval-augmented generation, recommendation systems, and information retrieval. However, existing approximate nearest neighbor (ANN) methods perform poorly under dynamic and skewed workloads where data distributions… ▽ More

    Submitted 6 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2504.05573  [pdf, other

    cs.DB cs.AI cs.IR

    MicroNN: An On-device Disk-resident Updatable Vector Database

    Authors: Jeffrey Pound, Floris Chabert, Arjun Bhushan, Ankur Goswami, Anil Pacaci, Shihabur Rahman Chowdhury

    Abstract: Nearest neighbour search over dense vector collections has important applications in information retrieval, retrieval augmented generation (RAG), and content ranking. Performing efficient search over large vector collections is a well studied problem with many existing approaches and open source implementations. However, most state-of-the-art systems are generally targeted towards scenarios using… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  3. arXiv:2411.00970  [pdf, other

    cs.DB cs.AI cs.LG

    Incremental IVF Index Maintenance for Streaming Vector Search

    Authors: Jason Mohoney, Anil Pacaci, Shihabur Rahman Chowdhury, Umar Farooq Minhas, Jeffery Pound, Cedric Renggli, Nima Reyhani, Ihab F. Ilyas, Theodoros Rekatsinas, Shivaram Venkataraman

    Abstract: The prevalence of vector similarity search in modern machine learning applications and the continuously changing nature of data processed by these applications necessitate efficient and effective index maintenance techniques for vector search indexes. Designed primarily for static workloads, existing vector search indexes degrade in search quality and performance as the underlying data is updated… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: 14 pages, 14 figures

  4. arXiv:2304.01926  [pdf

    cs.DB cs.AI cs.LG

    High-Throughput Vector Similarity Search in Knowledge Graphs

    Authors: Jason Mohoney, Anil Pacaci, Shihabur Rahman Chowdhury, Ali Mousavi, Ihab F. Ilyas, Umar Farooq Minhas, Jeffrey Pound, Theodoros Rekatsinas

    Abstract: There is an increasing adoption of machine learning for encoding data into vectors to serve online recommendation and search use cases. As a result, recent data management systems propose augmenting query processing with online vector similarity search. In this work, we explore vector similarity search in the context of Knowledge Graphs (KGs). Motivated by the tasks of finding related KG queries a… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 13 pages, 7 figures, to be published in ACM SIGMOD 2023

  5. arXiv:2101.12305  [pdf, other

    cs.DB

    Evaluating Complex Queries on Streaming Graphs

    Authors: Anil Pacaci, Angela Bonifati, M. Tamer Özsu

    Abstract: We study the problem of evaluating persistent queries over streaming graphs in a principled fashion. These queries need to be evaluated over unbounded and very high speed graph streams. We define a streaming graph data model and query model incorporating navigational queries, subgraph queries and paths as first-class citizens. To support this full-fledged query model we develop a streaming graph a… ▽ More

    Submitted 1 August, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 18 pages; typos fixed; examples, experimental setup and analysis updated

  6. arXiv:2004.02012  [pdf, other

    cs.DB

    Regular Path Query Evaluation on Streaming Graphs

    Authors: Anil Pacaci, Angela Bonifati, M. Tamer Özsu

    Abstract: We study persistent query evaluation over streaming graphs, which is becoming increasingly important. We focus on navigational queries that determine if there exists a path between two entities that satisfies a user-specified constraint. We adopt the Regular Path Query (RPQ) model that specifies navigational patterns with labeled constraints. We propose deterministic algorithms to efficiently eval… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: A shorter version of this paper has been accepted for publication in 2020 International Conference on Management of Data (SIGMOD 2020)