Showing 1–9 of 9 results for author: Chandramouli, B

Searching in archive cs.
  1. arXiv:2502.13826  [pdf, other]

    cs.IR

    In-Place Updates of a Graph Index for Streaming Approximate Nearest Neighbor Search

    Authors: Haike Xu, Magdalen Dobson Manohar, Philip A. Bernstein, Badrish Chandramouli, Richard Wen, Harsha Vardhan Simhadri

    Abstract: Indices for approximate nearest neighbor search (ANNS) are a basic component for information retrieval and widely used in database, search, recommendation and RAG systems. In these scenarios, documents or other objects are inserted into and deleted from the working set at a high rate, requiring a stream of updates to the vector index. Algorithms based on proximity graph indices are the most effici…

    Submitted 19 February, 2025; originally announced February 2025.

  2. arXiv:2412.13314  [pdf, other]

    cs.DC

    Distributed Speculative Execution for Resilient Cloud Applications

    Authors: Tianyu Li, Badrish Chandramouli, Philip A. Bernstein, Samuel Madden

    Abstract: Fault-tolerance is critically important in highly-distributed modern cloud applications. Solutions such as Temporal, Azure Durable Functions, and Beldi hide fault-tolerance complexity from developers by persisting execution state and resuming seamlessly from persisted state after failure. This pattern, often called durable execution, usually forces frequent and synchronous persistence and results…

    Submitted 17 December, 2024; originally announced December 2024.

  3. arXiv:2407.13618  [pdf]

    cs.DC

    DDS: DPU-optimized Disaggregated Storage [Extended Report]

    Authors: Qizhen Zhang, Philip Bernstein, Badrish Chandramouli, Jiasheng Hu, Yiming Zheng

    Abstract: This extended report presents DDS, a novel disaggregated storage architecture enabled by emerging networking hardware, namely DPUs (Data Processing Units). DPUs can optimize the latency and CPU consumption of disaggregated storage servers. However, utilizing DPUs for DBMSs requires careful design of the network and storage paths and the interface exposed to the DBMS. To fully benefit from DPUs, DD…

    Submitted 28 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper is an extended report of DDS, which has been accepted by VLDB 2024

    ACM Class: H.2.4; D.4.2

  4. arXiv:2305.01516  [pdf, other]

    cs.DB

    F2: Designing a Key-Value Store for Large Skewed Workloads

    Authors: Konstantinos Kanellis, Badrish Chandramouli, Shivaram Venkataraman

    Abstract: Many real-world workloads present a challenging set of requirements: point operations requiring high throughput, working sets much larger than main memory, and natural skew in key access patterns for both reads and writes. We find that modern key-value designs are either optimized for memory-efficiency, sacrificing high-performance (LSM-tree designs), or achieve high-performance, saturating modern…

    Submitted 4 December, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

  5. arXiv:2112.12946  [pdf]

    cs.DC cs.DB

    Redy: Remote Dynamic Memory Cache

    Authors: Qizhen Zhang, Philip A. Bernstein, Daniel S. Berger, Badrish Chandramouli

    Abstract: Redy is a cloud service that provides high performance caches using RDMA-accessible remote memory. An application can customize the performance of each cache with a service level objective (SLO) for latency and throughput. By using remote memory, it can leverage stranded memory and spot VM instances to reduce the cost of its caches and improve data center resource utilization. Redy automatically c…

    Submitted 1 January, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: This is the extended report of Redy (accepted at VLDB 2022)

    ACM Class: H.2.4; C.2.4

  6. Achieving High Throughput and Elasticity in a Larger-than-Memory Store

    Authors: Chinmay Kulkarni, Badrish Chandramouli, Ryan Stutsman

    Abstract: Millions of sensors, mobile applications and machines now generate billions of events. Specialized many-core key-value stores (KVSs) can ingest and index these events at high rates (over 100 Mops/s on one machine) if events are generated on the same machine; however, to be practical and cost-effective they must ingest events over the network and scale across cloud resources elastically. We prese…

    Submitted 27 April, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Journal ref: PVLDB, 14(8): 1427 - 1440, 2021

  7. arXiv:2004.10898  [pdf, other]

    cs.DB cs.DS cs.LG

    Qd-tree: Learning Data Layouts for Big Data Analytics

    Authors: Zongheng Yang, Badrish Chandramouli, Chi Wang, Johannes Gehrke, Yinan Li, Umar Farooq Minhas, Per-Åke Larson, Donald Kossmann, Rajeev Acharya

    Abstract: Corporations today collect data at an unprecedented and accelerating scale, making the need to run queries on large datasets increasingly important. Technologies such as columnar block-based data organization and compression have become standard practice in most commercial database systems. However, the problem of best assigning records to data blocks on storage is still open. For example, today's…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: ACM SIGMOD 2020

  8. arXiv:1905.08898  [pdf, other]

    cs.DB cs.DS cs.LG

    ALEX: An Updatable Adaptive Learned Index

    Authors: Jialin Ding, Umar Farooq Minhas, Jia Yu, Chi Wang, Jaeyoung Do, Yinan Li, Hantian Zhang, Badrish Chandramouli, Johannes Gehrke, Donald Kossmann, David Lomet, Tim Kraska

    Abstract: Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be learned. The original work by Kraska et al. shows that a learned index beats a B+Tree by a factor of up to three in search time and by an order of magnitude in memory…

    Submitted 20 May, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    Report number: MSR-TR-2020-12

  9. arXiv:1205.4477  [pdf, other]

    cs.LG cs.DB

    Streaming Algorithms for Pattern Discovery over Dynamically Changing Event Sequences

    Authors: Debprakash Patnaik, Naren Ramakrishnan, Srivatsan Laxman, Badrish Chandramouli

    Abstract: Discovering frequent episodes over event sequences is an important data mining task. In many applications, events constituting the data sequence arrive as a stream, at furious rates, and recent trends (or frequent episodes) can change and drift due to the dynamical nature of the underlying event generation process. The ability to detect and track such changing sets of frequent episodes can be…

    Submitted 20 May, 2012; originally announced May 2012.