Skip to main content

Showing 1–8 of 8 results for author: Athanassoulis, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.16929  [pdf, ps, other

    cs.DB

    CUBIT: Concurrent Updatable Bitmap Indexing (Extended Version)

    Authors: Junchang Wang, Manos Athanassoulis

    Abstract: Bitmap indexes are widely used for read-intensive analytical workloads because they are clustered and offer efficient reads with a small memory footprint. However, they are notoriously inefficient to update. As analytical applications are increasingly fused with transactional applications, leading to the emergence of hybrid transactional/analytical processing (HTAP), it is desirable that bitmap in… ▽ More

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  2. arXiv:2311.10005  [pdf, other

    cs.DB

    Towards Flexibility and Robustness of LSM Trees

    Authors: Andy Huynh, Harshal A. Chaudhari, Evimaria Terzi, Manos Athanassoulis

    Abstract: Log-Structured Merge trees (LSM trees) are increasingly used as part of the storage engine behind several data systems, and are frequently deployed in the cloud. As the number of applications relying on LSM-based storage backends increases, the problem of performance tuning of LSM trees receives increasing attention. We consider both nominal tunings - where workload and execution environment are a… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 25 pages, 19 figures, VLDB-J. arXiv admin note: substantial text overlap with arXiv:2110.13801

  3. arXiv:2310.03098  [pdf, other

    cs.DB

    NOCAP: Near-Optimal Correlation-Aware Partitioning Joins

    Authors: Zichen Zhu, Xiao Hu, Manos Athanassoulis

    Abstract: Storage-based joins are still commonly used today because the memory budget does not always scale with the data size. One of the many join algorithms developed that has been widely deployed and proven to be efficient is the Hybrid Hash Join (HHJ), which is designed to exploit any available memory to maximize the data that is joined directly in memory. However, HHJ cannot fully exploit detailed kno… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  4. Constructing and Analyzing the LSM Compaction Design Space (Updated Version)

    Authors: Subhadeep Sarkar, Dimitris Staratzis, Zichen Zhu, Manos Athanassoulis

    Abstract: Log-structured merge (LSM) trees offer efficient ingestion by appending incoming data, and thus, are widely used as the storage layer of production NoSQL data stores. To enable competitive read performance, LSM-trees periodically re-organize data to form a tree with levels of exponentially increasing capacity, through iterative compactions. Compactions fundamentally influence the performance of an… ▽ More

    Submitted 26 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the VLDB Endowment, 2021

  5. arXiv:2202.04185  [pdf, other

    cs.DB cs.DS

    OSM-tree: A Sortedness-Aware Index

    Authors: Aneesh Raman, Subhadeep Sarkar, Matthaios Olma, Manos Athanassoulis

    Abstract: Indexes facilitate efficient querying when the selection predicate is on an indexed key. As a result, when loading data, if we anticipate future selective (point or range) queries, we typically maintain an index that is gradually populated as new data is ingested. In that respect, indexing can be perceived as the process of adding structure to an incoming, otherwise unsorted, data collection. The… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  6. arXiv:2110.13801  [pdf, other

    cs.DB

    Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty

    Authors: Andy Huynh, Harshal A. Chaudhari, Evimaria Terzi, Manos Athanassoulis

    Abstract: Log-Structured Merge trees (LSM trees) are increasingly used as the storage engines behind several data systems, frequently deployed in the cloud. Similar to other database architectures, LSM trees take into account information about the expected workload (e.g., reads vs. writes, point vs. range queries) to optimize their performance via tuning. Operating in shared infrastructure like the cloud, h… ▽ More

    Submitted 2 November, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: 21 pages, 30 figures

  7. arXiv:2109.14349  [pdf, other

    cs.DB cs.AR

    Relational Memory: Native In-Memory Accesses on Rows and Columns

    Authors: Shahin Roozkhosh, Denis Hoornaert, Ju Hyoung Mun, Tarikul Islam Papon, Ahmed Sanaullah, Ulrich Drepper, Renato Mancuso, Manos Athanassoulis

    Abstract: Analytical database systems are typically designed to use a column-first data layout to access only the desired fields. On the other hand, storing data row-first works great for accessing, inserting, or updating entire rows. Transforming rows to columns at runtime is expensive, hence, many analytical systems ingest data in row-first form and transform it in the background to columns to facilitate… ▽ More

    Submitted 6 February, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

  8. arXiv:2006.04777  [pdf, other

    cs.DB

    Lethe: A Tunable Delete-Aware LSM Engine (Updated Version)

    Authors: Subhadeep Sarkar, Tarikul Islam Papon, Dimitris Staratzis, Manos Athanassoulis

    Abstract: Data-intensive applications fueled the evolution of log structured merge (LSM) based key-value engines that employ the out-of-place paradigm to support high ingestion rates with low read/write interference. These benefits, however, come at the cost of treating deletes as a second-class citizen. A delete inserts a tombstone that invalidates older instances of the deleted key. State-of-the-art LSM e… ▽ More

    Submitted 12 June, 2020; v1 submitted 8 June, 2020; originally announced June 2020.