Showing 1–10 of 10 results for author: Hershcovitch, M

Searching in archive cs.
  1. arXiv:2502.20818  [pdf, other]

    cs.DC

    SkyStore: Cost-Optimized Object Storage Across Regions and Clouds

    Authors: Shu Liu, Xiangxi Mo, Moshik Hershcovitch, Henric Zhang, Audrey Cheng, Guy Girmonsky, Gil Vernik, Michael Factor, Tiemo Bang, Soujanya Ponnapalli, Natacha Crooks, Joseph E. Gonzalez, Danny Harnik, Ion Stoica

    Abstract: Modern applications span multiple clouds to reduce costs, avoid vendor lock-in, and leverage low-availability resources in another cloud. However, standard object stores operate within a single cloud, forcing users to manually manage data placement across clouds, i.e., navigate their diverse APIs and handle heterogeneous costs for network and storage. This is often a complex choice: users must eit…

    Submitted 28 February, 2025; originally announced February 2025.

  2. arXiv:2411.05239  [pdf, ps, other]

    cs.LG cs.IT

    ZipNN: Lossless Compression for AI Models

    Authors: Moshik Hershcovitch, Andrew Wood, Leshem Choshen, Guy Girmonsky, Roy Leibovitz, Ilias Ennmouri, Michal Malka, Peter Chin, Swaminathan Sundararaman, Danny Harnik

    Abstract: With the growth of model sizes and the scale of their deployment, their sheer size burdens the infrastructure requiring more network and more storage to accommodate these. While there is a vast model compression literature deleting parts of the model weights for faster inference, we investigate a more traditional type of compression - one that represents the model in a compact form and is coupled…

    Submitted 4 June, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: IEEE Cloud. arXiv admin note: text overlap with arXiv:2404.15198

  3. arXiv:2404.15198  [pdf, other]

    cs.LG cs.IT

    Lossless and Near-Lossless Compression for Foundation Models

    Authors: Moshik Hershcovitch, Leshem Choshen, Andrew Wood, Ilias Enmouri, Peter Chin, Swaminathan Sundararaman, Danny Harnik

    Abstract: With the growth of model sizes and scale of their deployment, their sheer size burdens the infrastructure requiring more network and more storage to accommodate these. While there is a vast literature about reducing model sizes, we investigate a more traditional type of compression -- one that compresses the model to a smaller form and is coupled with a decompression algorithm that returns it to i…

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2306.04602  [pdf, other]

    cs.CR

    Prefix Siphoning: Exploiting LSM-Tree Range Filters For Information Disclosure (Full Version)

    Authors: Adi Kaufman, Moshik Hershcovitch, Adam Morrison

    Abstract: Key-value stores typically leave access control to the systems for which they act as storage engines. Unfortunately, attackers may circumvent such read access controls via timing attacks on the key-value store, which use differences in query response times to glean information about stored data. To date, key-value store timing attacks have aimed to disclose stored values and have exploited exter…

    Submitted 8 September, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Full version of USENIX ATC'23 paper

  5. arXiv:2202.13718  [pdf, other]

    cs.LG cs.CY

    Fast Feature Selection with Fairness Constraints

    Authors: Francesco Quinzan, Rajiv Khanna, Moshik Hershcovitch, Sarel Cohen, Daniel G. Waddington, Tobias Friedrich, Michael W. Mahoney

    Abstract: We study the fundamental problem of selecting optimal features for model construction. This problem is computationally challenging on large datasets, even with the use of greedy algorithm variants. To address this challenge, we extend the adaptive query model, recently proposed for the greedy forward selection for submodular functions, to the faster paradigm of Orthogonal Matching Pursuit for non-…

    Submitted 3 February, 2023; v1 submitted 28 February, 2022; originally announced February 2022.

  6. arXiv:2202.11518  [pdf, other]

    cs.LG cs.AR

    Non-Volatile Memory Accelerated Geometric Multi-Scale Resolution Analysis

    Authors: Andrew Wood, Moshik Hershcovitch, Daniel Waddington, Sarel Cohen, Meredith Wolf, Hongjun Suh, Weiyu Zong, Peter Chin

    Abstract: Dimensionality reduction algorithms are standard tools in a researcher's toolbox. Dimensionality reduction algorithms are frequently used to augment downstream tasks such as machine learning, data science, and also are exploratory methods for understanding complex phenomena. For instance, dimensionality reduction is commonly used in Biology as well as Neuroscience to understand data collected from…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: 2021 IEEE High Performance Extreme Computing Conference (HPEC)

  7. arXiv:2202.10522  [pdf, other]

    cs.LG cs.AR

    Non-Volatile Memory Accelerated Posterior Estimation

    Authors: Andrew Wood, Moshik Hershcovitch, Daniel Waddington, Sarel Cohen, Peter Chin

    Abstract: Bayesian inference allows machine learning models to express uncertainty. Current machine learning models use only a single learnable parameter combination when making predictions, and as a result are highly overconfident when their predictions are wrong. To use more learnable parameter combinations efficiently, these samples must be drawn from the posterior distribution. Unfortunately computing t…

    Submitted 21 February, 2022; originally announced February 2022.

  8. arXiv:2104.06225  [pdf, other]

    cs.DB cs.PF

    A High-Performance Persistent Memory Key-Value Store with Near-Memory Compute

    Authors: Daniel Waddington, Clem Dickey, Luna Xu, Moshik Hershcovitch, Sangeetha Seshadri

    Abstract: MCAS (Memory Centric Active Storage) is a persistent memory tier for high-performance durable data storage. It is designed from the ground-up to provide a key-value capability with low-latency guarantees and data durability through memory persistence and replication. To reduce data movement and make further gains in performance, we provide support for user-defined "push-down" operations (known as…

    Submitted 12 April, 2021; originally announced April 2021.

  9. arXiv:2103.00007  [pdf, other]

    cs.AR

    An Architecture for Memory Centric Active Storage (MCAS)

    Authors: Daniel Waddington, Clem Dickey, Moshik Hershcovitch, Sangeetha Seshadri

    Abstract: The advent of CPU-attached persistent memory technology, such as Intel's Optane Persistent Memory Modules (PMM), has brought with it new opportunities for storage. In 2018, IBM Research Almaden began investigating and developing a new enterprise-grade storage solution directly aimed at this emerging technology. MCAS (Memory Centric Active Storage) defines an evolved network-attached key-value stor…

    Submitted 21 May, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Revision 1.2

  10. arXiv:1306.3772  [pdf, other]

    cs.DS

    Minimal Indices for Successor Search

    Authors: Sarel Cohen, Amos Fiat, Moshik Hershcovitch, Haim Kaplan

    Abstract: We give a new successor data structure which improves upon the index size of the Pǎtraşcu-Thorup data structures, reducing the index size from $O(n w^{4/5})$ bits to $O(n \log w)$ bits, with optimal probe complexity. Alternatively, our new data structure can be viewed as matching the space complexity of the (probe-suboptimal) $z$-fast trie of Belazzougui et al. Thus, we get the best of both approa…

    Submitted 17 June, 2013; originally announced June 2013.

    Comments: 28 pages, full version, extended abstract submitted to MFCS 2013