Skip to main content

Showing 1–2 of 2 results for author: Gootzen, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.05366  [pdf, ps, other

    cs.NI

    SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication

    Authors: Mikhail Khalilov, Siyuan Shen, Marcin Chrapek, Tiancheng Chen, Kenji Nakano, Peter-Jan Gootzen, Salvatore Di Girolamo, Rami Nudelman, Gil Bloch, Sreevatsa Anantharamu, Mahmoud Elhaddad, Jithin Jose, Abdul Kabbani, Scott Moe, Konstantin Taranov, Zhuolong Yu, Jie Zhang, Nicola Mazzoletti, Torsten Hoefler

    Abstract: RDMA is vital for efficient distributed training across datacenters, but millisecond-scale latencies complicate the design of its reliability layer. We show that depending on long-haul link characteristics, such as drop rate, distance and bandwidth, the widely used Selective Repeat algorithm can be inefficient, warranting alternatives like Erasure Coding. To enable such alternatives on existing ha… ▽ More

    Submitted 10 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

  2. arXiv:2112.06810  [pdf, other

    cs.OS cs.PL

    Bento and the Art of Repeated Research

    Authors: Peter-Jan Gootzen, Animesh Trivedi

    Abstract: Bento provides a new approach to developing file systems, with safety and high-velocity development in mind. This is achieved by using Rust, a modern and memory-safe systems programming language, and by providing a framework to run a single file system implementation in kernel space with the VFS or in user space with FUSE. In this paper, the benchmarking experiments from the Bento paper are repeat… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.