Skip to main content

Showing 1–3 of 3 results for author: Wydrowski, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.10693  [pdf, other

    cs.DC math.OC

    Load Balancing with Network Latencies via Distributed Gradient Descent

    Authors: Santiago R. Balseiro, Vahab S. Mirrokni, Bartek Wydrowski

    Abstract: Motivated by the growing demand for serving large language model inference requests, we study distributed load balancing for global serving systems with network latencies. We consider a fluid model in which continuous flows of requests arrive at different frontends and need to be routed to distant backends for processing whose processing rates are workload dependent. Network latencies can lead to… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  2. arXiv:2411.17103  [pdf, other

    cs.DC

    Optimal and Stable Distributed Bipartite Load Balancing

    Authors: Wenxin Zhang, Santiago R. Balseiro, Robert Kleinberg, Vahab Mirrokni, Balasubramanian Sivan, Bartek Wydrowski

    Abstract: We study distributed load balancing in bipartite queueing systems. Specifically, a set of frontends route jobs to a set of heterogeneous backends with workload-dependent service rates, with an arbitrary bipartite graph representing the connectivity between the frontends and backends. Each frontend operates independently without any communication with the other frontends, and the goal is to minimiz… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  3. arXiv:2312.10172  [pdf, other

    cs.DC

    Load is not what you should balance: Introducing Prequal

    Authors: Bartek Wydrowski, Robert Kleinberg, Stephen M. Rumble, Aaron Archer

    Abstract: We present Prequal (Probing to Reduce Queuing and Latency), a load balancer for distributed multi-tenant systems. Prequal aims to minimize real-time request latency in the presence of heterogeneous server capacities and non-uniform, time-varying antagonist load. It actively probes server load to leverage the power-of-d-choices paradigm, extending it with asynchronous and reusable probes. Cutting a… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted to USENIX NSDI 2024

    ACM Class: C.2.4