-
Distributed Rate Scaling in Large-Scale Service Systems
Authors:
Daan Rutten,
Martin Zubeldia,
Debankur Mukherjee
Abstract:
We consider a large-scale parallel-server system, where each server independently adjusts its processing speed in a decentralized manner. The objective is to minimize the overall cost, which comprises the average cost of maintaining the servers' processing speeds and a non-decreasing function of the tasks' sojourn times. The problem is compounded by the lack of knowledge of the task arrival rate a…
▽ More
We consider a large-scale parallel-server system, where each server independently adjusts its processing speed in a decentralized manner. The objective is to minimize the overall cost, which comprises the average cost of maintaining the servers' processing speeds and a non-decreasing function of the tasks' sojourn times. The problem is compounded by the lack of knowledge of the task arrival rate and the absence of a centralized control or communication among the servers. We draw on ideas from stochastic approximation and present a novel rate scaling algorithm that ensures convergence of all server processing speeds to the globally asymptotically optimum value as the system size increases. Apart from the algorithm design, a key contribution of our approach lies in demonstrating how concepts from the stochastic approximation literature can be leveraged to effectively tackle learning problems in large-scale, distributed systems. En route, we also analyze the performance of a fully heterogeneous parallel-server system, where each server has a distinct processing speed, which might be of independent interest.
△ Less
Submitted 3 June, 2023;
originally announced June 2023.
-
Mean-field Analysis for Load Balancing on Spatial Graphs
Authors:
Daan Rutten,
Debankur Mukherjee
Abstract:
The analysis of large-scale, parallel-server load balancing systems has relied heavily on mean-field analysis. A pivotal assumption for this framework is that the servers are exchangeable. However, modern data-centers have data locality constraints, where tasks of a particular type can only be routed to a small subset of servers. An emerging line of research, therefore, considers load balancing al…
▽ More
The analysis of large-scale, parallel-server load balancing systems has relied heavily on mean-field analysis. A pivotal assumption for this framework is that the servers are exchangeable. However, modern data-centers have data locality constraints, where tasks of a particular type can only be routed to a small subset of servers. An emerging line of research, therefore, considers load balancing algorithms on bipartite graphs where vertices in the two partitions represent the task types and servers, respectively, and an edge represents the server's ability to process the corresponding task type. Due to the lack of exchangeability in this model, the mean-field techniques fundamentally break down. Recent progress has been made by considering graphs with strong edge-expansion properties, i.e., where any two large subsets of vertices are well-connected. However, data locality often leads to spatial constraints, where edges are local. As a result, these bipartite graphs do not have strong expansion properties.
In this paper, we consider the power-of-$d$ choices algorithm and develop a novel coupling-based approach to establish mean-field approximation for a large class of graphs that includes spatial graphs. As a corollary, we also show that, as the system size becomes large, the steady-state occupancy process for arbitrary regular bipartite graphs with diverging degrees, is indistinguishable from a fully flexible system on a complete bipartite graph. The method extends the scope of mean-field analysis far beyond the classical full-flexibility setup. En route, we prove that, starting from suitable states, the occupancy process becomes close to its steady state in a time that is independent of $N$. Such a large-scale mixing-time result might be of independent interest. Numerical experiments are conducted, which positively support the theoretical results.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Online Capacity Scaling Augmented With Unreliable Machine Learning Predictions
Authors:
Daan Rutten,
Debankur Mukherjee
Abstract:
Modern data centers suffer from immense power consumption. As a result, data center operators have heavily invested in capacity scaling solutions, which dynamically deactivate servers if the demand is low and activate them again when the workload increases. We analyze a continuous-time model for capacity scaling, where the goal is to minimize the weighted sum of flow-time, switching cost, and powe…
▽ More
Modern data centers suffer from immense power consumption. As a result, data center operators have heavily invested in capacity scaling solutions, which dynamically deactivate servers if the demand is low and activate them again when the workload increases. We analyze a continuous-time model for capacity scaling, where the goal is to minimize the weighted sum of flow-time, switching cost, and power consumption in an online fashion. We propose a novel algorithm, called Adaptive Balanced Capacity Scaling (ABCS), that has access to black-box machine learning predictions. ABCS aims to adapt to the predictions and is also robust against unpredictable surges in the workload. In particular, we prove that ABCS is $(1+\varepsilon)$-competitive if the predictions are accurate, and yet, it has a uniformly bounded competitive ratio even if the predictions are completely inaccurate. Finally, we investigate the performance of this algorithm on a real-world dataset and carry out extensive numerical experiments, which positively support the theoretical results.
△ Less
Submitted 20 April, 2022; v1 submitted 28 January, 2021;
originally announced January 2021.
-
Modeling Rydberg Gases using Random Sequential Adsorption on Random Graphs
Authors:
Daan Rutten,
Jaron Sanders
Abstract:
The statistics of strongly interacting, ultracold Rydberg gases are governed by the interplay of two factors: geometrical restrictions induced by blockade effects, and quantum mechanical effects. To shed light on their relative roles in the statistics of Rydberg gases, we compare three models in this paper: a quantum mechanical model describing the excitation dynamics within a Rydberg gas, a Rando…
▽ More
The statistics of strongly interacting, ultracold Rydberg gases are governed by the interplay of two factors: geometrical restrictions induced by blockade effects, and quantum mechanical effects. To shed light on their relative roles in the statistics of Rydberg gases, we compare three models in this paper: a quantum mechanical model describing the excitation dynamics within a Rydberg gas, a Random Sequential Adsorption (RSA) process on a Random Geometric Graph (RGG), and a RSA process on a Decomposed Random Intersection Graph (DRIG). The latter model is new, and refers to choosing a particular subgraph of a mixture of two other random graphs. Contrary to the former two models, it lends itself for a rigorous mathematical analysis; and it is built specifically to have particular structural properties of a RGG. We establish for it a fluid limit describing the time-evolution of number of Rydberg atoms, and show numerically that the expression remains accurate across a wider range of particle densities than an earlier approach based on an RSA process on an Erdos-Renyi Random Graph (ERRG). Finally, we also come up with a new heuristic using random graphs that gives a recursion to describe a normalized pair-correlation function of a Rydberg gas. Our results suggest that even without dissipation, on long time scales the statistics are affected most by the geometrical restrictions induced by blockade effects, while on short time scales the statistics are affected most by quantum mechanical effects.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Load Balancing Under Strict Compatibility Constraints
Authors:
Daan Rutten,
Debankur Mukherjee
Abstract:
We study large-scale systems operating under the JSQ$(d)$ policy in the presence of stringent task-server compatibility constraints. Consider a system with $N$ identical single-server queues and $M(N)$ task types, where each server is able to process only a small subset of possible task types. Each arriving task selects $d\geq 2$ random servers compatible to its type, and joins the shortest queue…
▽ More
We study large-scale systems operating under the JSQ$(d)$ policy in the presence of stringent task-server compatibility constraints. Consider a system with $N$ identical single-server queues and $M(N)$ task types, where each server is able to process only a small subset of possible task types. Each arriving task selects $d\geq 2$ random servers compatible to its type, and joins the shortest queue among them. The compatibility constraint is naturally captured by a fixed bipartite graph $G_N$ between the servers and the task types. When $G_N$ is complete bipartite, the meanfield approximation is proven to be accurate. However, such dense compatibility graphs are infeasible due to their overwhelming implementation cost and prohibitive storage capacity requirement at the servers. Our goal in this paper is to characterize the class of sparse compatibility graphs for which the meanfield approximation remains valid.
To achieve this, first, we introduce a novel graph expansion-based notion, called proportional sparsity, and establish that systems with proportionally sparse compatibility graphs match the performance of a fully flexible system, asymptotically in the large-system limit. Furthermore, for any $c(N)$ satisfying $$\frac{Nc(N)}{M(N)\ln(N)}\to \infty\quad \text{and}\quad c(N)\to \infty,$$ as $N\to\infty$, we show that proportionally sparse random compatibility graphs can be designed, so that the degree of each server is at most $c(N)$. This reduces the server-degree almost by a factor $N/\ln(N)$, compared to the complete bipartite compatibility graph, while maintaining the same asymptotic performance. Extensive simulation experiments are conducted to corroborate the theoretical results.
△ Less
Submitted 23 August, 2020; v1 submitted 17 August, 2020;
originally announced August 2020.