-
Exponential Tail Bounds on Queues: A Confluence of Non-Asymptotic Heavy Traffic and Large Deviations
Authors:
Prakirt Raj Jhunjhunwala,
Daniela Hurtado-Lange,
Siva Theja Maguluri
Abstract:
In general, obtaining the exact steady-state distribution of queue lengths is not feasible. Therefore, we establish bounds for the tail probabilities of queue lengths. Specifically, we examine queueing systems under Heavy-Traffic (HT) conditions and provide exponentially decaying bounds for the probability $\mathbb P(εq > x)$, where $ε$ is the HT parameter denoting how far the load is from the max…
▽ More
In general, obtaining the exact steady-state distribution of queue lengths is not feasible. Therefore, we establish bounds for the tail probabilities of queue lengths. Specifically, we examine queueing systems under Heavy-Traffic (HT) conditions and provide exponentially decaying bounds for the probability $\mathbb P(εq > x)$, where $ε$ is the HT parameter denoting how far the load is from the maximum allowed load. Our bounds are not limited to asymptotic cases and are applicable even for finite values of $ε$, and they get sharper as $ε\to 0$. Consequently, we derive non-asymptotic convergence rates for the tail probabilities. Unlike other approaches such as moment bounds based on drift arguments and bounds on Wasserstein distance using Stein's method, our method yields sharper tail bounds. Furthermore, our results offer bounds on the exponential rate of decay of the tail, given by $-\frac{1}{x} \log \mathbb P(εq > x)$ for any finite value of $x$. These can be interpreted as non-asymptotic versions of Large Deviation (LD) results.
We demonstrate our approach by presenting tail bounds for: (i) a continuous time Join-the-shortest queue (JSQ) load balancing system, (ii) a discrete time single-server queue and (iii) an $M/M/n$ queue. We not only bridge the gap between classical-HT and LD regimes but also explore the large system HT regimes for JSQ and $M/M/n$ systems. In these regimes, both the system size and the system load increase simultaneously. Our results also close a gap in the existing literature on the limiting distribution of JSQ in the super-NDS (a.k.a. super slowdown) regime. This contribution is of an independent interest. Here, a key ingredient is a more refined characterization of state space collapse for JSQ system, achieved by using an exponential Lyapunov function designed to approximate the $\ell_{\infty}$ norm.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Join-the-Shortest Queue with Abandonment: Critically Loaded and Heavily Overloaded Regimes
Authors:
Prakirt Raj Jhunjhunwala,
Martin Zubeldia,
Siva Theja Maguluri
Abstract:
We consider a load balancing system comprised of a fixed number of single server queues, operating under the well-known Join-the-Shortest Queue policy, and where jobs/customers are impatient and abandon if they do not receive service after some (random) amount of time. In this setting, we characterize the centered and appropriately scaled steady state queue length distribution (hereafter referred…
▽ More
We consider a load balancing system comprised of a fixed number of single server queues, operating under the well-known Join-the-Shortest Queue policy, and where jobs/customers are impatient and abandon if they do not receive service after some (random) amount of time. In this setting, we characterize the centered and appropriately scaled steady state queue length distribution (hereafter referred to as limiting distribution), in the limit as the abandonment rate goes to zero at the same time as the load either converges to one or is larger than one.
Depending on the arrival, service, and abandonment rates, we observe three different regimes of operation that yield three different limiting distributions. The first regime is when the system is underloaded and its load converges relatively slowly to one. In this case, abandonments do not affect the limiting distribution, and we obtain the same exponential distribution as in the system without abandonments. When the load converges to one faster, we have the second regime, where abandonments become significant. Here, the system undergoes a phase transition, and the limiting distribution is a truncated Gaussian. Further, the third regime is when the system is heavily overloaded, and so the queue lengths are very large. In this case, we show the limiting distribution converges to a normal distribution.
To establish our results, we first prove a weaker form of State Space Collapse by providing a uniform bound on the second moment of the (unscaled) perpendicular component of the queue lengths, which shows that the system behaves like a single server queue. We then use exponential Lyapunov functions to characterize the limiting distribution of the steady state queue length vector.
△ Less
Submitted 3 April, 2023; v1 submitted 27 November, 2022;
originally announced November 2022.
-
Heavy Traffic Joint Queue Length Distribution without Resource Pooling
Authors:
Prakirt Raj Jhunjhunwala,
Siva Theja Maguluri
Abstract:
This paper studies the heavy-traffic joint distribution of queue lengths in two stochastic processing networks (SPN), viz., an input-queued switch operating under the MaxWeight scheduling policy and a two-server parallel server system called the $\mathcal{N}$-system. These two systems serve as representatives of SPNs that do not satisfy the so-called Complete Resource Pooling (CRP) condition, and…
▽ More
This paper studies the heavy-traffic joint distribution of queue lengths in two stochastic processing networks (SPN), viz., an input-queued switch operating under the MaxWeight scheduling policy and a two-server parallel server system called the $\mathcal{N}$-system. These two systems serve as representatives of SPNs that do not satisfy the so-called Complete Resource Pooling (CRP) condition, and consequently exhibit a multidimensional State Space Collapse (SSC). Except in special cases, only mean queue lengths of such non-CRP systems is known in the literature. In this paper, we develop the Transform method to study the joint distribution of queue lengths in non-CRP systems. The key challenge is in solving an implicit functional equation involving the Laplace transform of the heavy-traffic limiting distribution. For the $\mathcal{N}$-system and a special case of an input-queued switch involving only three queues, we obtain the exact limiting heavy-traffic joint distribution in terms of a linear combination of two iid exponentials. For the general $n\times n$ input-queued switch that has $n^2$ queues, under a conjecture on uniqueness of the solution of the functional equation, we obtain an exact joint distribution of the heavy-traffic limiting queue-lengths in terms of a non-linear combination of $2n$ iid exponentials.
△ Less
Submitted 15 September, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.