-
Asymptotic Optimality of the Speed-Aware Join-the-Shortest-Queue in the Halfin-Whitt Regime for Heterogeneous Systems
Authors:
Sanidhay Bhambay,
Burak Büke,
Arpan Mukhopadhyay
Abstract:
The Join-the-Shortest-Queue (JSQ) load balancing scheme is known to minimise the average response time of jobs in homogeneous systems with identical servers. However, for {\em heterogeneous} systems with servers having different processing speeds, finding an optimal load balancing scheme remains an open problem for finite system sizes. Recently, for systems with heterogeneous servers, a variant of…
▽ More
The Join-the-Shortest-Queue (JSQ) load balancing scheme is known to minimise the average response time of jobs in homogeneous systems with identical servers. However, for {\em heterogeneous} systems with servers having different processing speeds, finding an optimal load balancing scheme remains an open problem for finite system sizes. Recently, for systems with heterogeneous servers, a variant of the JSQ scheme, called the {\em Speed-Aware-Join-the-Shortest-Queue (SA-JSQ)} scheme, has been shown to achieve asymptotic optimality in the fluid-scaling regime where the number of servers $n$ tends to infinity but the normalised the arrival rate of jobs remains constant. {In this paper, we show that the SA-JSQ scheme is also asymptotically optimal for heterogeneous systems in the {\em Halfin-Whitt} traffic regime where the normalised arrival rate scales as $1-O(1/\sqrt{n})$.} Our analysis begins by establishing that an appropriately scaled and centered version of the Markov process describing system dynamics weakly converges to a two-dimensional reflected {\em Ornstein-Uhlenbeck (OU) process}. We then show using {\em Stein's method} that the stationary distribution of the underlying Markov process converges to that of the OU process as the system size increases by establishing the validity of interchange of limits. {Finally, through coupling with a suitably constructed system, we show that SA-JSQ asymptotically minimises the diffusion-scaled total number of jobs and the diffusion-scaled number of waiting jobs in the steady-state in the Halfin-Whitt regime among all policies which dispatch jobs based on queue lengths and server speeds.}
△ Less
Submitted 21 October, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
Many-Server Queueing Systems with Heterogeneous Strategic Servers in Heavy Traffic
Authors:
Burak Büke,
Goncalo dos Reis,
Vadim Platonov
Abstract:
In most service systems, the servers are humans who desire to experience a certain level of idleness. In call centers, this manifests itself as the call avoidance behavior, where servers strategically adjust their service rate to strike a balance between the idleness they receive and effort to work harder. Moreover, being humans, each server values this trade-off differently and has different capa…
▽ More
In most service systems, the servers are humans who desire to experience a certain level of idleness. In call centers, this manifests itself as the call avoidance behavior, where servers strategically adjust their service rate to strike a balance between the idleness they receive and effort to work harder. Moreover, being humans, each server values this trade-off differently and has different capabilities. Drawing ideas on mean-field games we develop a novel framework relying on measure-valued processes to simultaneously address strategic server behavior and inherent server heterogeneity in service systems. This framework enables us to extend the recent literature on strategic servers in four new directions by: (i) incorporating individual choices of servers, (ii) incorporating individual abilities of servers, (iii) modeling the discomfort experienced by servers due to low levels of idleness, and (iv) considering more general routing policies. Using our framework, we are able to asymptotically characterize asymmetric Nash equilibria for many-server systems with strategic servers.
In simpler cases, it has been shown that the purely quality-driven regime is asymptotically optimal. However, we show that if the discomfort increases fast enough as the idleness approaches zero, the quality-and-efficiency-driven regime and other quality driven regimes can be optimal. This is the first time this conclusion appears in the literature.
△ Less
Submitted 27 May, 2025; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Many-Server Queues with Random Service Rates in the Halfin-Whitt Regime: A Measure-Valued Process Approach
Authors:
Burak Büke,
Wenyi Qin
Abstract:
We consider many-server queueing systems with heterogeneous exponential servers and renewal arrivals. The service rate of each server is a random variable drawn from a given distribution. We develop a framework for analyzing the heavy traffic limit of these queues in random environment using probability measure-valued stochastic processes. We introduce the measure-valued fairness process which den…
▽ More
We consider many-server queueing systems with heterogeneous exponential servers and renewal arrivals. The service rate of each server is a random variable drawn from a given distribution. We develop a framework for analyzing the heavy traffic limit of these queues in random environment using probability measure-valued stochastic processes. We introduce the measure-valued fairness process which denotes the proportion of cumulative idleness experienced by servers whose rates fall in a Borel subset of the support of the service rates. It can be shown that these fairness processes do not converge in the usual Skorokhod-$J_1$ topology, hence we introduce a new notion of convergence based on shifted versions of these processes. We also introduce some useful martingales to identify limiting fairness processes under different routing policies.
△ Less
Submitted 10 May, 2019;
originally announced May 2019.
-
Separable Approximations and Decomposition Methods for the Augmented Lagrangian
Authors:
Rachael Tappenden,
Peter Richtarik,
Burak Buke
Abstract:
In this paper we study decomposition methods based on separable approximations for minimizing the augmented Lagrangian. In particular, we study and compare the Diagonal Quadratic Approximation Method (DQAM) of Mulvey and Ruszczyński and the Parallel Coordinate Descent Method (PCDM) of Richtárik and Takáč. We show that the two methods are equivalent for feasibility problems up to the selection of a…
▽ More
In this paper we study decomposition methods based on separable approximations for minimizing the augmented Lagrangian. In particular, we study and compare the Diagonal Quadratic Approximation Method (DQAM) of Mulvey and Ruszczyński and the Parallel Coordinate Descent Method (PCDM) of Richtárik and Takáč. We show that the two methods are equivalent for feasibility problems up to the selection of a single step-size parameter. Furthermore, we prove an improved complexity bound for PCDM under strong convexity, and show that this bound is at least $8(L'/\bar{L})(ω-1)^2$ times better than the best known bound for DQAM, where $ω$ is the degree of partial separability and $L'$ and $\bar{L}$ are the maximum and average of the block Lipschitz constants of the gradient of the quadratic penalty appearing in the augmented Lagrangian.
△ Less
Submitted 30 August, 2013;
originally announced August 2013.
-
Cross-training with Imperfect training Schemes
Authors:
Burak Buke,
Ozgur M. Araz,
John W. Fowler
Abstract:
Cross-training workers is one of the most efficient ways to achieve flexibility in manufacturing and service systems to increase responsiveness to demand variability. However, it is generally the case that cross-trained employees are not as productive as employees who are originally trained on a specific task. Also, the productivity of the cross-trained workers depend on when they are cross-traine…
▽ More
Cross-training workers is one of the most efficient ways to achieve flexibility in manufacturing and service systems to increase responsiveness to demand variability. However, it is generally the case that cross-trained employees are not as productive as employees who are originally trained on a specific task. Also, the productivity of the cross-trained workers depend on when they are cross-trained. In this work, we consider a two-stage model to analyze the affect of variations in productivity levels of workers on cross-training policies. Our results indicate that the most important factor determining the problem structure is the consistency in productivity levels of workers trained at different times. As long as cross-training can be done in a consistent manner, the productivity differences between cross-trained workers and workers originally trained on the task plays a minor role. We also analyze the effect of the variabilities in demand and producivity levels. We show that if the productivity levels of workers trained at different times are consistent, the decision maker is inclined to defer the cross-training decisions as the variability of demand or productivity levels increases. However, when the productivities of workers trained at different times differ, the decision maker may prefer to invest more in cross-training earlier as variability increases.
△ Less
Submitted 26 August, 2013;
originally announced August 2013.