Join the Shortest Queue in Parallel Servers

This paper studies the steady-state properties of the join-the-shortest-queue model in the Halfin–Whitt regime. We focus on the process tracking the number of idle servers and the number of servers with nonempty buffers. Recently, Eschenfeldt and Gamarnik proved that a scaled version of this process converges, over finite time intervals, to a two-dimensional diffusion limit as the number of servers goes to infinity. In this paper, we prove that the diffusion limit is exponentially ergodic and that the diffusion-scaled sequence of the steady-state number of idle servers and nonempty buffers is tight. Combined with the process-level convergence proved by Eschenfeldt and Gamarnik, our results imply convergence of steady-state distributions. The methodology used is the generator expansion framework based on Stein’s method, also referred to as the drift-based fluid limit Lyapunov function approach in Stolyar. One technical contribution to the framework is to show how it can be used as a general tool to establish exponential ergodicity.

Large deviations without principle: join the shortest queue

Mathematical Methods of Operations Research ◽

10.1007/s00186-005-0037-1 ◽

2005 ◽

Vol 62 (3) ◽

pp. 467-483 ◽

Cited By ~ 6

Author(s):

Ad Ridder ◽

Adam Shwartz

Keyword(s):

Large Deviations ◽

Join The Shortest Queue ◽

Shortest Queue

Join-the-shortest-queue scheduling with delay

2017 American Control Conference (ACC) ◽

10.23919/acc.2017.7963205 ◽

2017 ◽

Cited By ~ 3

Author(s):

Saied Mehdian ◽

Zhengyuan Zhou ◽

Nicholas Bambos

Keyword(s):

Queue Scheduling ◽

Join The Shortest Queue ◽

Shortest Queue

ASYMPTOTIC BEHAVIOR FOR MArP/PH/2 QUEUE WITH JOIN THE SHORTEST QUEUE DISCIPLINE

Journal of the Operations Research Society of Japan ◽

10.15807/jorsj.54.46 ◽

2011 ◽

Vol 54 (1) ◽

pp. 46-64 ◽

Cited By ~ 1

Author(s):

Yutaka Sakuma

Keyword(s):

Asymptotic Behavior ◽

Queue Discipline ◽

Join The Shortest Queue ◽

Shortest Queue

Join the shortest queue among $$k$$ parallel queues: tail asymptotics of its stationary distribution

Queueing Systems ◽

10.1007/s11134-013-9353-y ◽

2013 ◽

Vol 74 (2-3) ◽

pp. 303-332

Author(s):

Masahiro Kobayashi ◽

Yutaka Sakuma ◽

Masakiyo Miyazawa

Keyword(s):

Stationary Distribution ◽

Tail Asymptotics ◽

Parallel Queues ◽

Join The Shortest Queue ◽

Shortest Queue

Many-server asymptotics for join-the-shortest-queue: Large deviations and rare events

The Annals of Applied Probability ◽

10.1214/20-aap1650 ◽

2021 ◽

Vol 31 (5) ◽

Author(s):

Amarjit Budhiraja ◽

Eric Friedlander ◽

Ruoyu Wu

Keyword(s):

Large Deviations ◽

Rare Events ◽

Join The Shortest Queue ◽

Shortest Queue

Multiple-server system with flexible arrivals

Advances in Applied Probability ◽

10.1239/aap/1324045695 ◽

2011 ◽

Vol 43 (4) ◽

pp. 985-1004 ◽

Cited By ~ 8

Author(s):

Osman T. Akgun ◽

Rhonda Righter ◽

Ronald Wolff

Keyword(s):

Finite Buffers ◽

Service Production ◽

Weak Majorization ◽

Service Rates ◽

Join The Shortest Queue ◽

Traffic Systems ◽

Shortest Queue ◽

Performance Gains ◽

Number Of Customers ◽

Server System

In many service, production, and traffic systems there are multiple types of customers requiring different types of ‘servers’, i.e. different services, products, or routes. Often, however, a proportion of the customers are flexible, i.e. they are willing to change their type in order to achieve faster service, and even if this proportion is small, it has the potential of achieving large performance gains. We generalize earlier results on the optimality of ‘join the shortest queue’ (JSQ) for flexible arrivals to the following: arbitrary arrivals where only a subset are flexible, multiple-server stations, and abandonments. Surprisingly, with abandonments, the optimality of JSQ for minimizing the number of customers in the system depends on the relative abandonment and service rates. We extend our model to finite buffers and resequencing. We assume exponential service. Our optimality results are very strong; we minimize the queue length process in the weak majorization sense.

Subdiffusive Load Balancing in Time-Varying Queueing Systems

Operations Research ◽

10.1287/opre.2019.1851 ◽

2019 ◽

Vol 67 (6) ◽

pp. 1678-1698

Author(s):

Rami Atar ◽

Isaac Keslassy ◽

Gal Mendelson

Keyword(s):

Load Balancing ◽

Queue Length ◽

Queueing Systems ◽

Load Condition ◽

Service Rates ◽

Queue Lengths ◽

Join The Shortest Queue ◽

The Difference ◽

Shortest Queue ◽

Diffusion Scale

The degree to which delays or queue lengths equalize under load-balancing algorithms gives a good indication of their performance. Some of the most well-known results in this context are concerned with the asymptotic behavior of the delay or queue length at the diffusion scale under a critical load condition, where arrival and service rates do not vary with time. For example, under the join-the-shortest-queue policy, the queue length deviation process, defined as the difference between the greatest and smallest queue length as it varies over time, is at a smaller scale (subdiffusive) than that of queue lengths (diffusive).
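
The queue length deviation process defined in this abstract can be illustrated with a minimal sketch. The snapshots below are hypothetical values chosen for illustration (arrivals routed to a shortest queue, no departures), and the function name `deviation_process` is an assumption, not something taken from the paper.

```python
def deviation_process(snapshots):
    """For each snapshot of queue lengths, return the gap between
    the greatest and the smallest queue length at that time."""
    return [max(queues) - min(queues) for queues in snapshots]


# Hypothetical trajectory for three queues: each arrival joins a shortest queue.
snapshots = [
    [0, 0, 0],
    [1, 0, 0],
    [1, 1, 0],
    [1, 1, 1],
    [2, 1, 1],
]

print(deviation_process(snapshots))  # [0, 1, 1, 0, 1]
```

In this toy trajectory the deviation never exceeds 1 even as the queue lengths grow, which is the kind of equalization the abstract refers to.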

On the optimal assignment of customers to parallel servers

Journal of Applied Probability ◽

10.1017/s0021900200045678 ◽

1978 ◽

Vol 15 (02) ◽

pp. 406-413 ◽

Cited By ~ 24

Author(s):

Richard R. Weber

Keyword(s):

Stochastic Process ◽

Service Time ◽

Hazard Rate ◽

Random Variable ◽

Queuing System ◽

Optimal Assignment ◽

Parallel Servers ◽

Shortest Queue ◽

Number Of Customers

We consider a queuing system with several identical servers, each with its own queue. Identical customers arrive according to some stochastic process, and as each customer arrives it must be assigned to some server's queue. No jockeying amongst the queues is allowed. We are interested in assigning the arriving customers so as to maximize the number of customers that complete their service by a certain time. If each customer's service time is a random variable with a non-decreasing hazard rate, then the strategy which does this is one which assigns each arrival to the shortest queue.
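
The setting described in this abstract can be sketched as a small simulation. This is only an illustration under stated assumptions, not the paper's model or proof: arrivals are taken to be Poisson and service times exponential (a constant, hence non-decreasing, hazard rate), and the names `shortest_queue`, `simulate`, `jsq_policy` and `random_policy`, as well as all parameter values, are hypothetical.

```python
import random


def shortest_queue(queue_lengths):
    """Return the index of a shortest queue (lowest index wins ties)."""
    return min(range(len(queue_lengths)), key=lambda i: queue_lengths[i])


def jsq_policy(queues, rng):
    """Assign the arrival to a shortest queue."""
    return shortest_queue(queues)


def random_policy(queues, rng):
    """Baseline for comparison: assign the arrival uniformly at random."""
    return rng.randrange(len(queues))


def simulate(policy, num_servers=3, arrival_rate=2.5, service_rate=1.0,
             horizon=200.0, seed=0):
    """Count the customers who complete service by time `horizon`.

    Assumed model: Poisson arrivals, exponential service, one queue per
    server, no jockeying between queues.
    """
    rng = random.Random(seed)
    queues = [0] * num_servers  # customers at each server, including in service
    t = 0.0
    completed = 0
    while True:
        busy = [i for i, q in enumerate(queues) if q > 0]
        total_rate = arrival_rate + service_rate * len(busy)
        t += rng.expovariate(total_rate)  # time until the next event
        if t > horizon:
            return completed
        if rng.random() < arrival_rate / total_rate:
            queues[policy(queues, rng)] += 1  # arrival joins the chosen queue
        else:
            queues[rng.choice(busy)] -= 1  # a busy server finishes a customer
            completed += 1


if __name__ == "__main__":
    print("JSQ completions:   ", simulate(jsq_policy))
    print("Random completions:", simulate(random_policy))
```

A single run shows one sample path only; comparing the two policies meaningfully would require averaging over many seeds.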
