Asymptotics of insensitive load balancing and blocking phases

Matthieu Jonckheere, Balakrishna J. Prabhu
2017 Queueing Systems  
Load balancing with various types of load information has become a key component of modern communication and information systems. In many systems, a precise characterization of the blocking probability makes it possible to establish a performance trade-off between delay and losses. We address here the problem of giving robust performance bounds based on the study of the asymptotic behavior of insensitive load balancing schemes when the number of servers and the load scale jointly. These schemes have the desirable property that the stationary distribution of the resulting stochastic network depends on the distribution of job sizes only through its mean. It was shown that they give good estimates of performance indicators for systems with finite buffers, thereby generalizing Erlang's formula, whereas optimal policies are already theoretically and computationally out of reach for networks of moderate size. We study a single class of traffic acting on a symmetric set of processor sharing queues with finite buffers and we consider the case where the load scales with the number of servers. We characterize the response of symmetric systems under those schemes at different scales and show that three amplitudes of deviations can be identified according to whether $\rho < 1$, $\rho = 1$, or $\rho > 1$. A central limit scaling takes place for a sub-critical load; for $\rho = 1$, the number of free servers scales like $n^{\theta/(\theta+1)}$ ($\theta$ being the buffer depth and $n$ the number of servers), and it is of order 1 for super-critical loads. This further implies the existence of different phases for the blocking probability. Below a (refined) critical load $\rho_c(n) = 1 - a n^{-\theta/(\theta+1)}$, the blocking probability is exponentially small, and it becomes of order $n^{-\theta/(\theta+1)}$ at $\rho_c(n)$. This generalizes the well-known Quality and Efficiency Driven (QED) regime or Halfin-Whitt regime for a one-dimensional queue, and leads to a generalized staffing rule for a given target blocking probability.
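
As a rough numerical illustration of the critical-load scaling, the following Python sketch evaluates the refined critical load 1 − a·n^(−θ/(θ+1)) and checks the claimed order of the blocking probability in the special case θ = 1. It assumes, as a labeled simplification not taken from the abstract, that buffer depth θ = 1 corresponds to the classical Erlang loss system, so that blocking is given by the Erlang B formula. The function names `critical_load` and `erlang_b` and the choice a = 1 are hypothetical and used only for illustration.

```python
import math


def critical_load(n: int, theta: int, a: float) -> float:
    """Refined critical load rho_c(n) = 1 - a * n^(-theta / (theta + 1))."""
    return 1.0 - a * n ** (-theta / (theta + 1))


def erlang_b(servers: int, offered_load: float) -> float:
    """Erlang B blocking probability via the standard stable recursion.

    B(0) = 1 and B(k) = A * B(k-1) / (k + A * B(k-1)), with A the offered load.
    Assumption: used here as a stand-in for the theta = 1 case only.
    """
    blocking = 1.0
    for k in range(1, servers + 1):
        blocking = offered_load * blocking / (k + offered_load * blocking)
    return blocking


if __name__ == "__main__":
    a = 1.0      # hypothetical constant in the critical load
    theta = 1    # buffer depth; theta = 1 gives the n^(-1/2) (Halfin-Whitt) scaling
    for n in (100, 1_000, 10_000):
        rho = critical_load(n, theta, a)
        blocking = erlang_b(n, rho * n)
        # If blocking is of order n^(-theta/(theta+1)), this product stays bounded.
        scaled = blocking * n ** (theta / (theta + 1))
        print(f"n={n:6d}  rho_c={rho:.4f}  blocking={blocking:.5f}  scaled={scaled:.4f}")
```

Under these assumptions the rescaled blocking column should remain of order 1 as n grows, which is what an n^(−1/2) blocking probability at the critical load predicts. For θ > 1 the Erlang B formula no longer applies, and the stationary distribution of the insensitive scheme would be needed instead.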
doi:10.1007/s11134-017-9559-5 fatcat:tpfrgajtb5dp7iho4toli35jvi