Abstract
We study a system of several identical servers in parallel, where a routing decision must be made immediately on a job’s arrival. Jobs arrive according to a Poisson process, with their processing times following a discrete distribution with finite support. The processing time of a job is known on arrival and may be used in the routing decision. We propose a policy consisting of multi-layered round robin routing followed by shortest remaining processing time scheduling at the servers. This policy is shown to have a heavy traffic limit that is identical to one in which there is a single queue (no routing) and optimal heavy traffic scheduling. In light traffic, we show that the performance of this policy is worse than round robin routing followed by shortest remaining processing time scheduling. We also quantify the difference between round robin and multi-layered round robin routing, which in turn yields insights on the relative importance of routing versus (local) scheduling in such systems.
Similar content being viewed by others
References
N. Bansal and M. Harchol-Balter, Analysis of SRPT scheduling: Investigating unfairness. Proceedings of ACM Sigmetrics ’01, 2001.
G. Ciardo, A. Riska and E. Smirni, EquiLoad: a load balancing policy for clustered Web servers. Performance Evaluation, 46 (2001), 101–124.
D. Down. On the stability of polling models with multiple servers. Journal of Applied Probability, 35:925–935, 1998.
D.G. Down and R. Wu. Scheduling distributed server systems with highly variable processing times. Proceedings of the 2003 International Symposium on Performance Evaluation of Computer and Telecommunications Systems (SPECTS ’03), Montreal, 2003.
A. Ephremides, P. Varaiya, and J. Walrand, A simple dynamic routing problem. IEEE Transactions on Automatic Control, AC-25 (1980), 690–693.
E.J. Friedman and S.G. Henderson, Fairness and efficiency in web server protocols. Proceedings of ACM Sigmetrics ’03, 2003.
D. Gamarnik and A. Zeevi, Validity of heavy traffic steady-state approximations in open queueing networks. Preprint.
M. Harchol-Balter, M.E. Crovella and C.D. Murta, On choosing a task assignment policy for a distributed server system. Journal of Parallel and Distributed Computing, 59 (1999), 204–228.
D.L. Iglehart and W. Whitt, Multiple channel queues in heavy traffic. I. Advances in Applied Probability, 2 (1970), 150–177.
D.L. Iglehart and W. Whitt, Multiple channel queues in heavy traffic. II: Sequences, networks, and batches. Advances in Applied Probability, 2 (1970), 355–369.
Z. Liu and R. Righter, Optimal load balancing on distributed homogeneous unreliable processors. Operations Research, 46 (1998), 563–573.
Z. Liu and D. Towsley, Optimality of the round robin routing policy. Journal of Applied Probability, 31 (1994), 466–475.
M.I. Reiman, Some diffusion approximations with state space collapse. In Lecture Notes in Control and Information Sciences, volume 60, pages 209–240, Springer, Berlin-New York, 1984.
M. I. Reiman and B. Simon, Open queueing systems in light traffic. Mathematics of Operations Research, 14 (1989), 26–59.
M. I. Reiman and B. Simon, Light traffic limits of sojourn time distributions in Markovian queueing networks. Stochastic Models, 4 (1998), 191–233.
R. Righter, J.G. Shanthikumar, and G. Yamazaki, On extremal service disciplines in single-stage queueing systems. Journal of Applied Probability, 27 (1990), 409–416.
A. Riska, W. Sun, E. Smirni and G. Ciardo, AdaptLoad: effective balancing in clustered Web servers under transient load conditions. Proceedings of 22nd International Conference on Distributed Computing Systems (ICDCS ’02), 2002.
L. Schrage, A proof of the optimality of the shortest remaining processing time discipline. Operations Research, 16 (1968), 687–690.
H. Takagi. Queueing Analysis: A Foundation of Performance Evaluation, Part 1. Vacation and Priority Systems, Elsevier, 1991.
A. Tanenbaum, Modern Operating Systems, Prentice Hall, 1992.
Henk C. Tijms. A First Course in Stochastic Models, Wiley, 2003.
V. Ungureanu, P.G. Bradford, M. Katehakis and B. Melamed, Deferred assignment scheduling in clustered Web servers. Technical Report DIMACS TR: 2002-41, Rutgers University, 2002.
V. Ungureanu, B. Melamed, P.G. Bradford and M. Katehakis, Class-dependent assignment in cluster-based servers. Proceedings of the 19th ACM Symposium on Applied Computing, Nicosia, Cyprus, 2004.
R.R. Weber, On the optimal assignment of customers to parallel servers. Journal of Applied Probability, 15 (1978), 406–413.
W. Whitt, Weak convergence theorems for priority queues: preemptive-resume discipline. Journal of Applied Probability, 8 (1971), 74–94.
W. Whitt, Deciding which queue to join: some counter examples. Operations Research, 34 (1986), 226–244.
W. Winston, Optimality of the shortest line discipline. Journal of Applied Probability, 14 (1977), 181–189.
Author information
Authors and Affiliations
Corresponding author
Additional information
AMS subject classifications: 68M20 · 60K25
(Work done while both authors were visitors at EURANDOM, P.O. Box 513, 5600 MB Eindhoven, The Netherlands.)
Rights and permissions
About this article
Cite this article
Down, D.G., Wu, R. Multi-layered round robin routing for parallel servers. Queueing Syst 53, 177–188 (2006). https://doi.org/10.1007/s11134-006-7419-9
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/s11134-006-7419-9