Skip to main content
Log in

Multi-layered round robin routing for parallel servers

  • Published:
Queueing Systems Aims and scope Submit manuscript

Abstract

We study a system of several identical servers in parallel, where a routing decision must be made immediately on a job’s arrival. Jobs arrive according to a Poisson process, with their processing times following a discrete distribution with finite support. The processing time of a job is known on arrival and may be used in the routing decision. We propose a policy consisting of multi-layered round robin routing followed by shortest remaining processing time scheduling at the servers. This policy is shown to have a heavy traffic limit that is identical to one in which there is a single queue (no routing) and optimal heavy traffic scheduling. In light traffic, we show that the performance of this policy is worse than round robin routing followed by shortest remaining processing time scheduling. We also quantify the difference between round robin and multi-layered round robin routing, which in turn yields insights on the relative importance of routing versus (local) scheduling in such systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. N. Bansal and M. Harchol-Balter, Analysis of SRPT scheduling: Investigating unfairness. Proceedings of ACM Sigmetrics ’01, 2001.

  2. G. Ciardo, A. Riska and E. Smirni, EquiLoad: a load balancing policy for clustered Web servers. Performance Evaluation, 46 (2001), 101–124.

    Article  Google Scholar 

  3. D. Down. On the stability of polling models with multiple servers. Journal of Applied Probability, 35:925–935, 1998.

    Article  Google Scholar 

  4. D.G. Down and R. Wu. Scheduling distributed server systems with highly variable processing times. Proceedings of the 2003 International Symposium on Performance Evaluation of Computer and Telecommunications Systems (SPECTS ’03), Montreal, 2003.

  5. A. Ephremides, P. Varaiya, and J. Walrand, A simple dynamic routing problem. IEEE Transactions on Automatic Control, AC-25 (1980), 690–693.

    Article  Google Scholar 

  6. E.J. Friedman and S.G. Henderson, Fairness and efficiency in web server protocols. Proceedings of ACM Sigmetrics ’03, 2003.

  7. D. Gamarnik and A. Zeevi, Validity of heavy traffic steady-state approximations in open queueing networks. Preprint.

  8. M. Harchol-Balter, M.E. Crovella and C.D. Murta, On choosing a task assignment policy for a distributed server system. Journal of Parallel and Distributed Computing, 59 (1999), 204–228.

    Article  Google Scholar 

  9. D.L. Iglehart and W. Whitt, Multiple channel queues in heavy traffic. I. Advances in Applied Probability, 2 (1970), 150–177.

    Article  Google Scholar 

  10. D.L. Iglehart and W. Whitt, Multiple channel queues in heavy traffic. II: Sequences, networks, and batches. Advances in Applied Probability, 2 (1970), 355–369.

    Article  Google Scholar 

  11. Z. Liu and R. Righter, Optimal load balancing on distributed homogeneous unreliable processors. Operations Research, 46 (1998), 563–573.

    Google Scholar 

  12. Z. Liu and D. Towsley, Optimality of the round robin routing policy. Journal of Applied Probability, 31 (1994), 466–475.

    Article  Google Scholar 

  13. M.I. Reiman, Some diffusion approximations with state space collapse. In Lecture Notes in Control and Information Sciences, volume 60, pages 209–240, Springer, Berlin-New York, 1984.

    Google Scholar 

  14. M. I. Reiman and B. Simon, Open queueing systems in light traffic. Mathematics of Operations Research, 14 (1989), 26–59.

    Article  Google Scholar 

  15. M. I. Reiman and B. Simon, Light traffic limits of sojourn time distributions in Markovian queueing networks. Stochastic Models, 4 (1998), 191–233.

    Google Scholar 

  16. R. Righter, J.G. Shanthikumar, and G. Yamazaki, On extremal service disciplines in single-stage queueing systems. Journal of Applied Probability, 27 (1990), 409–416.

    Article  Google Scholar 

  17. A. Riska, W. Sun, E. Smirni and G. Ciardo, AdaptLoad: effective balancing in clustered Web servers under transient load conditions. Proceedings of 22nd International Conference on Distributed Computing Systems (ICDCS ’02), 2002.

  18. L. Schrage, A proof of the optimality of the shortest remaining processing time discipline. Operations Research, 16 (1968), 687–690.

    Google Scholar 

  19. H. Takagi. Queueing Analysis: A Foundation of Performance Evaluation, Part 1. Vacation and Priority Systems, Elsevier, 1991.

  20. A. Tanenbaum, Modern Operating Systems, Prentice Hall, 1992.

  21. Henk C. Tijms. A First Course in Stochastic Models, Wiley, 2003.

  22. V. Ungureanu, P.G. Bradford, M. Katehakis and B. Melamed, Deferred assignment scheduling in clustered Web servers. Technical Report DIMACS TR: 2002-41, Rutgers University, 2002.

  23. V. Ungureanu, B. Melamed, P.G. Bradford and M. Katehakis, Class-dependent assignment in cluster-based servers. Proceedings of the 19th ACM Symposium on Applied Computing, Nicosia, Cyprus, 2004.

  24. R.R. Weber, On the optimal assignment of customers to parallel servers. Journal of Applied Probability, 15 (1978), 406–413.

    Article  Google Scholar 

  25. W. Whitt, Weak convergence theorems for priority queues: preemptive-resume discipline. Journal of Applied Probability, 8 (1971), 74–94.

    Article  Google Scholar 

  26. W. Whitt, Deciding which queue to join: some counter examples. Operations Research, 34 (1986), 226–244.

    Google Scholar 

  27. W. Winston, Optimality of the shortest line discipline. Journal of Applied Probability, 14 (1977), 181–189.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Douglas G. Down.

Additional information

AMS subject classifications: 68M20 · 60K25

(Work done while both authors were visitors at EURANDOM, P.O. Box 513, 5600 MB Eindhoven, The Netherlands.)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Down, D.G., Wu, R. Multi-layered round robin routing for parallel servers. Queueing Syst 53, 177–188 (2006). https://doi.org/10.1007/s11134-006-7419-9

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11134-006-7419-9

Keywords

Navigation