Skip to main content
Log in

A survey on skill-based routing with applications to service operations management

  • Published:
Queueing Systems Aims and scope Submit manuscript

Abstract

Service systems often feature multiple classes of customers with different service needs and multiple pools of servers with different skillsets. How to efficiently match customers of different classes with servers of different skillsets is of great importance to the management of these systems. In this survey, we review works on skill-based routing in queues. We first summarize key insights on routing/scheduling policies developed in the literature. We then discuss complications brought by modern service operations management problems, particularly healthcare systems. These complications stimulate a growing body of literature on new modeling and analysis tools. Lastly, we provide additional numerical experiments to highlight the complex nature of a routing problem motivated from hospital patient-flow management, and provide some useful intuition to develop good skill-based routing policies in practice. Our goal is to provide a brief overview of the skill-based routing research landscape and to help generate interesting research ideas.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Adan, I., Weiss, G.: A loss system with skill-based servers under assign to longest idle server policy. Probab. Eng. Inf. Sci. 26(3), 307–321 (2012)

    Google Scholar 

  2. Adan, I., Weiss, G.: A skill based parallel service system under FCFS-ALIS: steady state, overloads, and abandonments. Stoch. Syst. 4(1), 250–299 (2014)

    Google Scholar 

  3. Argon, N., Ziya, S.: Priority assignment under imperfect information on customer type identities. Manuf. Serv. Oper. Manag. 11(4), 674–693 (2009)

    Google Scholar 

  4. Armony, M.: Dynamic routing in large-scale service systems with heterogeneous servers. Queueing Syst. 51(3–4), 287–329 (2005)

    Google Scholar 

  5. Armony, M., Bambos, N.: Queueing dynamics and maximal throughput scheduling in switched processing systems. Queueing Syst. 44(3), 209–252 (2003)

    Google Scholar 

  6. Armony, M., Israelit, S., Mandelbaum, A., Marmor, Y.N., Tseytlin, Y., Yom-Tov, G.B.: On patient flow in hospitals: a data-based queueing-science perspective. Stochas. Syst. 5(1), 146–194 (2015)

    Google Scholar 

  7. Armony, M., Ward, A.R.: Fair dynamic routing in large-scale heterogeneous-server systems. Oper. Res. 58(3), 624–637 (2010)

    Google Scholar 

  8. Ata, B., Peng, X.: An optimal callback policy for general arrival processes: a pathwise analysis. Oper. Res. 68(2), 327–347 (2020)

    Google Scholar 

  9. Atar, R.: Scheduling control for queueing systems with many servers: asymptotic optimality in heavy traffic. Ann. Appl. Probab. 15(4), 2606–2650 (2005)

    Google Scholar 

  10. Atar, R.: A diffusion regime with nondegenerate slowdown. Oper. Res. 60(2), 490–500 (2012)

    Google Scholar 

  11. Atar, R., Mandelbaum, A., Reiman, M.I.: Scheduling a multi class queue with many exponential servers: asymptotic optimality in heavy traffic. Ann. Appl. Probab. 14(3), 1084–1134 (2004)

    Google Scholar 

  12. Balseiro, S., Brown, D.B., Chen, C.: Dynamic pricing of relocating resources in large networks. Manage. Sci. (forthcoming) (2020)

  13. Bassamboo, A., Harrison, J.M., Zeevi, A.: Design and control of a large call center: asymptotic analysis of an lp-based method. Oper. Res. 54(3), 419–435 (2006)

    Google Scholar 

  14. Bassamboo, A., Randhawa, R.: Scheduling homogeneous impatient customers. Manag. Sci. 62(7), 2129–2147 (2016)

    Google Scholar 

  15. Bassamboo, A., Zeevi, A.: On a data-driven method for staffing large call centers. Oper. Res. 57(3), 714–726 (2009)

    Google Scholar 

  16. Bell, S.L., Williams, R.J.: Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy. Ann. Appl. Probab. 11(3), 608–649 (2001)

    Google Scholar 

  17. Best, T.J., Sandçı, B., Eisenstein, D.D., Meltzer, D.O.: Managing hospital inpatient bed capacity through partitioning care into focused wings. Manuf. Serv. Oper. Manag. 17(2), 157–176 (2015)

    Google Scholar 

  18. Bonald, T., Proutiere, A.: Insensitive bandwidth sharing in data networks. Queueing Syst. 44(1), 69–100 (2003)

    Google Scholar 

  19. Borst, S., Mandelbaum, A., Reiman, M.I.: Dimensioning large call centers. Oper. Res. 52(1), 17–34 (2004)

    Google Scholar 

  20. Braverman, A., Gurvich, I., Huang, J.: On the Taylor expansion of value functions. Oper. Res. 68(2), 631–654 (2020)

    Google Scholar 

  21. Buyukkoc, C., Varaiya, P., Walrand, J.: The c\(\mu \) rule revisited. Adv. Appl. Probab. 17(1), 237–238 (1985)

    Google Scholar 

  22. Chalfin, D., Trzeciak, S., Likourezos, A., Baumann, B., Dellinger, R.: Impact of delayed transfer of critically ill patients from the emergency department to the intensive care unit. Crit. Care Med. 35, 1477–1483 (2007)

    Google Scholar 

  23. Chan, C.W., Farias, V.F., Bambos, N., Escobar, G.J.: Optimizing intensive care unit discharge decisions with patient readmissions. Oper. Res. 60(6), 1323–1341 (2012)

    Google Scholar 

  24. Chan, C.W., Farias, V.F., Escobar, G.J.: The impact of delays on service times in the intensive care unit. Manag. Sci. 63(7), 2049–2072 (2017)

    Google Scholar 

  25. Chan, C.W., Yom-Tov, G., Escobar, G.J.: When to use speedup: an examination of service systems with returns. Oper. Res. 62(2), 462–482 (2014)

    Google Scholar 

  26. Chen, Y., Dong, J.: Scheduling with service-time information: the power of two priority classes. Working paper (2019)

  27. Cox, D.R., Smith, W.L.: Queues. Methuen, London (1961)

    Google Scholar 

  28. Dai, J., Shi, P.: Inpatient bed overflow: an approximate dynamic programming approach. Manuf. Serv. Oper. Manag. 21(4), 894–911 (2019)

  29. Dai, J.G., Lin, W.: Maximum pressure policies in stochastic processing networks. Oper. Res. 53(2), 197–218 (2005)

    Google Scholar 

  30. Dai, J.G., Lin, W.: Asymptotic optimality of maximum pressure policies in stochastic processing networks. Ann. Appl. Probab. 18(6), 2239–2299 (2008)

    Google Scholar 

  31. Dai, J.G., Tezcan, T.: Optimal control of parallel server systems with many servers in heavy traffic. Queueing Syst. 59(2), 95 (2008)

    Google Scholar 

  32. Dai, J.G., Tezcan, T.: State space collapse in many-server diffusion limits of parallel server systems. Math. Oper. Res. 36(2), 271–320 (2011)

    Google Scholar 

  33. Daniel Adelman, A.J.M.: Relaxations of weakly coupled stochastic dynamic programs. Oper. Res. 56(3), 712–727 (2008)

    Google Scholar 

  34. Delana, K., Savva, N., Tezcan, T.: Proactive customer service: operational benefits and economic frictions. Manuf. Serv. Oper. Manag., Articles in Advance (2020)

  35. Dong, J., Feldman, P., Yom-Tov, G.B.: Service systems with slowdowns: potential failures and proposed solutions. Oper. Res. 63(2), 305–324 (2015)

    Google Scholar 

  36. Dong, J., Perry, O.: Queueing models for patient-flow dynamics in inpatient wards. Oper. Res. 68(1), 250–275 (2020)

    Google Scholar 

  37. Dong, J., Shi, P., Zheng, F., Jin, X.: Off-service placement in inpatient ward network: resource pooling versus service slowdown. Working paper (2019)

  38. Foley, R.D., McDonald, D.R.: Join the shortest queue: stability and exact asymptotics. Ann. Appl. Probab. 11(3), 569–607 (2001)

  39. Foss, S., Chernova, N.: On the stability of a partially accessible multi-station queue with state-dependent routing. Queueing Syst. 29, 55–73 (1998)

    Google Scholar 

  40. Gans, N., Koole, G., Mandelbaum, A.: Telephone call centers: tutorial, review, and research prospects. Manuf. Serv. Oper. Manag. 5(2), 79–141 (2003)

    Google Scholar 

  41. Gardner, K., Righter, R.: Product forms for FCFS queueing models with arbitrary server-job compatibilities: an overview. In: Queueing Systems (2020)

  42. Gardner, K., Zbarsky, S., Doroudi, S., Harchol-Balter, M., Hyytiä, E., Scheller-Wolf, A.: Queueing with redundant requests: exact analysis. Queueing Syst. 83(3), 227–259 (2016)

    Google Scholar 

  43. Garnett, O., Mandelbaum, A.: An introduction to skills-based routing and its operational complexities. Teaching Note, Technion, Haifa, Israel (2000)

  44. Garnett, O., Mandelbaum, A., Reiman, M.: Designing a call center with impatient customers. Manuf. Serv. Oper. Manag. 4(3), 208–227 (2002)

    Google Scholar 

  45. Ghamami, S., Ward, A.R.: Dynamic scheduling of a two-server parallel server system with complete resource pooling and reneging in heavy traffic: Asymptotic optimality of a two-threshold policy. Math. Oper. Res. 38(4), 761–824 (2013)

    Google Scholar 

  46. Gurvich, I., Luedtke, J., Tezcan, T.: Staffing call centers with uncertain demand forecasts: a chance-constrained optimization approach. Manag. Sci. 56(7), 1093–1115 (2010)

    Google Scholar 

  47. Gurvich, I., Van Meghem, J.: Collaboration and multitasking in networks: prioritization and achievable capacity. Manag. Sci. 64(5), 2390–2406 (2018)

    Google Scholar 

  48. Gurvich, I., Ward, A.R.: On the dynamic control of matching queues. Stochas. Syst. 4(2), 479–523 (2014)

    Google Scholar 

  49. Gurvich, I., Whitt, W.: Scheduling flexible servers with convex delay costs in many-server service systems. Manuf. Serv. Oper. Manag. 11(2), 237–253 (2008)

    Google Scholar 

  50. Gurvich, I., Whitt, W.: Queue-and-idleness-ratio controls in many-server service systems. Math. Oper. Res. 34(2), 363–396 (2009)

    Google Scholar 

  51. Gurvich, I., Whitt, W.: Service-level differentiation in many-server service systems via queue-ratio routing. Oper. Res. 58(2), 316–328 (2010)

    Google Scholar 

  52. Halfin, S., Whitt, W.: Heavy-traffic limits for queues with many exponential servers. Oper. Res. 29(3), 567–588 (1981)

    Google Scholar 

  53. Harrison, J.M.: Heavy traffic analysis of a system with parallel servers: asymptotic optimality of discrete-review policies. Ann. Appl. Probab. 8(3), 822–848 (1998)

    Google Scholar 

  54. Harrison, J.M., López, M.J.: Heavy traffic resource pooling in parallel-server systems. Queueing Syst. 33(4), 339–368 (1999)

    Google Scholar 

  55. Harrison, J.M., Zeevi, A.: Dynamic scheduling of a multiclass queue in the Halfin–Whitt heavy traffic regime. Oper. Res. 52(2), 243–257 (2004)

    Google Scholar 

  56. Hu, Y., Chan, C.W., Dong, J.: Optimal scheduling of proactive service with customer deterioration and improvement. Manage. Sci. (forthcoming) (2020)

  57. Huang, J., Carmeli, B., Mandelbaum, A.: Control of patient flow in emergency departments, or multiclass queues with deadlines and feedback. Oper. Res. 63(4), 892–908 (2015)

    Google Scholar 

  58. Iglehart, D.L., Whitt, W.: Multiple channel queues in heavy traffic. II: sequences, networks, and batches. Adv. Appl. Probab. 2(2), 355–369 (1970)

    Google Scholar 

  59. Jaeker, J.A.B., Tucker, A.L.: Past the point of speeding up: the negative effects of workload saturation on efficiency and patient severity. Manag. Sci. 63(4), 1042–1062 (2017)

    Google Scholar 

  60. Kc, D.S., Terwiesch, C.: An econometric analysis of patient flows in the cardiac intensive care unit. Manuf. Serv. Oper. Manag. 14(1), 50–65 (2012)

    Google Scholar 

  61. Kc, D.S., Terwiesch, C.: Impact of workload on service time and patient safety: an econometric analysis of hospital operations. Manag. Sci. 55(9), 1486–1498 (2009)

    Google Scholar 

  62. Krishnasamy, S., Sen, R., Johari, R., Shakkottai, S.: Learning unknown service rates in queues: a multi-armed bandit approach. Oper. Res. Articles in Advance (2020)

  63. Lu, Y., Xie, Q., Kliot, G., Geller, A., Larus, J.R., Greenberg, A.: Join-idle-queue: a novel load balancing algorithm for dynamically scalable web services. Perform. Eval. 68(11), 1056–1071 (2011)

    Google Scholar 

  64. Luo, J., Zhang, J.: Staffing and control of instant messaging contact centers. Oper. Res. 61(2), 328–343 (2013)

    Google Scholar 

  65. Mandelbaum, A., Momcilovic, P.: Personalized queues: the customer view, via a fluid model of serving least-patient first. Queueing Syst. 87, 1–31 (2017)

    Google Scholar 

  66. Mandelbaum, A., Stolyar, A.L.: Scheduling flexible servers with convex delay costs: heavy-traffic optimality of the generalized c\(\mu \)-rule. Oper. Res. 52(6), 836–855 (2004)

    Google Scholar 

  67. van Mieghem, J.A.: Dynamic scheduling with convex delay costs: the generalized \(c\mu \) rule. Ann. Appl. Probab. 5(3), 809–833 (1995)

    Google Scholar 

  68. Mitzenmacher, M.: The power of two choices in randomized load balancing. IEEE Trans. Parallel Distrib. Syst. 12(10), 1094–1104 (2001)

    Google Scholar 

  69. Örmeci, E.L., Güneş, E.D., Kunduzcu, D.: A modeling framework for control of preventive services. Manuf. Serv. Oper. Manag. 18(2), 227–244 (2015)

    Google Scholar 

  70. Perry, O., Whitt, W.: Responding to unexpected overloads in large-scale service systems. Manage Sci. 55(8), 1353–1367 (2009)

    Google Scholar 

  71. Perry, O., Whitt, W.: A fluid approximation for service systems responding to unexpected overloads. Oper. Res. 59(5), 1159–1170 (2011)

    Google Scholar 

  72. Puha, A.L., Ward, A.R.: Tutorial paper: scheduling an overloaded multiclass many-server queue with impatient customers. In: Tutorials in Operations Research (2019)

  73. Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley-Interscience, Hoboken (2005)

    Google Scholar 

  74. Reed, J.E., Ward, A.R.: Approximating the GI/GI/1+GI queue with a nonlinear drift diffusion: hazard rate scaling in heavy traffic. Math. Oper. Res. 33(3), 606–644 (2008)

    Google Scholar 

  75. Reiman, M.I.: Some diffusion approximations with state space collapse. In: Baccelli, F., Fayolle, G. (eds.) Modelling and Performance Evaluation Methodology. Lecture Notes in Control and Information Sciences, pp. 207–240. Springer, Berlin (1984)

    Google Scholar 

  76. Selen, J., Adan, I.J., Kulkarni, V.G., van Leeuwaarden, J.S.: The snowball effect of customer slowdown in critical many-server systems. Stochas. Models 32(3), 366–391 (2016)

    Google Scholar 

  77. Shi, P., Chou, M.C., Dai, J., Ding, D., Sim, J.: Models and insights for hospital inpatient operations: time-dependent ED boarding time. Manag. Sci. 62(1), 1–28 (2016)

    Google Scholar 

  78. Shi, P., Helm, J., Deglise-Hawkinson, J., Pan, J.: Timing it right: balancing inpatient congestion versus readmission risk at discharge. Oper. Res. (forthcoming) (2020)

  79. Song, H., Tucker, A., Graue, R., Moravick, S., Yang, J.: Capacity pooling in hospitals: the hidden consequences of off-service placement. Manag. Sci. 66(9), 3825–3842 (2020)

  80. Srikant, R., Ying, L.: Communication Networks: An Optimization, Control, and Stochastic Networks Perspective. Cambridge University Press, Cambridge (2013)

    Google Scholar 

  81. Stolyar, A.L.: MaxWeight scheduling in a generalized switch: state space collapse and workload minimization in heavy traffic. Ann. Appl. Probab. 14(1), 1–53 (2004)

    Google Scholar 

  82. Sun, Z., Argon, N., Ziya, S.: Patient triage and prioritization under austere conditions. Manag. Sci. 64(10), 4471–4489 (2018)

    Google Scholar 

  83. Tassiulas, L., Ephremides, A.: Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks. In: 29th IEEE Conference on Decision and Control, IEEE, pp. 2130–2132 (1990)

  84. Tezcan, T., Dai, J.G.: Dynamic control of N-systems with many servers: asymptotic optimality of a static priority policy in heavy traffic. Oper. Res. 58(1), 94–110 (2010)

    Google Scholar 

  85. Ward, A.R., Armony, M.: Blind fair routing in large-scale service systems with heterogeneous customers and servers. Oper. Res. 61(1), 228–243 (2013)

    Google Scholar 

  86. Ward, A.R., Glynn, P.W.: A diffusion approximation for a Markovian queue with reneging. Queueing Syst. 43, 103–128 (2003)

    Google Scholar 

  87. Whitt, W.: How multiserver queues scale with growing congestion-dependent demand. Oper. Res. 51(4), 531–542 (2003)

    Google Scholar 

  88. Wierman, A., Harchol-Balter, M.: Classifying scheduling policies with respect to unfairness in an M/GI/1. In: Proceedings of the 2003 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, pp. 238–249 (2003)

  89. Wierman, A., Nuyens, M.: Scheduling despite inexact job-size informaion. ACM SIGMETRICS Perform. Eval. Rev. 36, 25–36 (2008)

    Google Scholar 

  90. Woei, L., Kumar, P.: Optimal control of a queueing system with two heterogeneous servers. IEEE Trans. Autom. Control 29(8), 696–703 (1984)

    Google Scholar 

  91. Xu, K., Chan, C.: Using future information to reduce waiting times in the emergency department via diversion. Manuf. Serv. Oper. Manag. 18(3), 314–331 (2016)

    Google Scholar 

  92. Yom-Tov, G.B., Mandelbaum, A.: Erlang-r: a time-varying queue with reentrant customers, in support of healthcare staffing. Manuf. Serv. Oper. Manag. 16(2), 283–299 (2014)

    Google Scholar 

  93. Zhan, D., Ward, A.R.: Staffing, routing, and payment to trade off speed and quality in large service systems. Oper. Res. 67(6), 1738–1751 (2019)

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Kristen Gardner, Yoni Nazarathy, and the referee for their insightful suggestions. Support from NSF Grant CMMI-1762544 is gratefully acknowledged by J. Dong.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jing Dong.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, J., Dong, J. & Shi, P. A survey on skill-based routing with applications to service operations management. Queueing Syst 96, 53–82 (2020). https://doi.org/10.1007/s11134-020-09669-5

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11134-020-09669-5

Keywords

Mathematics Subject Classification

Navigation