References
Aalto, S.: Optimal control of batch service queues with finite service capacity and linear holding costs. Math. Meth. Oper. Res. 51(2), 263–285 (2000)
Crankshaw, D., Wang, X., Zhou, G., Franklin, M. J., Gonzalez, J. E., Stoica, I.: Clipper: a low-latency online prediction serving system. In Proc. of NSDI ’17, pp. 613–627 (2017)
Deb, R.K., Serfozo, R.F.: Optimal control of batch service queues. Adv. Appl. Prob. 5(2), 340–361 (1973)
Fowler, J.W., Mönch, L.: A survey of scheduling with parallel batch (p-batch) processing. Eur. J. Oper. Res. 298(1), 1–24 (2022)
Inoue, Y.: Queueing analysis of GPU-based inference servers with dynamic batching: A closed-form characterization. Perform. Eval. 147, 102183 (2021)
Papadaki, K.P., Powell, W.B.: Exploiting structure in adaptive dynamic programming algorithms for a stochastic batch service problem. Eur. J. Oper. Res. 142(1), 108–127 (2002)
Pepyne, D., Cassandras, C.: Optimal dispatching control for elevator systems during uppeak traffic. IEEE Trans. Cont. Sys. Tech. 5(6), 629–643 (1997)
Yao, C., Liu, W., Tang, W., Hu, S.: EAIS: Energy-aware adaptive scheduling for CNN inference on high-performance GPUs. Future Gen. Comput. Syst. 130, 253–268 (2022)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Inoue, Y. A load-balancing problem for distributed bulk-service queues with size-dependent batch processing times. Queueing Syst 100, 449–451 (2022). https://doi.org/10.1007/s11134-022-09794-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11134-022-09794-3