Abstract
We study service rate control problems for an M/M/1 queue with server breakdowns in which the breakdown rate is assumed to be a function of the service rate. Assuming that the queue has infinite capacity, we first establish the optimality equations for the discounted cost problem and characterize the optimal rate control policies. Then, we characterize the ergodicity of the controlled queue and establish the optimality conditions for the average-cost (ergodic) control problem using the vanishing discounted method. We next study the ergodic control problem when the queue has a finite capacity and establish a verification theorem by directly involving the stationary distribution of the controlled Markov process. For practical applications, we consider the adaptive service rate control problem for the model with finite capacity. Studying this problem is useful because the relationship between the server breakdown rate and the service rate is costly to observe in practice. We propose an adaptive (self-tuning) control algorithm, assuming that the relationship between the server breakdown rate and the service rate is linear with unknown parameters. We prove that the regret vanishes under the algorithm and the proposed policies are asymptotically optimal. In addition, numerical experiments are conducted to validate the algorithm.
Similar content being viewed by others
References
Abernethy, J., Bartlett, P.L., Hazan, E.: Blackwell approachability and no-regret learning are equivalent. In: Proceedings of the 24th Annual Conference on Learning Theory, pp. 27–46 (2011)
Adusumilli, K.M., Hasenbein, J.J.: Dynamic admission and service rate control of a queue. Queueing Syst. 66(2), 131–154 (2010)
Alvarez-Mena, J., Hernández-Lerma, O.: Convergence and approximation of optimization problems. SIAM J. Optim. 15(2), 527–539 (2004)
Arapostathis, A., Pang, G., Zheng, Y.: Optimal scheduling of critically loaded multiclass GI/M/n+M queues in an alternating renewal environment. Appl. Math. Optim. 84(2), 1857–1901 (2021)
Ata, B.: Dynamic power control in a wireless static channel subject to a quality-of-service constraint. Oper. Res. 53(5), 842–851 (2005)
Ata, B., Shneorson, S.: Dynamic control of an M/M/1 service system with adjustable arrival and service rates. Manage. Sci. 52(11), 1778–1791 (2006)
Avi-Itzhak, B., Naor, P.: Some queuing problems with the service station subject to breakdown. Oper. Res. 11, 303–320 (1963)
Badian-Pessot, P., Lewis, M.E., Down, D.G.: Optimal control policies for an M/M/1 queue with a removable server and dynamic service rates. Probab. Engrg. Inf. Sci. 35(2), 189–209 (2021)
Besbes, O., Zeevi, A.: On the (surprising) sufficiency of linear models for dynamic pricing with demand learning. Manage. Sci. 61(4), 723–739 (2015)
Besbes, O., Zeevi, A.: Dynamic pricing without knowing the demand function: risk bounds and near-optimal algorithms. Oper. Res. 57, 1407–1420 (2009)
Blackwell, D.: An analog of the minimax theorem for vector payoffs. Pac. J. Math. 6(1), 1–8 (1956)
Borkar, V., Varaiya, P.: Adaptive control of Markov chains. I. Finite parameter set. IEEE Trans. Automat. Control 24(6), 953–957 (1979)
Borkar, V.: Recursive self-tuning control of finite markov chains. Applicationes Mathematicae 24(2), 169–188 (1997)
Borkar, V., Varaiya, P.: Identification and adaptive control of Markov chains. SIAM J. Control. Optim. 20(4), 470–489 (1982)
Borkar, V.: The Kumar-Becker-Lin scheme revisited. J. Optim. Theory Appl. 66, 289–309 (1990)
Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press (2006)
Chelst, K., Tilles, A.Z., Pipis, J.S.: A coal unloader: a finite queueing system with breakdowns. INFORMS J. Appl. Anal. 11(5), 12–25 (1981)
Chen, B., Chao, X., Shi, C.: Nonparametric learning algorithms for joint pricing and inventory control with lost sales and censored demand. Math. Oper. Res. 46(2), 726–756 (2021)
Chen, Q., Jasin, S., Duenyas, I.: Nonparametric self-adjusting control for joint learning and optimization of multiproduct pricing with finite resource capacity. Math. Oper. Res. 44(2), 601–631 (2019)
Chen, X., Yunan, L., Hong, G.: An online learning approach to dynamic pricing and capacity sizing in service systems. Oper. Res. Forthcom. (2023). https://doi.org/10.1287/opre.2020.612
Crites, R.H., Barto, A.G.: Improving elevator performance using reinforcement learning. Adv. Neural. Inf. Process. Syst. 690, 1017–1023 (1996)
den Boer, A., Zwart, B.: Simultaneously learning and optimizing using controlled variance pricing. Manage. Sci. 60(3), 770–783 (2014)
den Boer, A.V., Zwart, B.: Mean square convergence rates for maximum quasi-likelihood estimators. Stoch. Syst. 4(2), 375–403 (2014)
Drabik, E., Stettner, L.: On adaptive control of markov chains using nonparametric estimation. Applicationes Mathematicae 27(2), 143–152 (2000)
Duncan, T., Pasik-Duncan, B., Stettner, L.: Adaptive control of a partially observed discrete time markov process. Appl. Math. Optim. 37, 269–293 (1998)
Federgruen, A., So, K.C.: Optimal time to repair a broken server. Adv. Appl. Probab. 21(2), 376–397 (1989)
Feinberg, E.A., Lewis, M.E.: Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem. Math. Oper. Res. 32(4), 769–783 (2007)
Gaver, D.P., Jr.: A waiting line with interrupted service, including priorities. J. Roy. Statist. Soc. Ser. B 24, 73–90 (1962)
George, J.M., Harrison, J.M.: Dynamic control of a queue with adjustable service rate. Oper. Res. 49(5), 720–731 (2001)
Ghosh, A.P., Weerasinghe, A.P.: Optimal buffer size and dynamic rate control for a queueing system with impatient customers in heavy traffic. Stoch. Process. Appl. 120(11), 2103–2141 (2010)
Groover, M.P.: Fundamentals of Modern Manufacturing: Materials, Processes, and Systems. Wiley (2010)
He, Q.-M.: Fundamentals of Matrix-Analytic Methods. Springer, New York (2014)
Hernández-Lerma, O., Marcus, S.I.: Adaptive control of service in queueing systems. Syst. Control Lett. 3(5), 283–289 (1983)
Hernàndez-Lerma, O., Marcus, S.I.: Optimal adaptive control of priority assignment in queueing systems. Syst. Control Lett. 4(2), 65–72 (1984)
Heyde, C.C.: Quasi-Likelihood and its Application. A General Approach to Optimal Parameter Estimation. Springer Series in Statistics, Springer, New York (1997)
Keskin, N.B., Zeevi, A.: Dynamic pricing with an unknown demand model: asymptotically optimal semi-myopic policies. Oper. Res. 62(5), 1142–1167 (2014)
Koçağa, Y.L.: An approximating diffusion control problem for dynamic admission and service rate control in a G/M/N + G queue. Oper. Res. Lett. 45(6), 538–542 (2017)
Koçağa, Y.L., Ward, A.R.: Admission control for a multi-server queue with abandonment. Queueing Syst. 65, 275–323 (2010)
Kruglov, V.M.: Strong law of large numbers, Vol. 3. In: Stability Problems for Stochastic Models: Proceedings of the Fifteenth Perm Seminar Perm, Russia, June 2-6, 1992, pp. 139–150 (2020)
Kumar, P.R., Lin, W.: Optimal adaptive controllers for unknown Markov chains. IEEE Trans. Automat. Control 27(4), 765–774 (1982)
Kumar, R., Lewis, M.E., Topaloglu, H.: Dynamic service rate control for a single-server queue with Markovmodulated arrivals. Naval Res. Logist. 60(8), 661–677 (2013)
Kuo, C.-C., Wang, K.-H., Lee, S.-L.: Optimal control of the p, N-policy M/G/1 queue with server breakdowns and general startup times. Int. J. Inf. Manage. Sci. 20(4), 565–577 (2009)
Lee, N., Kulkarni, V.G.: Optimal arrival rate and service rate control of multi-server queues. Queueing Syst. 76, 37–50 (2014)
Lippman, S.A.: Semi-Markov decision processes with unbounded rewards. Manage. Sci. 19, 717–731 (1972/73)
Lippman, S.A.: Applying a new device in the optimization of exponential queuing systems. Oper. Res. 23(4), 687–710 (1975)
Liu, B., Xie, Q., Modiano, E.: Reinforcement learning for optimal control of queueing systems. In: 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp. 663–670 (2019)
Mandl, P.: On self-optimizing control of Markov processes, Mathematical Control Theory, pp. 345–360 (1985)
Neuts, M.F.: Matrix-geometric solutions in stochastic models. An algorithmic approach, Corrected reprint of the 1981 original. Dover Publications, Inc., New York (1994)
Pang, G., Whitt, W.: Heavy-traffic limits for many-server queues with service interruptions. Queueing Syst. 61(2–3), 167–202 (2009)
Pang, G., Whitt, W.: Service interruptions in large-scale service systems. Manage. Sci. 55(9), 1499–1512 (2009)
Raeis, M., Tizghadam, A., Leon-Garcia, A.: Reinforcement learning-based admission control in delay-sensitive service systems. In: GLOBECOM 2020-2020 IEEE Global Communications Conference, pp. 1–6 (2020)
Sennott, L.I.: Average cost semi-Markov decision processes and the control of queueing systems. Probab. Eng. Inf. Sci. 3(2), 247–272 (1989)
Shi, C., Chen, W., Duenyas, I.: Technical note-nonparametric data-driven algorithms for multiproduct inventory systems with censored demand. Oper. Res. 64(2), 362–370 (2016)
Shwartz, A., Makowski, A.M.: An optimal adaptive scheme for two competing queues with constraints. In: Analysis and Optimization of Systems: Proceedings of the Seventh International Conference on Analysis and Optimization of Systems, Antibes, June 25–27, pp. 515–532 (1986)
Stettner, L.: On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model. Appl. Math. Optim. 27, 161–177 (1993)
Walton, N., Xu, K.: Learning and information in stochastic networks and queues. Informs Tutorials in Operations Research, pp. 161–198 (2021)
Weerasinghe, A.: Diffusion approximations for G/M/n + GI queues with state-dependent service rates. Math. Oper. Res. 39(1), 207–228 (2014)
White, H., Christie, L.S.: Queuing with preemptive priorities or with breakdown. Oper. Res. 6, 79–95 (1958)
Yeh, L.: A finite algorithm for optimal solutions of adaptive queueing control. J. Math. Anal. Appl. 125(1), 218–233 (1987)
Acknowledgements
This work was supported in part by the US National Science Foundation grants DMS-1715875 and DMS-2108683/2216765, and Army Research Office grant W911NF-17-1-0019. The authors are grateful for the helpful comments from the associate editors and reviewers that have improved the exposition of the results in the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zheng, Y., Julaiti, J. & Pang, G. Adaptive service rate control of an M/M/1 queue with server breakdowns. Queueing Syst 106, 159–191 (2024). https://doi.org/10.1007/s11134-023-09900-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11134-023-09900-z
Keywords
- M/M/1 queue
- Finite or infinite capacity
- Server breakdowns
- Service rate control
- Adaptive (self-tuning) control
- Markov decision process
- Discounted and long-run average (ergodic) cost criteria