Auto-Scaling in Data Stream Processing Applications: A Model-Based Reinforcement Learning Approach

Cardellini, Valeria; Lo Presti, Francesco; Nardelli, Matteo; Russo Russo, Gabriele

doi:10.1007/978-3-319-91632-3_8

Valeria Cardellini¹²,
Francesco Lo Presti¹²,
Matteo Nardelli¹² &
…
Gabriele Russo Russo¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 825))

Included in the following conference series:

Workshop on New Frontiers in Quantitative Methods in Informatics

365 Accesses
3 Citations

Abstract

By exploiting on-the-fly computation, Data Stream Processing (DSP) applications can process huge volumes of data in a near real-time fashion. Adapting the application parallelism at run-time is critical in order to guarantee a proper level of QoS in face of varying workloads. In this paper, we consider Reinforcement Learning based techniques in order to self-configure the number of parallel instances for a single DSP operator. Specifically, we propose two model-based approaches and compare them to the baseline Q-learning algorithm. Our numerical investigations show that the proposed solutions provide better performance and faster convergence than the baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Since we assume the action to be executed at the beginning of a time period, the number of instances during an interval is \(k+a\).
2.
http://chriswhong.com/open-data/foil_nyc_taxi/.

References

Cardellini, V., Lo Presti, F., Nardelli, M., Russo Russo, G.: Optimal operator deployment and replication for elastic distributed data stream processing. Concurr. Comput. 30(9), e4334 (2018). https://doi.org/10.1002/cpe.4334
Article Google Scholar
De Matteis, T., Mencagli, G.: Elastic scaling for distributed latency-sensitive data stream operators. In: Proceedings of PDP 2017, pp. 61–68 (2017)
Google Scholar
Fernandez, R.C., Migliavacca, M., Kalyvianaki, E., Pietzuch, P.: Integrating scale out and fault tolerance in stream processing using operator state management. In: Proceedings of ACM SIGMOD 2013, pp. 725–736 (2013)
Google Scholar
Gedik, B., Schneider, S., Hirzel, M., Wu, K.L.: Elastic scaling for data stream processing. IEEE Trans. Parallel Distrib. Syst. 25(6), 1447–1463 (2014)
Article Google Scholar
Heinze, T., Pappalardo, V., Jerzak, Z., Fetzer, C.: Auto-scaling techniques for elastic data stream processing. In: Proceedings of IEEE ICDEW 2014, pp. 296–302 (2014). https://doi.org/10.1109/ICDEW.2014.6818344
Heinze, T., Aniello, L., Querzoni, L., Jerzak, Z.: Cloud-based data stream processing. In: Proceedings of ACM DEBS 2014, pp. 238–245 (2014)
Google Scholar
Hirzel, M., Soulé, R., Schneider, S., Gedik, B., Grimm, R.: A catalog of stream processing optimizations. ACM Comput. Surv. 46(4), 46:1–46:34 (2014)
Article Google Scholar
Lohrmann, B., Janacik, P., Kao, O.: Elastic stream processing with latency guarantees. In: Proceedings of IEEE ICDCS 2015, pp. 399–410 (2015)
Google Scholar
Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 12(4), 559–592 (2014). https://doi.org/10.1007/s10723-014-9314-7
Article Google Scholar
Mastronarde, N., van der Schaar, M.: Fast reinforcement learning for energy-efficient wireless communication. IEEE Trans. Signal Process. 59(12), 6262–6266 (2011). https://doi.org/10.1109/TSP.2011.2165211
Article MathSciNet Google Scholar
Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York (2014)
MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Tesauro, G., Jong, N.K., Das, R., Bennani, M.N.: On the use of hybrid reinforcement learning for autonomic resource allocation. Cluster Comput. 10(3), 287–299 (2007). https://doi.org/10.1007/s10586-007-0035-6
Article Google Scholar
Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3–4), 279–292 (1992). https://doi.org/10.1007/BF00992698
Article MATH Google Scholar
Yoon, K.P., Hwang, C.L.: Multiple Attribute Decision Making: An Introduction, vol. 104. Sage Publications, Thousand Oaks (1995)
Book Google Scholar

Download references

Author information

Authors and Affiliations

Department of Civil Engineering and Computer Science Engineering, University of Rome Tor Vergata, Rome, Italy
Valeria Cardellini, Francesco Lo Presti, Matteo Nardelli & Gabriele Russo Russo

Authors

Valeria Cardellini
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Lo Presti
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Nardelli
View author publications
You can also search for this author in PubMed Google Scholar
Gabriele Russo Russo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Valeria Cardellini .

Editor information

Editors and Affiliations

Ca’ Foscari University of Venice, Venice, Italy
Simonetta Balsamo
Ca’ Foscari University of Venice, Venice, Italy
Andrea Marin
University of Florence, Florence, Italy
Enrico Vicario

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cardellini, V., Lo Presti, F., Nardelli, M., Russo Russo, G. (2018). Auto-Scaling in Data Stream Processing Applications: A Model-Based Reinforcement Learning Approach. In: Balsamo, S., Marin, A., Vicario, E. (eds) New Frontiers in Quantitative Methods in Informatics. InfQ 2017. Communications in Computer and Information Science, vol 825. Springer, Cham. https://doi.org/10.1007/978-3-319-91632-3_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-91632-3_8
Published: 24 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91631-6
Online ISBN: 978-3-319-91632-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics