Abstract
With the increasing number of created and deployed prediction models and the complexity of machine learning workflows we require so called model management systems to support data scientists in their tasks. In this work we describe our technological concept for such a model management system. This concept includes versioned storage of data, support for different machine learning algorithms, fine tuning of models, subsequent deployment of models and monitoring of model performance after deployment. We describe this concept with a close focus on model lifecycle requirements stemming from our industry application cases, but generalize key features that are relevant for all applications of machine learning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Chapman, P., et al.: CRISP-DM 1.0 step-by-step data mining guide (2000)
Cheng, Y., Qin, C., Rusu, F.: GLADE: big data analytics made easy. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, SIGMOD 2012, pp. 697–700. ACM, New York (2012)
Crankshaw, D., Wang, X., Zhou, G., Franklin, M.J., Gonzalez, J.E., Stoica, I.: Clipper: a low-latency online prediction serving system. In: 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17), pp. 613–627 (2017)
Interlandi, M., Matusevych, S., Amizadeh, S., Zahirazami, S., Weimer, M.: Machine learning at microsoft with ML.NET. In: NIPS 2018 Workshop MLOSS (2018)
Kumar, A., Boehm, M., Yang, J.: Data management in machine learning. In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 1717–1722. ACM Press (2017)
Miao, H., Chavan, A., Deshpande, A.: ProvDB: lifecycle management of collaborative analysis workflows. In: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics, HILDA 2017, pp. 7:1–7:6. ACM, New York (2017)
Prasad, S., et al.: Large-scale predictive analytics in vertica: fast data transfer, distributed model creation, and in-database prediction. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD 2015, pp. 1657–1668. ACM, New York (2015)
Vartak, M., et al.: ModelDB: a system for machine learning model management. In: Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2016, pp. 14:1–14:3. ACM, New York (2016)
Wang, H., Zaniolo, C., Luo, C.R.: ATLAS: a small but complete SQL extension for data mining and data streams. In: Proceedings 2003 VLDB Conference, pp. 1113–1116. Elsevier (2003)
Acknowledgement
The financial support by the Christian Doppler Research Association, the Austrian Federal Ministry for Digital and Economic Affairs and the National Foundation for Research, Technology and Development is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Bachinger, F., Kronberger, G. (2020). Concept for a Technical Infrastructure for Management of Predictive Models in Industrial Applications. In: Moreno-Díaz, R., Pichler, F., Quesada-Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2019. EUROCAST 2019. Lecture Notes in Computer Science(), vol 12013. Springer, Cham. https://doi.org/10.1007/978-3-030-45093-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-45093-9_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-45092-2
Online ISBN: 978-3-030-45093-9
eBook Packages: Computer ScienceComputer Science (R0)