Skip to main content

Architecting AI Deployment: A Systematic Review of State-of-the-Art and State-of-Practice Literature

  • Conference paper
  • First Online:
Software Business (ICSOB 2020)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 407))

Included in the following conference series:

Abstract

Companies across domains are rapidly engaged in shifting computational power and intelligence from centralized cloud to fully decentralized edges to maximize value delivery, strengthen security and reduce latency. However, most companies have only recently started pursuing this opportunity and are therefore at the early stage of the cloud-to-edge transition. To provide an overview of AI deployment in the context of edge/cloud/hybrid architectures, we conduct a systematic literature review and a grey literature review. To advance understanding of how to integrate, deploy, operationalize and evolve AI models, we derive a framework from existing literature to accelerate the end-to-end deployment process. The framework is organized into five phases: Design, Integration, Deployment, Operation and Evolution. We make an attempt to analyze the extracted results by comparing and contrasting them to derive insights. The contribution of the paper is threefold. First, we conduct a systematic literature review in which we review the contemporary scientific literature and provide a detailed overview of the state-of-the-art of AI deployment. Second, we review the grey literature and present the state-of-practice and experience of practitioners while deploying AI models. Third, we present a framework derived from existing literature for the end-to-end deployment process and attempt to compare and contrast SLR and GLR results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amershi, S., et al.: Software engineering for machine learning: a case study. In: 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pp. 291–300. IEEE, May 2019

    Google Scholar 

  2. Bernardi, L., Mavridis, T., Estevez, P.: 150 successful machine learning models: 6 lessons learned at booking. com. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1743–1751, July 2019

    Google Scholar 

  3. Sculley, D., et al.: Hidden technical debt in machine learning systems. In: Advances in Neural Information Processing Systems, pp. 2503–2511 (2015)

    Google Scholar 

  4. Dahlmeier, D.: On the challenges of translating NLP research into commercial products. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 92–96, July 2017

    Google Scholar 

  5. Crankshaw, D., Gonzalez, J., Bailis, P.: Research for practice: prediction-serving systems. Commun. ACM 61(8), 45–49 (2018)

    Article  Google Scholar 

  6. Hill, C., Bellamy, R., Erickson, T., Burnett, M.: Trials and tribulations of developers of intelligent systems: a field study. In: 2016 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC), pp. 162–170. IEEE, September 2016

    Google Scholar 

  7. Provost, F., Kohavi, R.: Guest editors’ introduction: on applied research in machine learning. Mach. Learn. 30(2–3), 127–132 (1998)

    Article  Google Scholar 

  8. Keele, S.: Guidelines for performing systematic literature reviews in software engineering, vol. 5. Technical report, Version 2.3 EBSE Technical Report, EBSE (2007)

    Google Scholar 

  9. Kitchenham, B.A., Budgen, D., Brereton, P.: Evidence-Based Software Engineering and Systematic Reviews, vol. 4. CRC Press, Boca Raton (2015)

    Google Scholar 

  10. Garousi, V., Felderer, M., Mäntylä, M.V.: The need for multivocal literature reviews in software engineering: complementing systematic literature reviews with grey literature. In: Proceedings of the 20th International Conference on Evaluation and Assessment in Software Engineering, pp. 1–6, June 2016

    Google Scholar 

  11. Williams, A.: Using reasoning markers to select the more rigorous software practitioners’ online content when searching for grey literature. In: Proceedings of the 22nd International Conference on Evaluation and Assessment in Software Engineering 2018, pp. 46–56, June 2018

    Google Scholar 

  12. Xiaojing, X.I.E., Govardhan, S.S.: A service mesh-based load balancing and task scheduling system for deep learning applications. In: 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), pp. 843–849. IEEE, May 2020

    Google Scholar 

  13. Serebryakov, S., Milojicic, D., Vassilieva, N., Fleischman, S., Clark, R.D.: Deep learning cookbook: recipes for your AI infrastructure and applications. In 2019 IEEE International Conference on Rebooting Computing (ICRC), pp. 1–9. IEEE, November 2019

    Google Scholar 

  14. Li, J., Li, J. and Zhang, H., 2018, August. Deep learning based parking prediction on cloud platform. In: 2018 4th International Conference on Big Data Computing and Communications (BIGCOM), pp. 132–137. IEEE

    Google Scholar 

  15. Brayford, D., Vallecorsa, S., Atanasov, A., Baruffa, F., Riviera, W.: Deploying AI frameworks on secure HPC systems with containers. In: 2019 IEEE High Performance Extreme Computing Conference (HPEC), pp. 1–6. IEEE, September 2019

    Google Scholar 

  16. Tuli, S., Basumatary, N., Buyya, R.: EdgeLens: deep learning based object detection in integrated IoT, fog and cloud computing environments. In: 2019 4th International Conference on Information Systems and Computer Networks (ISCON), pp. 496–502. IEEE, November 2019

    Google Scholar 

  17. Brumbaugh, E., et al.: Bighead: a framework-agnostic, end-to-end machine learning platform. In: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp. 551–560. IEEE, October 2019

    Google Scholar 

  18. Salza, P., Hemberg, E., Ferrucci, F., O’Reilly, U.M.: Towards evolutionary machine learning comparison, competition, and collaboration with a multi-cloud platform. In: Proceedings of the Genetic and Evolutionary Computation Conference Companion, pp. 1263–1270, July 2017

    Google Scholar 

  19. Gupta, N., Anantharaj, K., Subramani, K.: Containerized architecture for edge computing in smart home: a consistent architecture for model deployment. In: 2020 International Conference on Computer Communication and Informatics (ICCCI), pp. 1–8. IEEE, January 2020

    Google Scholar 

  20. Cartas, A., et al.: A reality check on inference at mobile networks edge. In: Proceedings of the 2nd International Workshop on Edge Systems, Analytics and Networking, pp. 54–59, March 2019

    Google Scholar 

  21. Zhang, X., Wang, Y., Lu, S., Liu, L., Shi, W.: OpenEI: an open framework for edge intelligence. In 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), pp. 1840–1851. IEEE, July 2019

    Google Scholar 

  22. Rovnyagin, M.M., Timofeev, K., Elenkin, A.A., Shipugin, V.A.: Cloud computing architecture for high-volume ML-based solutions. In: 2019 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus), pp. 315–318. IEEE, January 2019

    Google Scholar 

  23. Pääkkönen, P., Pakkala, D.: Extending reference architecture of big data systems towards machine learning in edge computing environments. J. Big Data 7, 1–29 (2020). https://doi.org/10.1186/s40537-020-00303-y

    Article  Google Scholar 

  24. Li, Y., Han, Z., Zhang, Q., Li, Z., Tan, H.: Automating cloud deployment for deep learning inference of real-time online services. In: IEEE INFOCOM 2020-IEEE Conference on Computer Communications, pp. 1668–1677. IEEE, July 2020

    Google Scholar 

  25. Data blueprint - IOA Knowledge Base. https://ioakb.com/fileattachment?file=b3O7X9NGQKO2OrG8ohmRvw%3D%3D&v=3

  26. Machine Learning Based Advanced Analytics using Intel Technology. https://media.bitpipe.com/io_14x/io_147787/item_1954603/machine-learning-based-advanced-analytics-using-intel-ForDistribution-2.pdf

  27. Next Generation Hybrid Cloud Data Analytics Solution. https://builders.intel.com/docs/datacenterbuilders/next-gen-hybrid-cloud-data-analytics-solution.pdf

  28. Getting Started with Machine Learning in the Cloud. https://www.oracle.com/a/ocom/docs/machine-learning-goes-to-the-cloud-ebook.pdf

  29. Machine Learning Lens-AWS Well-Architected Framework. https://d1.awsstatic.com/whitepapers/architecture/wellarchitected-Machine-Learning-Lens.pdf.Move the Algorithms, Not the Data

  30. https://www.oracle.com/technetwork/database/options/advanced-analytics/oml19c-white-paper-5601561.pdf

  31. Easterbrook, S., Singer, J., Storey, M.-A., Damian, D.: Selecting empirical methods for software engineering research. In: Shull, F., Singer, J., Sjoberg D.I.K. (eds.) Guide to Advanced Empirical Software Engineering, pp. 285–311. Springer, London (2008). https://doi.org/10.1007/978-1-84800-044-5_11

  32. Crankshaw, D., et al.: The missing piece in complex analytics: low latency, scalable model management and serving with velox. arXiv preprint arXiv:1409.3809, September 2014

  33. Bosch, J., Olsson, H.H., Crnkovic, I.: Engineering AI systems: a research agenda. In: Artificial Intelligence Paradigms for Smart Cyber-Physical Systems, pp. 1–19. IGI Global (2020)

    Google Scholar 

  34. Arpteg, A., Brinne, B., Crnkovic-Friis, L., Bosch, J.: Software engineering challenges of deep learning. In: 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 50–59. IEEE, August 2018

    Google Scholar 

  35. Baylor, D., et al.: TFX: a tensorflow-based production-scale machine learning platform. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1387–1395 (2017)

    Google Scholar 

  36. Crankshaw, D., et al.: The missing piece in complex analytics: low latency, scalable (2014)

    Google Scholar 

  37. Olston, C., et al.: Tensor flow-serving: flexible, high-performance ml serving. In: Workshop on ML Systems at NIPS (2017)

    Google Scholar 

  38. Breck, E., Cai, S., Nielsen, E., Salib, M., Sculley, D.: What’s your ML test score? A rubric for ML production systems (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Meenu Mary John .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

John, M.M., Holmström Olsson, H., Bosch, J. (2021). Architecting AI Deployment: A Systematic Review of State-of-the-Art and State-of-Practice Literature. In: Klotins, E., Wnuk, K. (eds) Software Business. ICSOB 2020. Lecture Notes in Business Information Processing, vol 407. Springer, Cham. https://doi.org/10.1007/978-3-030-67292-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-67292-8_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-67291-1

  • Online ISBN: 978-3-030-67292-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics