Skip to main content

Quality Assessment and Assurance of Machine Learning Systems: A Comprehensive Approach

  • Conference paper
  • First Online:
Information and Communication Technologies in Education, Research, and Industrial Applications (ICTERI 2023)

Abstract

Machine learning (ML) is opening up new opportunities for the development of innovative systems across a wide range of industries. However, assessing and ensuring the quality of systems with ML components introduces unique challenges related to inherent characteristics of such components like data centricity and unpredictable behavior. Traditional software quality assessment and assurance methods may not be sufficient for ML systems: (1) they focus on software code, while ML systems’ quality is influenced by the characteristics of the data and the algorithms used to create ML components; (2) they do not cover the emerging quality characteristics specific to ML systems, such as interpretability, explainability, fairness and trustworthiness. This PhD project aims to develop a comprehensive approach for assessing and assuring the quality of ML systems, with a focus on bias detection and prevention. The research will (1) explore the problem of bias in production ML systems; (2) analyze the gaps in existing software quality models and methods related to bias detection and prevention; and (3) propose an improved approach to quality assessment and assurance to address the challenges associated with bias in ML systems. The results of this PhD project are expected to contribute to the development of better models and methods for assessing and assuring the quality of ML systems, as well as have practical implications for industries that rely on ML systems to automate complex tasks, facilitate decision-making processes and gain insights from large amounts of data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Parashar, G., Chaudhary, A., Rana, A.: Systematic mapping study of AI/machine learning in healthcare and future directions. SN Comput. Sci. 2, 461 (2021). https://doi.org/10.1007/s42979-021-00848-6

    Article  Google Scholar 

  2. Egger, J., Gsaxner, C., Pepe, A., et al.: Medical deep learning – A systematic meta-review. Comput. Methods Programs Biomed. 221, 106874 (2022). https://doi.org/10.1016/j.cmpb.2022.106874

    Article  Google Scholar 

  3. Kim, S.W., Kong, J.H., Lee, S.W., et al.: Recent advances of artificial intelligence in manufacturing industrial sectors: a review. Int. J. Precis. Eng. Manuf. 23, 111–129 (2022). https://doi.org/10.1007/s12541-021-00600-3

    Article  Google Scholar 

  4. Sircar, A., Yadav, K., Rayavarapu, K., et al.: Application of machine learning and artificial intelligence in oil and gas industry. Pet. Res. 6(4), 379–391. https://doi.org/10.1016/j.ptlrs.2021.05.009

  5. Younis, H., Sundarakani, B., Alsharairi, M.: Applications of artificial intelligence and machine learning within supply chains: systematic review and future research directions. J. Model. Manag. 17(3), 916–940 (2022). https://doi.org/10.1108/JM2-12-2020-0322

    Article  Google Scholar 

  6. The state of AI in 2021. https://www.mckinsey.com/capabilities/quantumblack/our-insights/global-survey-the-state-of-ai-in-2021. Accessed 01 Apr 2023

  7. The state of AI in 2022—and a half decade in review. https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2022-and-a-half-decade-in-review. Accessed 01 Apr 2023

  8. Carleton, A., Klein, M., Robert, J., et al.: Architecting the Future of Software Engineering: A National Agenda for Software Engineering Research & Development. Carnegie Mellon University, Software Engineering Institute (2021)

    Google Scholar 

  9. Sarker, I.H.: Machine learning: algorithms, real-world applications and research directions. SN COMPUT. SCI. 2, 1–21 (2021). https://doi.org/10.1007/s42979-021-00592-x

    Article  MathSciNet  Google Scholar 

  10. Janiesch, C., Zschech, P., Heinrich, K.: Machine learning and deep learning. Electron Markets 31, 685–695 (2021). https://doi.org/10.1007/s12525-021-00475-2

    Article  Google Scholar 

  11. Kühl, N., Schemmer, M., Goutier, M., et al.: Artificial intelligence and machine learning. Electron Markets 32, 2235–2244 (2022). https://doi.org/10.1007/s12525-022-00598-0

    Article  Google Scholar 

  12. Sculley, D., Holt, G., Golovin, D. et al.: Hidden technical debt in machine learning systems. In: Cortes, C., Lawrence, N., Lee, D., Sugiyama, M., R. Garnett, R. (eds.) Proceedings of the 28th International Conference on Neural Information Processing Systems, pp. 2503–2511 (2015)

    Google Scholar 

  13. Horneman, A., Mellinger, A., Ozkaya, I.: AI Engineering: 11 Foundational Practices. Carnegie Mellon University, Software Engineering Institute (2019)

    Google Scholar 

  14. ISO/IEC 23053:2022. https://www.iso.org/standard/74438.html. Accessed 01 May 2023

  15. Pons, L., Ozkaya, I.: Priority quality attributes for engineering AI-enabled systems (2019). https://arxiv.org/abs/1911.02912

  16. Bosch, J., Crnkovic, I., Holmström Olsson, H.: Engineering AI systems: a research agenda (2020). https://arxiv.org/abs/2001.07522

  17. The AI Act. https://artificialintelligenceact.eu/the-act/. Accessed 01 May 2023

  18. Siebert, J., Joeckel, L., Heidrich, J. et al.: Towards guidelines for assessing qualities of machine learning systems (2020). https://arxiv.org/abs/2008.11007

  19. ISO/IEC 25040:2011, https://www.iso.org/standard/35765.html, last accessed 2023/05/01

  20. ISO 9000:2015, https://www.iso.org/standard/45481.html, last accessed 2023/05/01

  21. What is Really Different in Engineering AI-Enabled Systems?, https://apps.dtic.mil/sti/trecms/pdf/AD1155001.pdf. Accessed 01 May 2023

  22. Xu, X., Wang, C., Wang, Z. et al.: Dependency tracking for risk mitigation in machine learning (ML) systems. In: 2022 IEEE/ACM 44th International Conference on SE: Software Engineering in Practice (ICSE-SEIP), pp. 145–146, Pittsburgh, PA, USA (2022)

    Google Scholar 

  23. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)

    MATH  Google Scholar 

  24. Kläs, M.: Towards identifying and managing sources of uncertainty in AI and machine learning models - an overview (2018). https://arxiv.org/abs/1811.11669

  25. Ozkaya, I.: What is really different in engineering AI-enabled systems? IEEE Softw. 37(4), 3–6 (2020). https://doi.org/10.1109/MS.2020.2993662

    Article  Google Scholar 

  26. Serban, A., Visser, J. An empirical study of software architecture for machine learning (2021). https://arxiv.org/abs/2105.12422

  27. Hüllermeier, E., Waegeman, W.: Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods. Mach. Learn. 110, 457–506 (2021). https://doi.org/10.1007/s10994-021-05946-3

    Article  MathSciNet  MATH  Google Scholar 

  28. Abdar, M., Pourpanah, F., Hussain, S., et al.: A review of uncertainty quantification in deep learning: techniques, applications and challenges. Inf. Fusion 76, 243–297 (2021)

    Article  Google Scholar 

  29. Jalaian, B., Lee, M., Russell, S.: Uncertain context: uncertainty quantification in machine learning. AI Mag. 40(4), 40–49 (2019)

    Google Scholar 

  30. Hu, X., Chu, L., Pei, J., et al.: model complexity of deep learning: a survey (2021). https://arxiv.org/abs/2103.05127

  31. Scaling Big Data Mining Infrastructure: The Twitter Experience, https://www.kdd.org/exploration_files/V14-02-02-Lin.pdf. Accessed 01 May 2023

  32. Architectural Components in ML-Enabled Systems. https://ckaestne.medium.com/architectural-components-in-ml-enabled-systems-78cf76b29a92. Accessed 01 May 2023

  33. A Quality Model for Machine Learning Systems. https://booking.ai/a-quality-model-for-machine-learning-systems-892118be9e19. Accessed 01 May 2023

  34. Thuraisingham, B.: Trustworthy machine learning. IEEE Intell. Syst. 37(1), 21–24 (2022). https://doi.org/10.1109/MIS.2022.3152946

    Article  Google Scholar 

  35. Ghahramani, Z.: Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015). https://doi.org/10.1038/nature14541. PMID: 26017444

    Article  Google Scholar 

  36. Geiger, R.S., Cope, D., Ip, J., et al.: Garbage in, garbage out revisited: what do machine learning application papers report about human-labeled training data? Quant. Sci. Stud. 2(3), 795–827 (2021). https://doi.org/10.1162/qss_a_00144

    Article  Google Scholar 

  37. ISO/IEC 25012:2008. https://www.iso.org/standard/35736.html. Accessed 01 May 2023

  38. Interpretability versus explainability. https://docs.aws.amazon.com/whitepapers/latest/model-explainability-aws-ai-ml/interpretability-versus-explainability.html. Accessed 01 May 2023

  39. Explainable AI (XAI). https://www.ibm.com/watson/explainable-ai. Accessed 01 May 2023

  40. Schmidt, P., Felix Biessmann, F.: Quantifying interpretability and trust in machine learning systems (2019).https://arxiv.org/abs/1901.08558

  41. Rawal, A., McCoy, J., Rawat, D. et al.: Recent advances in trustworthy explainable artificial intelligence: status, challenges and perspectives. https://www.techrxiv.org/articles/preprint/17054396. https://doi.org/10.36227/techrxiv.17054396.v1

  42. Goldblum, M., Schwarzschild, A., Patel, A. et al.: Adversarial attacks on machine learning systems for high-frequency trading (2021). https://arxiv.org/abs/2002.09565

  43. Piratla, V.: Robustness, evaluation and adaptation of machine learning models in the wild (2023). https://arxiv.org/abs/2303.02781

  44. Chew, Y.J., Wong, K.-S., Ooi, S.Y.: Privacy protection in machine learning: the state-of-the-art for a private decision tree. (2017)

    Google Scholar 

  45. Kurakin, A., Goodfellow, I., Bengio, S.: Adversarial examples in the physical world (2016). https://arxiv.org/abs/1607.02533

  46. Bass, L., Clements, P., Kazman, R.: Software Architecture in Practice, 4th edn. Addison-Wesley Professional, Boston (2021)

    Google Scholar 

  47. Lewis, G.A., Ozkaya, I., Xu, X.: Software architecture challenges for ML systems. In: IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 634–638, Luxembourg (2021). https://doi.org/10.1109/ICSME52107.2021.00071

  48. Malavolta, I., Muccini, H., Ozkaya, I.: Software architecture and artificial intelligence. J. Syst. Softw. 193, 111436 (2022). https://doi.org/10.1016/j.jss.2022.111436

    Article  Google Scholar 

  49. Muccini, H., Vaidhyanathan, K.: Leveraging machine learning techniques for architecting self-adaptive IoT systems (2020)https://doi.org/10.1109/SMARTCOMP50058.2020.00029

  50. Overcoming Software Architecture Challenges for ML-Enabled Systems. https://apps.dtic.mil/sti/pdfs/AD1150241.pdf. Accessed 01 May 2023

  51. Arpteg, A., Brinne, B., Crnkovic-Friis, L. et al.: Software engineering challenges of deep learning. https://arxiv.org/abs/1810.12034

  52. Kharchenko, V., Fesenko, H., Illiashenko, O.: Basic model of non-functional characteristics for assessment of artificial intelligence quality. Radioelectron. Comput. Syst. 2, 131–144 (2022). https://doi.org/10.32620/reks.2022.2.11

  53. Siebert, J., Joeckel, L., Heidrich, J., et al.: Construction of a quality model for machine learning systems. Software Qual. J. 30, 307–335 (2022). https://doi.org/10.1007/s11219-021-09557-y

    Article  Google Scholar 

  54. ISO/IEC 25010:2011. https://www.iso.org/ru/standard/35733.html. Accessed 01 May 2023

  55. Washizaki, H., Uchida, H., Khomh, F. et al.: Studying software engineering patterns for designing machine learning systems (2019). https://arxiv.org/abs/1910.04736

  56. Yokoyama, H.: Machine learning system architectural pattern for improving operational stability. In: 2019 IEEE International Conference on Software Architecture Companion (ICSA-C), pp. 267–274, Hamburg, Germany (2019). https://doi.org/10.1109/ICSA-C.2019.00055

  57. Cabrera, C., Paleyes, A., Thodoroff, P. et al.: Real-world machine learning systems: a survey from a data-oriented architecture perspective (2023). https://arxiv.org/abs/2302.04810

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yurii Sholomii .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sholomii, Y., Yakovyna, V. (2023). Quality Assessment and Assurance of Machine Learning Systems: A Comprehensive Approach. In: Antoniou, G., et al. Information and Communication Technologies in Education, Research, and Industrial Applications. ICTERI 2023. Communications in Computer and Information Science, vol 1980. Springer, Cham. https://doi.org/10.1007/978-3-031-48325-7_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-48325-7_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-48324-0

  • Online ISBN: 978-3-031-48325-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics