Abstract
Machine learning (ML) is opening up new opportunities for the development of innovative systems across a wide range of industries. However, assessing and ensuring the quality of systems with ML components introduces unique challenges related to inherent characteristics of such components like data centricity and unpredictable behavior. Traditional software quality assessment and assurance methods may not be sufficient for ML systems: (1) they focus on software code, while ML systems’ quality is influenced by the characteristics of the data and the algorithms used to create ML components; (2) they do not cover the emerging quality characteristics specific to ML systems, such as interpretability, explainability, fairness and trustworthiness. This PhD project aims to develop a comprehensive approach for assessing and assuring the quality of ML systems, with a focus on bias detection and prevention. The research will (1) explore the problem of bias in production ML systems; (2) analyze the gaps in existing software quality models and methods related to bias detection and prevention; and (3) propose an improved approach to quality assessment and assurance to address the challenges associated with bias in ML systems. The results of this PhD project are expected to contribute to the development of better models and methods for assessing and assuring the quality of ML systems, as well as have practical implications for industries that rely on ML systems to automate complex tasks, facilitate decision-making processes and gain insights from large amounts of data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Parashar, G., Chaudhary, A., Rana, A.: Systematic mapping study of AI/machine learning in healthcare and future directions. SN Comput. Sci. 2, 461 (2021). https://doi.org/10.1007/s42979-021-00848-6
Egger, J., Gsaxner, C., Pepe, A., et al.: Medical deep learning – A systematic meta-review. Comput. Methods Programs Biomed. 221, 106874 (2022). https://doi.org/10.1016/j.cmpb.2022.106874
Kim, S.W., Kong, J.H., Lee, S.W., et al.: Recent advances of artificial intelligence in manufacturing industrial sectors: a review. Int. J. Precis. Eng. Manuf. 23, 111–129 (2022). https://doi.org/10.1007/s12541-021-00600-3
Sircar, A., Yadav, K., Rayavarapu, K., et al.: Application of machine learning and artificial intelligence in oil and gas industry. Pet. Res. 6(4), 379–391. https://doi.org/10.1016/j.ptlrs.2021.05.009
Younis, H., Sundarakani, B., Alsharairi, M.: Applications of artificial intelligence and machine learning within supply chains: systematic review and future research directions. J. Model. Manag. 17(3), 916–940 (2022). https://doi.org/10.1108/JM2-12-2020-0322
The state of AI in 2021. https://www.mckinsey.com/capabilities/quantumblack/our-insights/global-survey-the-state-of-ai-in-2021. Accessed 01 Apr 2023
The state of AI in 2022—and a half decade in review. https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2022-and-a-half-decade-in-review. Accessed 01 Apr 2023
Carleton, A., Klein, M., Robert, J., et al.: Architecting the Future of Software Engineering: A National Agenda for Software Engineering Research & Development. Carnegie Mellon University, Software Engineering Institute (2021)
Sarker, I.H.: Machine learning: algorithms, real-world applications and research directions. SN COMPUT. SCI. 2, 1–21 (2021). https://doi.org/10.1007/s42979-021-00592-x
Janiesch, C., Zschech, P., Heinrich, K.: Machine learning and deep learning. Electron Markets 31, 685–695 (2021). https://doi.org/10.1007/s12525-021-00475-2
Kühl, N., Schemmer, M., Goutier, M., et al.: Artificial intelligence and machine learning. Electron Markets 32, 2235–2244 (2022). https://doi.org/10.1007/s12525-022-00598-0
Sculley, D., Holt, G., Golovin, D. et al.: Hidden technical debt in machine learning systems. In: Cortes, C., Lawrence, N., Lee, D., Sugiyama, M., R. Garnett, R. (eds.) Proceedings of the 28th International Conference on Neural Information Processing Systems, pp. 2503–2511 (2015)
Horneman, A., Mellinger, A., Ozkaya, I.: AI Engineering: 11 Foundational Practices. Carnegie Mellon University, Software Engineering Institute (2019)
ISO/IEC 23053:2022. https://www.iso.org/standard/74438.html. Accessed 01 May 2023
Pons, L., Ozkaya, I.: Priority quality attributes for engineering AI-enabled systems (2019). https://arxiv.org/abs/1911.02912
Bosch, J., Crnkovic, I., Holmström Olsson, H.: Engineering AI systems: a research agenda (2020). https://arxiv.org/abs/2001.07522
The AI Act. https://artificialintelligenceact.eu/the-act/. Accessed 01 May 2023
Siebert, J., Joeckel, L., Heidrich, J. et al.: Towards guidelines for assessing qualities of machine learning systems (2020). https://arxiv.org/abs/2008.11007
ISO/IEC 25040:2011, https://www.iso.org/standard/35765.html, last accessed 2023/05/01
ISO 9000:2015, https://www.iso.org/standard/45481.html, last accessed 2023/05/01
What is Really Different in Engineering AI-Enabled Systems?, https://apps.dtic.mil/sti/trecms/pdf/AD1155001.pdf. Accessed 01 May 2023
Xu, X., Wang, C., Wang, Z. et al.: Dependency tracking for risk mitigation in machine learning (ML) systems. In: 2022 IEEE/ACM 44th International Conference on SE: Software Engineering in Practice (ICSE-SEIP), pp. 145–146, Pittsburgh, PA, USA (2022)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
Kläs, M.: Towards identifying and managing sources of uncertainty in AI and machine learning models - an overview (2018). https://arxiv.org/abs/1811.11669
Ozkaya, I.: What is really different in engineering AI-enabled systems? IEEE Softw. 37(4), 3–6 (2020). https://doi.org/10.1109/MS.2020.2993662
Serban, A., Visser, J. An empirical study of software architecture for machine learning (2021). https://arxiv.org/abs/2105.12422
Hüllermeier, E., Waegeman, W.: Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods. Mach. Learn. 110, 457–506 (2021). https://doi.org/10.1007/s10994-021-05946-3
Abdar, M., Pourpanah, F., Hussain, S., et al.: A review of uncertainty quantification in deep learning: techniques, applications and challenges. Inf. Fusion 76, 243–297 (2021)
Jalaian, B., Lee, M., Russell, S.: Uncertain context: uncertainty quantification in machine learning. AI Mag. 40(4), 40–49 (2019)
Hu, X., Chu, L., Pei, J., et al.: model complexity of deep learning: a survey (2021). https://arxiv.org/abs/2103.05127
Scaling Big Data Mining Infrastructure: The Twitter Experience, https://www.kdd.org/exploration_files/V14-02-02-Lin.pdf. Accessed 01 May 2023
Architectural Components in ML-Enabled Systems. https://ckaestne.medium.com/architectural-components-in-ml-enabled-systems-78cf76b29a92. Accessed 01 May 2023
A Quality Model for Machine Learning Systems. https://booking.ai/a-quality-model-for-machine-learning-systems-892118be9e19. Accessed 01 May 2023
Thuraisingham, B.: Trustworthy machine learning. IEEE Intell. Syst. 37(1), 21–24 (2022). https://doi.org/10.1109/MIS.2022.3152946
Ghahramani, Z.: Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015). https://doi.org/10.1038/nature14541. PMID: 26017444
Geiger, R.S., Cope, D., Ip, J., et al.: Garbage in, garbage out revisited: what do machine learning application papers report about human-labeled training data? Quant. Sci. Stud. 2(3), 795–827 (2021). https://doi.org/10.1162/qss_a_00144
ISO/IEC 25012:2008. https://www.iso.org/standard/35736.html. Accessed 01 May 2023
Interpretability versus explainability. https://docs.aws.amazon.com/whitepapers/latest/model-explainability-aws-ai-ml/interpretability-versus-explainability.html. Accessed 01 May 2023
Explainable AI (XAI). https://www.ibm.com/watson/explainable-ai. Accessed 01 May 2023
Schmidt, P., Felix Biessmann, F.: Quantifying interpretability and trust in machine learning systems (2019).https://arxiv.org/abs/1901.08558
Rawal, A., McCoy, J., Rawat, D. et al.: Recent advances in trustworthy explainable artificial intelligence: status, challenges and perspectives. https://www.techrxiv.org/articles/preprint/17054396. https://doi.org/10.36227/techrxiv.17054396.v1
Goldblum, M., Schwarzschild, A., Patel, A. et al.: Adversarial attacks on machine learning systems for high-frequency trading (2021). https://arxiv.org/abs/2002.09565
Piratla, V.: Robustness, evaluation and adaptation of machine learning models in the wild (2023). https://arxiv.org/abs/2303.02781
Chew, Y.J., Wong, K.-S., Ooi, S.Y.: Privacy protection in machine learning: the state-of-the-art for a private decision tree. (2017)
Kurakin, A., Goodfellow, I., Bengio, S.: Adversarial examples in the physical world (2016). https://arxiv.org/abs/1607.02533
Bass, L., Clements, P., Kazman, R.: Software Architecture in Practice, 4th edn. Addison-Wesley Professional, Boston (2021)
Lewis, G.A., Ozkaya, I., Xu, X.: Software architecture challenges for ML systems. In: IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 634–638, Luxembourg (2021). https://doi.org/10.1109/ICSME52107.2021.00071
Malavolta, I., Muccini, H., Ozkaya, I.: Software architecture and artificial intelligence. J. Syst. Softw. 193, 111436 (2022). https://doi.org/10.1016/j.jss.2022.111436
Muccini, H., Vaidhyanathan, K.: Leveraging machine learning techniques for architecting self-adaptive IoT systems (2020)https://doi.org/10.1109/SMARTCOMP50058.2020.00029
Overcoming Software Architecture Challenges for ML-Enabled Systems. https://apps.dtic.mil/sti/pdfs/AD1150241.pdf. Accessed 01 May 2023
Arpteg, A., Brinne, B., Crnkovic-Friis, L. et al.: Software engineering challenges of deep learning. https://arxiv.org/abs/1810.12034
Kharchenko, V., Fesenko, H., Illiashenko, O.: Basic model of non-functional characteristics for assessment of artificial intelligence quality. Radioelectron. Comput. Syst. 2, 131–144 (2022). https://doi.org/10.32620/reks.2022.2.11
Siebert, J., Joeckel, L., Heidrich, J., et al.: Construction of a quality model for machine learning systems. Software Qual. J. 30, 307–335 (2022). https://doi.org/10.1007/s11219-021-09557-y
ISO/IEC 25010:2011. https://www.iso.org/ru/standard/35733.html. Accessed 01 May 2023
Washizaki, H., Uchida, H., Khomh, F. et al.: Studying software engineering patterns for designing machine learning systems (2019). https://arxiv.org/abs/1910.04736
Yokoyama, H.: Machine learning system architectural pattern for improving operational stability. In: 2019 IEEE International Conference on Software Architecture Companion (ICSA-C), pp. 267–274, Hamburg, Germany (2019). https://doi.org/10.1109/ICSA-C.2019.00055
Cabrera, C., Paleyes, A., Thodoroff, P. et al.: Real-world machine learning systems: a survey from a data-oriented architecture perspective (2023). https://arxiv.org/abs/2302.04810
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sholomii, Y., Yakovyna, V. (2023). Quality Assessment and Assurance of Machine Learning Systems: A Comprehensive Approach. In: Antoniou, G., et al. Information and Communication Technologies in Education, Research, and Industrial Applications. ICTERI 2023. Communications in Computer and Information Science, vol 1980. Springer, Cham. https://doi.org/10.1007/978-3-031-48325-7_20
Download citation
DOI: https://doi.org/10.1007/978-3-031-48325-7_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48324-0
Online ISBN: 978-3-031-48325-7
eBook Packages: Computer ScienceComputer Science (R0)