1 Introduction

An Intelligent Environment (IE) is a physical space equipped with a network of devices, such as sensors and actuators, orchestrated by algorithms that proactively but sensibly support people in carrying out their daily activities (Augusto et al., 2013). IEs inherit concepts and technologies from several closely related domains, such as Ubiquitous or Pervasive Computing (Weiser, 1991), Ambient Intelligence (Aarts & Roovers, 2003), Smart Environments (Rashidi et al., 2011) and the Internet of Things (Atzori et al., 2010). IEs are complex systems comprising sensing technology and are expected to perform a wide variety of personalised functions in fields such as independent living (Martirano & Mitolo, 2020), education (Rizk & Hillier, 2022), smart homes, healthcare (World Health Organization, 2022), ambient assisted living (Memon et al., 2014), agriculture, and smart factories. They are deployed in physical environments and react to real-time data according to contexts defined for specific stakeholders (Banijamali et al., 2020). For instance, the key goal of an ambient assisted living system is to “proactively, but sensibly, support people in their daily lives” (Augusto, 2007).

However, since IEs are highly user-centric systems, it is imperative that they are not only effective but also ethically viable for their end users (Augusto et al., 2013). Developers also need to cater for important IE-specific technical challenges such as context-awareness, tracking user preferences, implementing reasoning and dealing with hardware malfunctions (ibid.). Traditional software engineering methods and tools lack the maturity needed to engineer these systems effectively and address their specific challenges (Ahmad et al., 2015). The User-Centred Intelligent Environments Development Process (U-CIEDP) was therefore proposed to develop these types of systems (Augusto et al., 2018). To date, U-CIEDP has been applied mostly to research-oriented projects, and the lessons learnt have revealed two limitations. First, each phase in U-CIEDP requires strong planning to avoid uncoordinated system development (Augusto et al., 2018; Ogbuabor et al., 2021; Santokhee et al., 2019). Second, there is no clear strategy to assess the quality of the systems, during or after development, which is a major limitation (ibid.).

As our reliance on IEs grows, there is an undeniable increase in the demand for higher quality systems. Historically, subpar quality has been a primary cause of failure in software-based products (Jones & Bonsignour, 2011). Given the diverse and critical applications IEs are expected to handle, low quality could result in severe consequences, potentially even endangering lives in medical contexts. System failures lead to the cessation of services, and the complexity of IEs, with their numerous components and intricate human-computer interactions, makes them susceptible to unforeseen issues (ibid.). For instance, upgrading firmware for one hardware component may alter an IE's behaviour, presenting challenges in anticipating and testing such scenarios. Complete control over interfaces is essential for successful system integration, especially as the interactions among components contribute to the emergence of effects unattainable by individual elements alone. Human-machine interfaces add complexity, as these systems must function in dynamic operational conditions involving diverse hardware devices, intricate interactions, unpredictable resource availability, unforeseen usage scenarios, and the occurrence of hard-to-predict errors.

Therefore, this study is motivated by the necessity to engineer higher quality IEs. It aims to contribute to the broader understanding of the challenges involved in developing higher quality IEs. U-CIEDP has been enhanced by incorporating new quality-management activities into each of its three core development stages: initial scoping, main development, and installation. The overall goal is to produce systems which meet users’ or stakeholders’ expectations while satisfying core quality requirements. A novel IE-specific quality model to evaluate IEs is also proposed. The model is derived from the generic ISO/IEC 25010 quality-in-use model and has been adapted according to the nine guiding principles of IEs, as elaborated in Section 2 (Augusto et al., 2013; ISO/IEC 25010, 2021). A multiple case-study approach was applied to explore the applicability of the enhanced U-CIEDP methodological framework (Kurtel & Ozemre, 2013; Runeson & Höst, 2009; Sicari et al., 2019; Scott et al., 2021; Staron et al., 2011; Tröls et al., 2021; Yin, 2018). The design of the case study followed Yin’s five steps (Yin, 2018). A further motivation for using case study research is to investigate whether the framework applies in real-world settings. We report on two case studies in this paper. By incorporating quality characteristics from the ISO/IEC 25000 (2021) standards into functional requirements, our approach ensures the inherent integration of quality into system development (Brodie & Woodman, 2009; Gilb & Brodie, 2012). Collaborative efforts between stakeholders and the project team facilitate the definition of targets using measures derived from quality standards. Quality metrics enable developers to monitor deviations from quality targets, addressing issues early in development and enhancing the overall assessment of application quality.
The proposed quality-in-use model offers valuable insights, guiding the development process by providing an objective perspective on the system's expected capabilities. Continuous stakeholder involvement throughout ensures the delivery of systems that offer optimal value.

The structure of this article is as follows. Section 2 highlights some background concepts underpinning this study. In Section 3, we report on a Systematic Literature Review carried out to inform this study and discuss the research question and propositions formulated from gaps identified through analysis of the literature. The proposed quality-in-use model and UCIEDP2 are described in Section 4. The design and details of the multiple case study are explained in Section 5. In Section 6, we discuss the findings of the case studies by aggregating their results. Threats to validity are analysed in Section 7. The paper ends with a conclusion, limitations, and areas for future work in Section 8.

2 Background

The development of IEs requires a multidisciplinary team capable of applying techniques and methods not only from software engineering, to improve reliability, but also from other Computer Science disciplines, such as Artificial Intelligence, Ubiquitous/Pervasive Computing and Human-Computer Interaction, which make the resulting IE less intrusive while being smarter, more proactive and more usable (Augusto et al., 2013; Dyba et al., 2007; Salvi et al., 2015). Table 1 summarises the nine principles proposed by Augusto et al. (2013). The idea is that every IE should aspire to possess these core principles to ensure that the systems developed are technically and ethically viable. However, some of the principles are entangled, in the sense that achieving one may affect the degree of fulfilment of others. For instance, developing a high-performance IE may require reduced security checks. The key challenge is therefore in implementing and managing these core principles during the development lifecycle while taking important quality attribute trade-offs into consideration. IEs also pose specific technical challenges such as context-awareness, tracking the preferences of multiple users in an environment, implementing reasoning, and dealing with hardware malfunctions (Augusto et al., 2013). More importantly, since these systems are highly user-centric, human decisions may not be rational, repeatable, or testable.

Table 1 Intelligent Environments Manifesto (Augusto et al., 2013)

IEs can also be considered complex software-intensive systems, which are described as “systems where the software contributes essential influences to the design, construction, deployment and evolution of the system as a whole” (Sommerville, 2011). Reputable references on quality offer diverse perspectives on the concept (Anurag & Kamatchi, 2019; Cote et al., 2006). Although quality as a topic has been widely studied in software engineering for different types of systems (Anurag & Kamatchi, 2019; Benghazi et al., 2012; Kara et al., 2017; Kurtel & Ozemre, 2013; Regan et al., 2020; Vogel et al., 2021), we argue that there is a lack of consensus regarding how to develop higher quality IEs.

Unfortunately, traditional software engineering methods and tools are not mature enough to engineer high quality IEs and to deal with the design challenges posed by these types of systems (Ahmad et al., 2015). In a recent study, Olianas et al. (2022) reflected that assuring quality in IoT systems is challenging and reported results of applying a prototype tool to perform system-level testing of these systems. Therefore, we argue that more specific quality models, methodologies, and tools are required to develop and evaluate high quality IEs. As a first step in this study, it was deemed necessary to investigate more thoroughly how IEs are developed from a quality perspective.

3 Literature review

It was deemed imperative to conduct a more comprehensive investigation into the development of IEs from a quality perspective. A Systematic Literature Review (SLR) was conducted following the guidelines proposed by Kitchenham and Charters (Kitchenham, 2007) in three phases: planning, conducting, and reporting. The planning phase established the need for the review and defined the review protocol. In the second phase, research was identified, primary studies were selected against defined criteria, their quality was assessed, and data were extracted for synthesis. The third phase involved reporting the documents and data obtained during the review. Two iterative processes were also implemented during the review to minimise the introduction of bias (Wohlin, 2014).

3.1 Planning the SLR

The main goal of the Systematic Literature Review (SLR) was to investigate the current state of the art regarding the quality of IEs. To achieve this, the main research question was broken down into more specific inquiries grouped into three broad areas: definition, measurement, and challenges. Table 2 lists the specific research questions for the SLR.

Table 2 Research Questions for SLR

RQ1 sought to establish how quality is defined for IEs. It aimed to explore researchers' perspectives on quality, the commonly employed quality characteristics, how conflicting quality requirements are addressed, and whether any system development methodology specifically focuses on quality within IE domains. The second research question (RQ2) delved into the various approaches used to evaluate the quality of IEs, examining the application of quality models and the system development stage(s) at which quality requirements are measured. The third research question (RQ3) investigated the prevailing challenges and identified areas for future work as documented in the literature.

3.2 Search strategy

A crucial step in a SLR involves identifying pertinent studies capable of addressing the research questions. Thus, the selection of appropriate search terms and keywords becomes paramount. In this study, we adopted an iterative approach, facilitating the analysis and gradual refinement of the search string. The search strings were devised by incorporating synonyms and abbreviations, connected through Boolean expressions, with the aim of retrieving a comprehensive set of publications. For this systematic literature review, five distinct databases (ACM, Web of Science, IEEE, Science Direct, and Springer Link) were chosen due to their extensive coverage of topics in software engineering. Table 3, presented below, outlines the executed search strings for each database, with the user guide for each database consulted to enhance the syntax of the search strings.

Table 3 Search Strings
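To illustrate how such search strings are assembled, the sketch below joins synonyms with OR and the resulting groups with AND. The term groups shown are hypothetical placeholders introduced for illustration; the strings actually executed against each database are those in Table 3.

```python
# Hypothetical sketch of Boolean search-string construction (Section 3.2).
# The term groups below are illustrative only, not the review's real terms.
term_groups = [
    ["intelligent environment", "smart environment", "ambient intelligence"],
    ["quality", "quality model", "quality attribute"],
]

def build_search_string(groups):
    """Join synonyms within a group with OR, then join groups with AND."""
    clauses = ["(" + " OR ".join(f'"{term}"' for term in group) + ")"
               for group in groups]
    return " AND ".join(clauses)

query = build_search_string(term_groups)
```

In practice each database requires small syntax adjustments, which is why the user guide of each database was consulted when refining the strings.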

3.3 Defining inclusion and exclusion criteria

To determine the primary studies for further consideration, inclusion and exclusion criteria were established following the guidelines of Kitchenham and Charters (Kitchenham, 2007). A study underwent further analysis only if it met all the inclusion criteria and none of the exclusion criteria. The inclusion criteria are defined as follows:

  • IC1: The study is related to an aspect of quality for IEs.

  • IC2: The study is a peer-reviewed journal, conference or workshop proceeding.

  • IC3: The study addresses one or more of the review questions.

The exclusion criteria consist of:

  • EC1: The study is written in a language other than English.

  • EC2: The focus of the study is not related to any aspect of quality for IEs.

  • EC3: Duplicate studies.

  • EC4: The study is less than four pages in length.

  • EC5: Magazine, dissertation, tutorial, editorial, book, poster or other non-peer-reviewed publication.

  • EC6: Systematic mapping or literature reviews.

  • EC7: The study is published before 2003.

Following the application of inclusion and exclusion criteria, the main author reviewed the abstract of each shortlisted paper to determine its eligibility for further screening. This process was iterated at least twice on separate occasions to minimize bias. Four duplicate studies were identified and subsequently excluded. Upon completion of the initial screening, the full text of each paper was obtained and comprehensively examined by the main author. Table 4 provides a summary of the number of papers screened at each stage.

Table 4 Number of Retrievals Per Database
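The screening rule applied above (all inclusion criteria met and no exclusion criterion triggered) can be expressed as a simple predicate. The field names in this sketch are assumptions introduced for illustration, not part of the review protocol:

```python
# Hypothetical sketch of the screening decision in Section 3.3: a study is
# retained only if it meets IC1-IC3 and triggers none of EC1-EC7.
def passes_screening(study):
    inclusion_met = all([
        study["about_ie_quality"],               # IC1
        study["peer_reviewed"],                  # IC2
        study["addresses_review_question"],      # IC3
    ])
    exclusion_triggered = any([
        study["language"] != "English",          # EC1
        not study["about_ie_quality"],           # EC2
        study["duplicate"],                      # EC3
        study["pages"] < 4,                      # EC4
        study["venue"] not in {"journal", "conference", "workshop"},  # EC5
        study["secondary_review"],               # EC6
        study["year"] < 2003,                    # EC7
    ])
    return inclusion_met and not exclusion_triggered
```

A predicate of this form makes the criteria auditable: every retained paper can be re-checked against the same rule on a second screening pass.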

3.4 Conducting the SLR quality assessment

The remaining 33 papers underwent screening in an Excel sheet, with each paper assessed against the following three quality criteria:

  • Q1: Are aims and scope of the study clearly stated?

  • Q2: Are all the study questions answered?

  • Q3: Are the data source, contexts and conclusions described appropriately for future references?

To establish these quality criteria, we adhered to the guidelines provided by Kitchenham and Charters (Kitchenham, 2007). The following scale-point was applied to each question:

  • (i) A study fully meets a given quality criterion – 1 point.

  • (ii) A study partially meets a given quality criterion – 0.5 points.

  • (iii) A study does not meet a given quality criterion – 0 points.

A total score was calculated by using the following formula:

$$\text{Quality score} = \text{Q}1 + \text{Q}2 + \text{Q}3$$

A study needed to attain a total quality score equal to or greater than 1.5 to qualify for further analysis. All 33 papers successfully met the quality criteria.
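The scoring and threshold described above can be sketched in a few lines (a minimal illustration; the actual screening was carried out in a spreadsheet):

```python
# Quality scoring from Section 3.4: each criterion (Q1-Q3) scores
# 1 (fully met), 0.5 (partially met) or 0 (not met); a study must
# score at least 1.5 in total to qualify for further analysis.
THRESHOLD = 1.5

def quality_score(q1, q2, q3):
    assert {q1, q2, q3} <= {0, 0.5, 1}, "each criterion scores 0, 0.5 or 1"
    return q1 + q2 + q3

def qualifies(q1, q2, q3):
    return quality_score(q1, q2, q3) >= THRESHOLD
```

A study fully meeting one criterion and partially meeting another, for example, sits exactly on the 1.5 threshold and is retained.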

3.5 Data extraction

After evaluating the quality of each primary study, the subsequent step involved the extraction of data. A data extraction form was created in Microsoft Excel, structured as follows:

  • (i) Reference details

  • (ii) Concept of quality

  • (iii) Contribution

  • (iv) Domain

  • (v) Quality characteristic(s)

  • (vi) Specification

  • (vii) Methodology

  • (viii) When measured?

  • (ix) Type of study

  • (x) Challenges

  • (xi) Future work

The extraction of data was significantly streamlined by initially downloading and saving each full paper individually on disk. The researcher examined each paper to extract pertinent data, recording it in the Excel file. Appendix A provides a summary of the contribution of each paper, identified by a Paper ID attribute.
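As a minimal sketch, the extraction form can be represented as a structured record. The field names below mirror the eleven items listed above plus the Paper ID, and are illustrative rather than the exact spreadsheet column names:

```python
from dataclasses import dataclass

# Sketch of one row of the data extraction form from Section 3.5
# (field names are assumptions introduced for this illustration).
@dataclass
class ExtractionRecord:
    paper_id: str                  # identifier used in Appendix A
    reference_details: str         # (i)
    concept_of_quality: str        # (ii)
    contribution: str              # (iii)
    domain: str                    # (iv)
    quality_characteristics: list  # (v)
    specification: str             # (vi)
    methodology: str               # (vii)
    when_measured: str             # (viii)
    type_of_study: str             # (ix)
    challenges: str                # (x)
    future_work: str               # (xi)
```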

3.6 Reporting the SLR

Descriptive statistics were employed to analyze notable patterns in the publications, exploring trends such as the annual publication count, the domains under consideration, the empirical methodologies utilized, and the number of citations for each study. As illustrated in Fig. 1, the total number of publications on the topic has been consistently limited to one or two per year over the last 15 years, with a notable peak observed in 2018. From the early stages of growth around 2015, the sector witnessed a surge of interest, particularly in smart homes and cities, improved security, and enhanced quality of life, culminating by 2018. The acceleration of research in this sector is attributed to a growing emphasis on interdisciplinary collaboration across various fields. Furthermore, the upsurge in funding opportunities and backing from academic institutions, government agencies, and industry could have contributed to the escalation in publications on IEs (European Commission, 2009). As shown in Fig. 2, the highest number of studies over the last 15 years targeted IoT Systems (13), followed by AAL (7) and Ubiquitous Systems (5). Recent technological advancements and the widespread adoption of devices, especially in areas like AI, IoT, and sensor networks, have contributed to the proliferation of these systems (Reggio et al., 2020; McKinsey, 2021). Figure 3 depicts a notable prevalence of conference papers compared to journal articles in this field. While conferences offer a valuable platform for disseminating research, journal publications remain crucial for delivering more comprehensive and in-depth studies, rigorous peer review, and long-term archival of research in the domain of IEs. This observation highlights the relative novelty and rapid evolution of the field: researchers often present their initial findings and innovative ideas at conferences to receive early feedback and establish their presence. 
We noted a rise in the number of citations per paper from 20 to 120, which again shows a growing interest of researchers in this field and the consequent increase in the number of publications, as shown in Fig. 4.

Fig. 1 Frequency of Publications Per Year

Fig. 2 Frequency of Publications Per IE Domain

Fig. 3 Types of Publications

Fig. 4 Number of Citations Per Paper

3.7 Findings

3.7.1 RQ1 – how is quality defined for IE systems?

RQ1.1

Which aspects of quality have been considered by researchers in the domains of IE?

We noted that researchers adopted a non-functional requirements perspective when addressing the concept of quality. Sommerville (2011) emphasises the importance of non-functional requirements in shaping the overall quality attributes of a software system. Pressman (2014) discusses the significance of non-functional requirements in determining the success or failure of a software project. In essence, the incorporation of non-functional requirements into the definition and assessment of quality is a recognised and widely accepted approach in the Software Engineering literature. Specific quality characteristics were implied in [P1, P4, P5, P7, P9, P12, P13, P17, P20, P21, P22, P24, P26, P28, P29]. The aspects of systems investigated are: usability [P2, P8, P18, P19, P25], data quality [P3, P31], quality of experience [P6, P14, P15, P16, P27, P32, P33], trust [P10] and quality of context [P23, P30]. This implies that in most of the studies under consideration, researchers approached the assessment of quality by looking beyond the functional features of the system. Instead, they paid attention to the broader characteristics that contribute to the overall usability, quality of experience, quality of context and reliability of the system. This approach acknowledges that a system's quality is not solely determined by its ability to perform specific tasks but is also influenced by how well it meets criteria related to its overall performance, user experience, security measures, and other non-functional attributes.

RQ1.2

How are quality requirements specified?

There is a scarcity of specific information concerning the specification of quality requirements. The utilisation of metrics emerged as a notable trend in several studies [P1, P5, P6, P17, P29, P30, P31]. In one instance [P3], quality requirements were gathered through communication with AAL service providers, while another study [P8] consulted caregivers for this purpose. In [P4], the definition and measurement of software quality factors were explicitly outlined.

The specification of quality requirements is pivotal in software engineering, as it serves as the foundation for designing, developing, and evaluating a system (Pressman, 2014; Sommerville, 2011). Clear and precise quality requirements provide a roadmap for the development team, outlining the essential characteristics and attributes the system must possess to meet user expectations (Sommerville, 2011). This specification not only guides the development process but also forms the basis for subsequent testing and validation activities. It enables stakeholders to establish measurable criteria for success, facilitates effective communication between different project participants, and ultimately ensures that the delivered software aligns with user needs and organizational objectives. In essence, both Pressman (2014) and Sommerville (2011) assert that a well-defined specification of quality requirements is fundamental for achieving a successful and high-quality software product. Oram and Wilson (2010) discovered that deficiencies in precision and completeness within requirements and design documentation resulted in persistent design and requirements faults. These issues continued to be identified throughout the entire testing process, as indicated by their comprehensive survey of challenges encountered during the evolution of a large-scale real-time system.

RQ1.3

Which quality characteristics have been proposed for IEs?

We identified and grouped the quality characteristics according to the IE domains discovered through the SLR. These are summarised in Appendix B. We note that even within the same IE domain, different quality characteristics have been proposed in different studies. This shows that the choice of quality characteristics depends on the context in which the system will be used. These findings corroborate previous studies. For instance, in their evaluation of six AAL platforms (Alhambra, Hydra, OASIS, OpenAAL, PERSONA and UniversAAL), Antonino et al. (2011) selected quality attributes such as maintainability, efficiency and trustworthiness from the ISO/IEC 9126, ISO/IEC 14598, and ISO/IEC 25000 SQuaRE standards, believing these attributes to be critical to AAL systems. Memon et al. (2014) argued for studying characteristics which lead to interoperability, usability, security, and accuracy rather than concentrating on isolated aspects of AAL. In their study of Smart Cities, Kakarontzas et al. (2014) identified interoperability, usability, authentication, authorization, availability, recoverability, maintainability, and confidentiality as the most prominent quality drivers, and suggested that quality requirements could be defined using the ISO/IEC 25010 (2021) standard. Garcés et al. (2017) stressed the necessity of managing critical attributes such as security, freedom from risk, reliability, and performance efficiency from the start of AAL systems development. Washizaki et al. (2020) proposed interoperability as a key attribute due to the many participating entities which need to interact with one another. They also revealed that performance, usability, and scalability are central to certain IoT patterns for IoT systems and software, while highlighting the need to study other quality attributes. Ashouri et al. (2021) highlighted that performance, efficiency, time behaviour and resource utilization were the most popular quality attributes. They also reflected that few studies have examined attributes critical for IoT such as security, compatibility, portability, and maintainability. Fizza et al. (2023) proposed a new metric called quality of actuation to quantify the correctness of actuation, and reiterated the importance of developing a generic model for measuring the quality of autonomic applications and a framework to support their development. Thus, the overall picture which emerges is that this is an evolving field of study in which researchers have focused on addressing specific quality goals of systems within IE domains.

RQ1.4

How were the quality characteristics derived?

In ten studies, the quality characteristics were derived from the ISO/IEC 25010 (2021) or ISO/IEC 25000 (2021) standards [P12, P13, P17, P19, P20, P22, P26, P28, P29, P30], covering IE domains such as AAL, IoT, Smart Environments and Ubiquitous Systems. ISO/IEC 25010 (2021) is a universally recognised standard which provides comprehensive definitions for various quality attributes. However, only half of the cited studies [P13, P19, P22, P26, P30] provide empirical evidence of applying their quality characteristics. According to Hron and Obwegeser (2022) and Humble and Farley (2010), the definitions of these quality attributes have found acceptance across diverse industries including automotive, naval, avionics and medical devices. These definitions have also been embraced by industries aligning with Industry 4.0 technologies (Abdelouahid & Marzak, 2018; ISO/IEC 25010, 2021). In the remaining studies, the quality characteristics stemmed from prior research, literature reviews, and systematic mapping studies. Nevertheless, a few notable exceptions deviated from this trend. In the case of [P7], field interviews were undertaken after a thorough literature review, providing a unique perspective. Similarly, [P8] incorporated concerns and issues voiced by caregivers into the determination of quality aspects. However, it is noteworthy that the extent of engagement with end users remained relatively limited across most of the studies. Understanding how quality characteristics are derived is vital in software engineering (Sommerville, 2011). It offers transparency into the selection process, enabling stakeholders to assess the reliability and validity of the chosen attributes (Pressman, 2014). Whether stemming from empirical studies, literature reviews, or direct user interactions, this knowledge helps gauge the relevance and robustness of quality criteria (Kitchenham & Charters, 2007). Such understanding ensures alignment with user needs, organizational goals, and the broader context (Bass et al., 2012). In essence, a clear grasp of the derivation process enhances the credibility and effectiveness of quality considerations in software development (Sommerville, 2011).

RQ1.5

How are conflicting quality requirements managed and resolved?

IEs are complex systems and present a significant challenge due to the intricate trade-offs and dependencies inherent in them (Augusto et al., 2013). Quality attributes, such as performance, security, usability, and reliability, are interconnected and often trade off against one another; enhancing one attribute may inadvertently compromise another (Rodríguez-Domínguez et al., 2022). For example, optimizing performance by increasing system speed might lead to increased resource utilization and potential security vulnerabilities. IEs typically involve a diverse set of stakeholders with varying needs and expectations. End-users may prioritise usability and ease of use, while administrators may emphasise security and robustness. Balancing these conflicting stakeholder requirements becomes challenging, requiring careful negotiation and compromise. IEs operate in dynamic and evolving contexts, where requirements may change over time. Adapting to new user needs, technological advancements, or emerging security threats may necessitate adjustments in quality attributes, and enforcing a rigid stance on certain attributes may hinder the system's ability to evolve and meet changing demands. IEs also often encounter unforeseen challenges and uncertainties during operation, commonly referred to as "unknown unknowns" (Jones & Bonsignour, 2011). Enforcing specific quality attributes without anticipating and addressing these unknowns may lead to system vulnerabilities and unexpected failures. Therefore, we argue that striking a balance between conflicting quality attributes is key to the success of IEs. However, we noted a paucity of studies focused on conflicting quality attributes. [P7] highlights that timeliness, reliability and ease of use need to be managed carefully. [P31] discusses the trade-off between data usefulness and privacy protection. Maciel et al. (2022) highlight that edge devices, which are commonly found in IoT environments, contribute to reliability and availability. However, there remains a need to investigate methods for enhancing security and privacy on these resource-constrained devices, or to balance them against energy consumption. Mohammadi and Javidan (2022) propose a tool to tackle quality-of-service issues in the software-defined networks found in IEs by managing efficiency and survivability.

RQ1.6

Which system development methodology is used to develop IEs with a focus on quality?

Out of the 33 scrutinised studies, only a limited subset has delved into system development methodologies with a specific emphasis on quality considerations. Notably, [P16] contends that enhancing the elicitation of UX requirements necessitates the adoption of an Agile or iterative approach, underscoring the complexity of defining these requirements in comparison to usability aspects. The authors advocate for the active involvement of end-users in this iterative process. In [P21], an innovative methodology grounded in the Unified Modelling Language is introduced to facilitate the accurate design and analysis of AAL solutions. Additionally, [P23] proposes a model-driven approach for effectively modelling the quality of context information in pervasive systems. The literature collectively highlights the lack of a tailored methodology to guide developers in engineering higher quality IEs. Prioritising quality in the system development methodology for IEs is essential to ensure their functionality, reliability, and positive impact on society, while addressing the unique challenges posed by the complexity and dynamic nature of these environments.

3.7.2 RQ2 – How is quality of IEs evaluated?

RQ2.1

How are quality requirements measured?

The analysed articles collectively offer a multifaceted exploration of how quality is measured in IEs, encompassing diverse methodologies and perspectives. In [P1], the authors identified the ten most popular metrics for Object-Oriented Programming (OOP) based on a study by Nuñez-Varela et al. (2017). This provides a quantitative lens on code quality and offers insights into the industry's prevalent practices. However, this approach might be limited in capturing the full spectrum of software quality, especially considering the evolving nature of programming paradigms beyond OOP. The proposal to build a specific approach for usability testing in ubiquitous systems showcases a commitment to a holistic evaluation process [P2]. The establishment of a software process for context-awareness testing, the definition of interoperability measures, the design of context-awareness test cases, and the development of support tools collectively aim to address various dimensions of software quality. However, the comprehensive nature of this approach may introduce challenges such as increased complexity, resource demands, and potential resistance from development teams. A noteworthy theme is the evaluation of AAL technology, focusing on measuring efficacy through the quality of data generated by AAL systems [P3]. This involves assessing structured or semi-structured data across dimensions like accuracy, completeness, timeliness, and interpretability. The use of both quantitative and qualitative methods demonstrates a consolidated approach to data quality, acknowledging the need for a multifaceted evaluation. However, the exclusive focus on specific data dimensions may overlook other aspects crucial for AAL system performance.

The adoption of the Goal-Question-Metric approach to map low-level code-based metrics to high-level software quality factors signifies a bridge between detailed code analysis and overarching quality assessment [P4]. This method enables quantifiable measurements, facilitating the comparison of quality across different software systems and components. Nevertheless, challenges may arise in defining metrics that accurately represent the desired quality factors, potentially introducing biases. The exploration of software product quality metrics for Context-aware Computing extends the measurement scope to consider context-aware applications [P5]. The proposed quality-aware cross-layered framework for IoT applications acknowledges the intricate layers involved in IoT development. However, the challenge lies in the practical implementation of these frameworks, as achieving modularity, distribution, seamless integration, and transparency may be context-specific and difficult to generalise. The ENACT DevOps Framework introduces novel solutions to address challenges in developing, operating, and assuring the quality of distributed smart IoT systems [P10]. However, the effectiveness and adaptability of this framework across diverse infrastructures require empirical validation (White et al., 2017). The proposed quality-in-use model for AAL systems aligning with ISO/IEC 25010 (2021) underscores the importance of user-centric quality assessment [P12]. However, the application of predefined standards may limit the model's ability to capture the uniqueness of AAL contexts and user experiences. The refinement of the ISO/IEC 25010 (2021) quality model for Industry 4.0 needs signifies an attempt to tailor existing standards to specific industrial requirements [P20]. While providing actionable support for software engineers, the applicability and generalisability of the model may be contingent on industry-specific conditions.
The exploration of data quality characteristics for AAL systems, guided by ISO/IEC 25012 (2021) and ISO/IEC 25010 (2021) standards, seeks to establish relevant quality characteristics [P24]. However, the challenge lies in identifying universally applicable characteristics given the diverse nature of AAL systems.

In the comparison of quality models, [P25] suggests applying measures originally defined for ubiquitous systems to IoT applications. This points to an effort to adapt established metrics to different contexts, recognising the commonalities between IoT and ubiquitous applications. However, challenges may arise in ensuring the relevance and accuracy of these metrics when applied outside their original domain. The emphasis on a data-driven approach to improve User Experience (UX) through a case study signals an industry-oriented effort to integrate data analytics into UX evaluations [P16]. However, the effectiveness of this approach in diverse UX contexts and its generalisability require validation. Similarly, the proposal of a quality-in-use model for AAL systems, focusing on effectiveness, efficiency, satisfaction, freedom from risk, and context coverage, demonstrates an attempt to align software quality assessment with specific application domains. However, the adaptation of the model to diverse AAL contexts may be a potential challenge. [P18] presents heuristics for evaluating the usability of ubiquitous systems. This adds a qualitative dimension to the measurement of software quality. The heuristics offer guidelines for assessing effectiveness, efficiency, and satisfaction, emphasising a user-centric approach. However, the subjectivity inherent in qualitative assessments may introduce variability in the interpretation of usability.

The proposal of a comparative study of existing quality models of interoperability and the introduction of a hierarchic quality model for interoperability in IoT reflect an effort to standardise and define metrics for assessing interoperability [P29]. The challenge lies in establishing universally applicable criteria for interoperability, given the diverse nature of IoT applications. Additionally, the consideration of metrics based on ISO standards adds a level of standardisation but may introduce challenges in adapting these metrics to specific IoT contexts [P30]. The acknowledgment of limitations, such as the potential loss or alteration of information during the translation of quotes, emphasises the importance of considering reliability in the measurement process. This highlights the need for standardised approaches in data collection and reporting to ensure the consistency and accuracy of measurements.

In summary, the exploration of how quality is measured in the articles reflects a dynamic landscape, encompassing quantitative and qualitative methodologies, context-specific adaptations, and ongoing efforts to standardise metrics. The diverse perspectives and approaches underscore the complexity of software quality assessment, necessitating a balanced consideration of multiple factors and potential challenges in the measurement process.

RQ2.2

How is quality of IEs evaluated?

In [P1, P19, P20, P23, P24, P25, P26, P29, P30, P33], we note that the focus was on developing and proposing quality models for evaluating various aspects of IoT, ubiquitous systems, and cloud services. This included security, interoperability, context quality, and the adaptation of quality models to specific domains like Cloud IoT and Industry 4.0. We note the focus is more on measuring certain aspects of IEs. Some studies have presented quality models, evaluation methods, and indices specifically for AAL systems, aiming to assess their efficacy, quality-in-use, and data quality [P3, P7, P12, P13, P17, P24, P28]. [P2, P4, P5, P18, P19, P25] discuss challenges, metrics, and heuristics for usability testing and HCI quality evaluation in ubiquitous systems with a focus on modularity, context-aware computing, and the design of usability tests that consider context-awareness factors. Quality-of-Experience in IoT systems has been investigated in [P6, P27, P31, P32]. [P8, P10, P11, P14, P15, P16, P21, P22] encompass the development, management, and evaluation of smart IoT systems and explore methods to ensure their reliability, dependability, and user experience requirements. A quality-in-use model grounded in ISO/IEC 25010 (2021) for AAL systems is presented in [P17]. It was utilised as a guiding framework during the assessment of an intelligent solution. Conversely, other proposals, as indicated by [P5, P7], are still in their early developmental stages. We note there is a paucity of studies which investigate how quality of IEs is evaluated.

RQ2.3

During which phase of system development are the quality requirements measured?

In most studies, quality requirements are typically measured post development or during runtime. Sommerville (2011) notes that this is done to reflect the actual user experience and to study system behaviour in a production environment. Pressman (2014) argues that this leads to a comprehensive assessment of the final product quality characteristics. However, the main limitation of this approach is that it may result in late discovery of quality issues, after significant resources have already been invested (Boehm, 1981). [P4, P21] suggest incorporating quality measurements during the design phase. Budgen (2003) believes that this favours early identification and mitigation of potential quality issues during the design stage. Both Boehm (1981) and Jones and Bonsignour (2011) concur that addressing issues early in the development process is generally more cost-effective than later. For large systems, a post-delivery software change was about 100 times as expensive as the same change made during the requirements phase, with a ratio of about 5:1 for smaller systems (Boehm, 1981). More recent evidence also seems to support these findings, and it is recommended that higher investments in early requirements and architecture verification and validation can significantly reduce the high ratio of 100:1 (Oram & Wilson, 2010). It is worth noting that [P4] lacks empirical evidence, and the work was still ongoing. On the other hand, [P5, P8] advocate for measurements during the development process, while [P15] proposes incorporating them during the modelling stage. There is evidence that measuring quality during the development stage leads to early identification and resolution of quality concerns (Sommerville, 2011). Kan (2002) highlights that this encourages continuous refinement and improvement of the product as development progresses. However, the main drawback as pointed out by Kan (2002) is that it may require additional resources and effort.
Similarly, incorporating quality measurements during design ensures that the quality considerations are well integrated into the system. However, measurement during design may be less grounded in practical implementation.

3.8 RQ3 – What are the challenges and future research directions?

Several key challenges were identified in the reviewed studies. Bezerra et al. (2014) argue for usability testing in real-life environments for ubiquitous systems, emphasising the need for meticulous test case design to anticipate various potential contexts. Weyns et al. (2018) stress risks in automated decision-making, calling for improved UX requirements and cautioning against late hardware changes in agile processes. The literature review reveals a scarcity of research in specific areas (Hamzah et al., 2018), prompting a call for experimental testing and the development of an effective Quality of Experience (QoE) framework (Shin, 2017). A study in China emphasises the need for more humanistic care for the elderly, suggesting that technology falls short in meeting their spiritual needs (Chen et al., 2023). Goncalves et al. (2022) plan to use the Technology Acceptance Model for assessing the usability of a proposed tool among professional developers. There is a persistent challenge in accurately assessing smart systems both functionally and in terms of usability, with a recommendation for developing standard evaluation frameworks (Amiribesheli & Bouchachia, 2018). Communication about quality among team members lacks a standardised approach, with common strategies like unit tests and test cases posing uncertainty about exhaustiveness and potential cost implications. A need for more evaluations in real-life scenarios has also been identified.

3.9 Research question and proposition

Based on our analysis of the literature, we note that the state of the art in software technology does not yet present a well-established and widely accepted framework or methodology for engineering high quality IEs. There is a lack of support and guidance during the systems development lifecycle. The definition, measurement, and management of quality during the development process remain unclear. While various studies propose methods for evaluating IEs, empirical validation of these methods in industry is limited. In response to these identified gaps, we propose a framework that includes a quality-enhanced methodology and an IE-specific quality-in-use model. This framework aims to provide guidance for engineering higher-quality IEs. Consequently, the research question of this study was formulated as follows:

How can the process of engineering higher quality intelligent environments be improved through the integration of a framework, with a specific focus on improving the specification of quality requirements and the evaluation of quality throughout development?

The SLR emphasises the lack of clarity in defining, measuring, and managing quality during the development process. Drawing on the insights gained from the SLR, we proceeded to formulate propositions and related questions aligned with the research question. These propositions serve to refine and guide the research focus.

The literature analysis underscores the ambiguity surrounding the specification of quality. In the context of complex systems, developers must explicitly address the specification, prioritisation, and metrication of quality characteristics (Gilb, 2005). Consequently, the initial proposition of this study is designed to explore the current practices in specifying quality requirements for IEs within ongoing projects.

Proposition 1

Current projects in IE domains do not capture quality requirements adequately in their specifications.

Questions were then formulated to examine the first proposition comprehensively by addressing various dimensions of quality requirements in IE domain projects. These included initiation, documentation, stakeholder collaboration, monitoring, conflict resolution, historical context, and evaluation phases.

  • Are quality requirements captured prior to the case study?

    This question seeks to understand the timing of capturing quality requirements. It aims to explore whether these requirements are considered from the inception of a project or later during development.

  • How are quality requirements specified in the current system specifications?

    This question delves into the methods and processes for specifying quality requirements in the current system specifications. It provides insights into the documentation and communication practices.

  • Are stakeholders’ visions taken into consideration when specifying the quality requirements?

    This is a crucial question which addresses the consideration of stakeholder involvement. It aims to understand whether the expectations of key project stakeholders are considered when defining quality requirements.

  • How is quality tracked during the development process?

    The rationale for this question is to uncover the mechanisms and tools used for monitoring and tracking quality aspects throughout the development lifecycle. It provides insights into whether and how quality standards have been maintained.

  • What is the strategy for managing conflicting quality requirements?

    This question sheds light on the strategies employed to handle conflicting quality requirements and how these were reconciled.

  • Was any previous benchmark data available on quality aspects of the system?

    The main purpose of this question was to investigate whether historical benchmark data related to quality aspects of the system exists. It seeks to understand the project's reliance on past performance metrics and benchmarks.

  • How is the system evaluated during and post development?

    This question looks at the evaluation processes employed both during and post development. It provides insight into the ongoing assessment of the quality of the system.

The literature exposes the absence of a suitable methodology for engineering higher quality IEs. As a result, the second proposition seeks to investigate the effects of implementing a quality-oriented methodology in the development of IEs and whether such an approach contributes to an enhancement in the overall quality of these systems. It is emphasised in the literature that continuous monitoring of quality characteristics throughout the system development lifecycle is essential for creating systems that align with their stringent quality requirements (Gilb, 2005; IPA, 2010). The lack of a dedicated tool for managing quality characteristics during the development of IEs is a notable observation, leading to the formulation of the second proposition.

Proposition 2

A quality enhanced methodology will lead to development of higher quality IEs.

To examine the second proposition, various aspects such as stakeholder engagement, resource implications, developer perspectives, conflict resolution, and the ultimate impact on the quality of the developed IEs were investigated. These questions cover both qualitative and quantitative dimensions and aim to offer a well-rounded understanding of the effectiveness of the proposed quality-enhanced methodology.

  • How is stakeholders’ feedback captured during the development process?

    This question studies the involvement of stakeholders throughout the development process and focuses on how their feedback is collected. It aims to provide insights into the responsiveness of the methodology to stakeholders’ needs and expectations.

  • What is the impact of specifying quality requirement(s) for every functional requirement on development time, cost, and overall quality?

    This question explores the potential trade-offs involved in specifying quality requirements for each functional requirement and assesses the impact on development resources, time and the overall quality of developed IEs.

  • How effective is the methodology to developers?

    This question assesses the perception of developers regarding the effectiveness of the proposed methodology. It aims to provide insights into its practicality and feasibility in the development environment.

  • What is the impact on development cost and time using the proposed methodology?

    This question builds on the second question and seeks to quantify and understand the specific effects of the methodology on development costs and timelines, helping to evaluate its economic implications.

  • How are conflicting quality requirements managed?

    This question addresses how the enhanced methodology deals with potential conflicts while ensuring alignment of diverse quality expectations.

  • Does application of the methodology result in development of higher quality IEs?

    This question aims to determine whether the application of the quality-enhanced methodology indeed leads to the development of higher quality IEs.

The ISO/IEC 25010 (2021) quality-in-use model has gained prominence as a widely adopted approach for assessing overall system quality, as revealed in the literature. This model is versatile and can be customized to align with the specific characteristics of the system under examination. Consequently, the third proposition delves into the examination of the influence of an IE-specific quality-in-use model in evaluating the quality of a system throughout both its development and post-development phases.

Proposition 3

An IE-specific quality-in-use model is beneficial for evaluating the quality of IEs during and post development.

To address key aspects of the third proposition, questions which focused on the specific qualities and benefits associated with the proposed IE specific quality-in-use model were formulated. These investigated the relevance of quality characteristics, examined stakeholder visibility, and assessed the effectiveness of the model to developers. The questions aim to provide a comprehensive understanding of the impact of the quality model on the evaluation of IE quality during and after development.

  • How relevant is the proposed mandatory list of quality characteristics?

    This question evaluates the relevance of the mandatory list of quality characteristics proposed by the IE specific quality-in-use model. This is important because it helps to understand their effectiveness in capturing essential aspects of IE quality.

  • Does the quality-in-use model provide more visibility about the quality of the system to stakeholders?

    This question seeks to explore the communicative aspect of the quality-in-use model and whether it enhances visibility for stakeholders. It provides insight into its effectiveness in conveying the system quality.

  • How effective is the quality-in-use model to developers?

    This question assesses the practicality and utility of the quality-in-use model from the perspective of developers. It aims to understand whether the model is effective in guiding development efforts.

These propositions aim to address the research question by exploring different aspects of how the integration of a framework can contribute to improving the engineering of higher quality IEs, with a specific focus on quality requirement definition, specification, and measurement throughout development.

4 Proposed quality-in-use model and methodological framework

In this section, we present the methodological framework and quality-in-use model to develop and evaluate higher quality IEs.

4.1 A quality-in-use model for IEs

We propose to evaluate the quality of Intelligent Environments (IEs) through a refined "quality-in-use" framework, which we define specifically for IEs as "the degree to which a product or system can be used by specific users to meet their needs and accomplish specific goals." This definition is rooted in the universally recognised ISO/IEC 25010 (2021) standard, which we adapt to address the unique demands of IEs. Building on the work of Erazo-Garzon et al. (2021) and Salomón et al. (2023), who respectively adapted ISO/IEC 25010 (2021) for AAL systems and context-aware software systems, we propose a novel quality-in-use model tailored for IEs. Our adaptation process is detailed, ensuring clarity in how metrics are defined and applied:

  1. Adaptation of the Generic ISO/IEC 25010 (2021) Model:

    • We assessed each ISO/IEC 25010 (2021) characteristic for its relevance to IEs.

    • We decided to retain characteristics like effectiveness, efficiency, and satisfaction.

    • We removed or updated certain attributes that were less applicable to the IE context.

  2. Integration of IE Principles:

    • We mapped nine fundamental IE principles directly to relevant ISO/IEC 25010 (2021) quality characteristics.

    • For principles without a direct ISO/IEC 25010 (2021) counterpart, we introduced new sub-characteristics.

    • Each new or updated quality characteristic was clearly defined, ensuring relevance to IEs.

  3. Definition of Specific Metrics:

    • For each quality characteristic and sub-characteristic, we established clear, measurable metrics from ISO/IEC 25023:2016 official documentation (ISO/IEC 25023, 2022).

    • These metrics were developed to be IE-specific, ensuring they are tangible and relevant.

We followed an iterative process involving multiple refinement cycles to ensure comprehensive and consensus-based metric development. Various models were presented to the two co-authors. This process continued until consensus was reached among all three authors regarding the quality-in-use model depicted in Fig. 5. The finalised model incorporates adjustments to context completeness and flexibility while maintaining all sub-characteristics within freedom from risk. Additionally, seven new sub-characteristics, aligned with the remaining seven principles, were integrated under the categories of effectiveness, efficiency, and satisfaction. The measurement functions for these quality characteristics were then rigorously defined, adhering to the ISO/IEC 25023 (2022) Quality Measurement Framework standards. Each metric was constructed to be measurable, relevant, and specific to the IE context, ensuring practical applicability and clarity. We employed a clear scale, from 0 to a defined maximum, where values closer to the maximum indicate higher quality-in-use. The proximity to 1.0, for instance, signifies exceptional performance. Table 5 provides a summary of each metric, its scale, and the target values for higher-quality performance. By incorporating these details directly into the text, we aim to provide a clear, actionable, and transparent framework for evaluating the quality-in-use of Intelligent Environments, offering stakeholders a detailed and practical tool for assessment.
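To make the scale concrete, the following sketch shows how ISO/IEC 25023-style ratio measures of the form X = A/B could be computed and averaged per quality characteristic. This is an illustration only, not the authors' tooling; all function names, groupings, and figures are hypothetical.

```python
# Illustrative sketch: ISO/IEC 25023-style ratio measures (X = A/B)
# aggregated per quality characteristic. Names and figures are hypothetical.

def ratio_measure(a: int, b: int) -> float:
    """Generic X = A/B measurement function; the result lies in [0, 1]."""
    if b <= 0:
        raise ValueError("denominator B must be positive")
    return a / b

# Hypothetical measurements grouped by quality characteristic
measurements = {
    "effectiveness": [ratio_measure(8, 10), ratio_measure(9, 10)],
    "efficiency":    [ratio_measure(7, 10)],
    "satisfaction":  [ratio_measure(4, 5)],
}

# Simple aggregation: mean per characteristic. Values closer to 1.0
# indicate higher quality-in-use, mirroring the scale described above.
scores = {c: sum(vals) / len(vals) for c, vals in measurements.items()}
```

The averaging step is only one possible aggregation; weighted schemes could equally be used where stakeholders rank some characteristics above others.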

Fig. 5

Quality-in-use Model for IEs

Table 5 Measurement Functions for Proposed Quality-in-use Model

4.2 Proposed methodological framework: UCIEDP2

The initial choice for guiding the development of Intelligent Environments (IEs) was the User-Centred Intelligent Environment Development Process (U-CIEDP), as introduced by Augusto (2014). However, it has been observed that U-CIEDP lacks a well-defined strategy for effectively managing quality requirements throughout the development process (Augusto et al., 2018; Ogbuabor et al., 2021; Santokhee et al., 2019). In response to this limitation, we have enriched the U-CIEDP with new activities tailored to each of its core phases: Initial Scoping, Main Development, and IE Installation, as detailed in Tables 6, 7, and 8. This enhanced methodology, dubbed UCIEDP2, is systematically illustrated in Fig. 6 and embodies a more quality-centric approach:

  1. Initial Scoping: UCIEDP2 commences with collecting stakeholders' visions and requirements via interviews, alongside documenting crucial project characteristics: time, cost, and scope. An initial set of functional requirements is established, inviting stakeholders to define specific quality characteristics. These characteristics are anchored at the functional level to enable precise quantification and control, informed by the research of Brodie and Woodman (2009) and Fenton and Bieman (2014), and measured according to standards like ISO/IEC 25010 and ISO/IEC 25012 (ISO/IEC 25010, 2021; ISO/IEC 25012, 2021). We identified that in U-CIEDP, this phase lacked concrete steps for integrating quality requirements from the outset. In UCIEDP2, we have introduced specific activities such as 'Establishing Project Vision' and 'Gathering Critical Success Factors' which ensure that quality is considered from the earliest stages. Each activity is directly linked to overcoming U-CIEDP's initial shortcomings, offering a detailed methodology for capturing and documenting stakeholder quality expectations based on ISO/IEC quality standards.

  2. Quality and Requirements Documentation: Targets for each quality characteristic, per functional requirement, are defined in collaboration with stakeholders and captured in a customized Impact Estimation Table (IET), adhering to Gilb's principles (2005). An example of an IET can be found in Appendix C, enhancing reproducibility and ensuring clear, actionable guidelines for quality assessment. Following this, prototypes and design concepts are developed, rigorously evaluated against the set quality targets, cost, and time constraints. Moving beyond the broad guidelines of U-CIEDP, UCIEDP2 recommends precise quality metrics, drawing from ISO/IEC 25010 (2021) and ISO/IEC 25012 (2021) standards. The IET is designed for tracking and evaluation of these metrics throughout the development lifecycle.

  3. Main Development: This phase involves developing detailed designs and test cases using appropriate design methodologies, with iterations as needed. Implementation follows, preferably in short, incremental cycles, prioritising high-importance requirements and integrating regular stakeholder feedback to ensure alignment with quality objectives. Code verification employs methods such as test-driven development and code reviews (Jones & Bonsignour, 2011). Post-implementation, the system undergoes comprehensive testing and quality evaluations against the IE quality-in-use model, with results documented in the IET to identify any discrepancies from set targets, as outlined in Table 7.

  4. Post-Deployment Assessment: The final phase focuses on assessing the deployed system's performance from the users' perspective, incorporating user acceptance testing based on the established quality-in-use model. Findings, along with any variances in quality attributes, are systematically documented in the IET. This stage's specific activities are enumerated in Table 8.
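The IET that threads through these phases can be thought of as a simple tabular structure recording, per functional requirement, the baseline, target, and latest value of each quality measure. The following is a minimal sketch in the spirit of Gilb (2005); the field names and figures are hypothetical, and the actual template used in this study is given in Appendix C.

```python
# Minimal, hypothetical sketch of an Impact Estimation Table (IET) row.
# Not the template from Appendix C; illustrative names and figures only.
from dataclasses import dataclass

@dataclass
class IETRow:
    requirement: str   # functional requirement ID
    measure: str       # quality measure name
    baseline: float    # value recorded for the first prototype
    target: float      # target value agreed with stakeholders
    current: float     # latest measured value

    def met(self) -> bool:
        """True when the latest measurement reaches the agreed target."""
        return self.current >= self.target

iet = [
    IETRow("FR01", "description completeness", 0.20, 0.60, 0.65),
    IETRow("FR02", "user guidance completeness", 0.40, 0.80, 0.75),
]

# Flag requirements still short of their targets for the next iteration
shortfalls = [row.requirement for row in iet if not row.met()]
```

Updating such rows at the end of each iteration gives stakeholders the discrepancy view that phases 3 and 4 rely on.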

Table 6 Enhancements to Initial Scoping Stage
Table 7 Enhancements to Main Development Stage
Table 8 Enhancements to IE Installation Stage
Fig. 6

Quality Enhanced U-CIEDP (UCIEDP2)

By integrating these modifications into UCIEDP2, our aim is to present a model that is not only robust and quality-focused but also transparent and replicable. We are committed to fostering a development lifecycle for IEs that meets functional requirements while excelling in user satisfaction and quality assurance. The enhancements introduced here are designed with the broader research community in mind, offering a blueprint for quality-driven development in this dynamic field.

5 Case study research design

In this study, an enhanced methodological framework, UCIEDP2, has been proposed for engineering high quality IEs. To explore the applicability of UCIEDP2 in different contexts, a multiple case study design was deemed most appropriate, as it allows for generalisation (Murzi, 2007; Staron et al., 2011; Kurtel & Ozemre, 2013; Runeson & Höst, 2009; Sicari et al., 2019; Scott et al., 2021; Tröls et al., 2021; Yin, 2018). Another motivation for using case study research is that we would like to investigate whether the framework would apply to real-world settings (Dalcher & Brodie, 2007). We report on two case studies in this paper: a final-year undergraduate project and an industry project. Design of the case studies was largely inspired by Yin (2018) and consisted of the following five components:

  1. “a study’s questions,
  2. its propositions, if any,
  3. its unit(s) of analysis,
  4. the logic linking the [collected] data to the propositions; and
  5. the criteria for interpreting the findings.”

The main research question and corresponding propositions underpinning this study are defined in Section 3.9. The unit of analysis in each project was the development of a system using UCIEDP2. As far as the fourth component is concerned, the literature review revealed several questions which were linked to the propositions. Regarding the fifth component of the case study design, the questions defined for each proposition were instrumental in identifying the types of data which had to be collected and the strategies to analyse the data. Data for this study was mostly collected from interviews, project documentation reports, test scripts for user acceptance testing, and bug reports. Employees in different roles were interviewed. The interviews were carried out as semi-structured interviews and recorded as audio files. The collected data was mostly qualitative, with some quantitative data such as time, cost, and budget. The main researcher’s role in both case studies was to provide support for the application of the UCIEDP2 methodology and to provide an IET for data collection (Alasuutari et al., 2008; Saunders et al., 2016). Informed consent was obtained from all participants involved in the study. Collaboration with the industry partner was made possible due to an existing memorandum of understanding with Middlesex University Mauritius. In the next sections, we describe the two case studies.

5.1 Case study I: smart home monitoring system

The first case study is development of a smart home monitoring system. Figure 7 shows the architecture diagram of the implemented system. Three sensors were connected to an Arduino Nano board. Data collected by the sensors were transmitted to a MySQL database hosted on a Raspberry Pi 4 computer using radio frequency. An Apache Web Server was also installed on the computer. The Web Server hosted a web-based application to monitor energy consumption, and provide recommendations, such as alerts and tips energy for saving, using an adapted k-means clustering algorithm. The project stakeholders were a third-year student as developer, a project supervisor, and an experienced business consultant from a software development company as customer. First, we discuss development of the system using prototyping. A kick-off meeting was scheduled at the beginning of May 2020. Since Mauritius was under lockdown during that period due to the Covid-19 pandemic, the meeting was held online. All the project stakeholders participated. The project specification document was examined to determine how the requirements were specified. The initial list of functional and non-functional requirements is given in Tables 9 and 10 respectively. It is worth mentioning that the non-functional requirements were lacking specific metrics and were specified rather vaguely. Prototypes for the web-based application were developed by following Nielsen Heuristics (Nielsen, 1994). Low fidelity mock-ups were designed by the developer, and these were validated by the business consultant. General feedback and improvements were recorded on paper. Quality requirements were specified as non-functional requirements for the whole system. However, no metrics were specified for the non-functional requirements as given in Table 10. The developer also indicated that test cases were defined for each functional requirements and these were executed towards the end of development. 
The system was evaluated using the Technology Acceptance Model (TAM) with six constructs and a score of 4.47 was recorded (Sharma et al., 2022).
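The adapted k-means algorithm used for the recommendations is not detailed in the paper. As a rough illustration only, a plain one-dimensional k-means over consumption readings could flag high-usage periods for alerts; all readings below are hypothetical, and the initialisation is a deterministic simplification, not the authors' adaptation:

```python
def kmeans_1d(readings, k=3, iters=20):
    """Plain one-dimensional k-means over consumption readings (watts).

    The paper's adapted k-means is not described in detail, so this is only
    an illustrative baseline; centroids are initialised at evenly spaced
    quantiles to keep the example deterministic.
    """
    srt = sorted(readings)
    centroids = [srt[i * (len(srt) - 1) // (k - 1)] for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x in readings:
            nearest = min(range(k), key=lambda j: abs(x - centroids[j]))
            clusters[nearest].append(x)
        # Recompute each centroid; keep the old one if its cluster emptied.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters

# Hypothetical hourly consumption readings (watts).
readings = [120, 130, 110, 900, 950, 870, 400, 420, 390, 125, 880, 410]
centroids, clusters = kmeans_1d(readings)

# A simple recommendation rule: raise alerts for readings that fall in
# the highest-consumption cluster.
high = max(range(len(centroids)), key=lambda j: centroids[j])
alerts = clusters[high]
```

A real deployment would cluster richer features (time of day, appliance, duration) rather than raw wattage, but the alert-from-cluster idea is the same.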

Fig. 7
figure 7

Architecture Diagram of Smart Home Monitoring System

Table 9 List of functional requirements
Table 10 List of non-functional requirements

5.1.1 Application of UCIEDP2

An online half-day workshop was held in mid-May 2020, during which the UCIEDP2 methodology and the quality-in-use model were explained to all stakeholders using PowerPoint slides. The stakeholders agreed that the vision of the system was to develop a low-cost and easy-to-use web-based application to monitor energy consumption and provide accurate recommendations on energy usage. Upon consultation of the ISO/IEC 25010 (2021) and ISO/IEC 25012 (2021) models, the stakeholders unanimously agreed on a set of quality sub-characteristics for each functional requirement. The details were compiled in an IET, as summarised in Table 11. For brevity, a cut-down sample of the new system specification is given below. In this example, functional requirement FR07 is now quantified as follows:

  • Functional requirement: FR07

  • The system shall provide a live graphical display of electricity consumption.

  • Quality sub characteristic: Usability.Appropriateness recognizability

  • Measure: Description completeness

  • Measurement function: X = A/B

  • A = Number of usage scenarios described in the product description or user documents

  • B = Number of usage scenarios of the product

  • 0 ≤ X ≤ 1

  • Quality sub characteristic: Usability.Learnability

  • Measure: User guidance completeness

  • Measurement function: X = A/B

  • A = Number of functions described in user documentation and/or help facility as required

  • B = Number of functions implemented that are required to be documented

  • 0 ≤ X ≤ 1

Table 11 Quality measures for functional requirements
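Both measures above reduce to a simple ratio of counts. As a minimal sketch, the following computes them with hypothetical counts chosen to be consistent with the baseline values reported in the text (0.14 and 0.43 for FR07); the clamping behaviour is an assumption, not taken from the standard:

```python
def ratio_measure(a, b):
    """Generic X = A / B measure in the ISO/IEC 25023 style, clamped to [0, 1].

    Treats b == 0 as 'nothing required', i.e. fully complete (an assumption).
    """
    if b == 0:
        return 1.0
    return min(a / b, 1.0)

# Hypothetical counts for FR07, consistent with the reported baselines:
# 1 of 7 usage scenarios described, 3 of 7 required functions documented.
description_completeness = ratio_measure(1, 7)    # 1/7 ≈ 0.14
user_guidance_completeness = ratio_measure(3, 7)  # 3/7 ≈ 0.43
```

In practice the counts A and B would come from a manual review of the product documentation against the agreed list of usage scenarios and functions.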

A second online meeting was convened a week later. This time the developer presented the results recorded for each functional requirement, including the mandatory quality requirements defined in the quality-in-use model. Using this data as a baseline, the project stakeholders discussed target values for each of the measures and updated the IET accordingly. One iteration of phase one of UCIEDP2 was sufficient to complete the initial scoping. During the second phase of UCIEDP2, the developer focused on improving the existing system based on the available data. Priority was given to functional requirements with very low measure scores. For instance, for FR07, description completeness was initially measured at 0.14 while user guidance completeness was 0.43. These improved to 0.57 and 0.79 respectively in the second prototype. The developer progressively improved the functional requirements; however, significant time was required to research and apply new concepts. Completion of phase two necessitated four iterations, and the consultant was involved at the end of each iteration for feedback. Development time increased from 10 person-days for the first prototype to 18 person-days. The system was then evaluated using the quality-in-use model. Figure 8 shows a comparison of the measurements of the mandatory quality characteristics. All three project stakeholders were then interviewed for feedback at the completion of the project.

Fig. 8
figure 8

Evaluation of the Two Systems Using the Quality-in-use model

5.1.2 Findings

Next, we discuss the findings of this case study against the research propositions.

Proposition 1

 Current projects in IE domains do not capture quality requirements adequately in their specifications.

  • Are quality requirements captured prior to the case study?

    Partly

  • How are quality requirements specified in the current system specifications?

    They are expressed quite vaguely as non-functional requirements for the overall system.

  • Is the stakeholders’ vision taken into consideration when specifying the quality requirements?

    No

  • How is quality tracked during the development process?

    Quality is not tracked during the development process.

  • What is the strategy for managing conflicting quality requirements?

    No strategy.

  • Was any previous benchmark data available on quality aspects of the system?

    No data was previously available.

  • How is the system evaluated during and post development?

    User acceptance testing and TAM were administered post development only.

The findings highlight notable deficiencies in the existing method of capturing and defining quality requirements. Prior to the case study, quality requirements were only partially documented and were expressed vaguely as non-functional requirements for the entire system. Furthermore, the vision and priorities of stakeholders were disregarded during the specification of these requirements. Quality was not tracked throughout the development process, and no strategic framework existed for handling conflicting quality requirements. Additionally, no baseline data existed regarding the quality aspects of the system, impeding the ability to accurately assess and enhance the system during and after development. The only evaluations conducted were User Acceptance Testing (UAT) and the Technology Acceptance Model (TAM), administered post-development, which is potentially insufficient for uncovering all system issues. In summary, these deficiencies underscore the necessity for a more comprehensive and systematic approach to capturing, defining, and assessing quality requirements within IE domains.

Proposition 2

A quality enhanced methodology (UCIEDP2) leads to development of higher quality IEs.

  • How are stakeholders’ feedback captured during the development process?

    Specifying the vision was identified as a key step. The stakeholders agreed that it helped them to consider relevant quality characteristics which would contribute towards meeting the vision of the system. However, picking the right quality characteristics from the ISO/IEC 25000 (2021) standards and agreeing on the quality targets were quite challenging. The researcher had to schedule a session to explain the ISO/IEC 25010 (2021) and ISO/IEC 25012 (2021) quality models. The developer then measured each functional requirement and recorded the results in an IET (Table 11). Stakeholders agreed that they were more involved during each stage of development. The consultant highlighted that giving feedback based on the IET worked well and they were able to monitor progress more objectively.

  • What is the impact of specifying quality requirement(s) for every functional requirement on development time, cost and overall quality?

According to the developer, more time was required during each stage of development as more checks and tests were needed to ensure quality expectations were being met for each functional requirement. He claimed the number of test cases almost tripled due to the increase in the number of test conditions. Importantly, however, the IET gave them a clearer picture of the realization of quality characteristics for each requirement in the existing version of the application. This allowed the developer to better focus on critical areas which needed improvement. It was also possible to measure the functional requirements using the quality characteristics at design time and to monitor progress during implementation. However, since this was a final-year project, it was difficult to estimate the true cost of development effort as the budget was capped at $100 to cover hardware expenses only.

  • How effective is UCIEDP2 to developers as a methodology?

The developer claimed that the activities for each phase were clearly defined and conducive to managing the quality requirements during development. His main challenge was picking quality characteristics based on the ISO/IEC 25000 (2021) standards. To address this challenge, the main author arranged a session involving all three stakeholders, during which the ISO/IEC 25010 (2021) and ISO/IEC 25012 (2021) quality models were explained with reference to the official documentation. Due to time constraints, stakeholders opted to focus on a select few quality characteristics from the model that could be realistically implemented. Subsequently, a second session was scheduled a week later, during which the stakeholders actively participated and achieved unanimous consensus on the chosen quality characteristics. The requirements were re-adapted by incorporating quality characteristics, as summarised in Table 11. Since functional requirements already existed for the system, only one cycle was required during the initial scoping stage. The developer also evaluated the existing system using the quality-in-use model to obtain initial data for each mandatory quality characteristic. Thereafter, target and current values for each measure were defined and recorded in the IET. During the main development stage, the developer measured each functional requirement independently during implementation using the quality characteristics. A second round of measurements was carried out after all requirements were implemented to check for large deviations from expected results. The developer highlighted that although this strategy worked for this project, clearer guidance could be provided to explain the process of measuring the functional requirements.

  • What is the impact on development cost and time using UCIEDP2?

The main impact was on development time, which increased from 10 person-days to 18 person-days.

  • How are conflicting quality requirements managed?

    There were no conflicting quality requirements.

  • Does application of UCIEDP2 result in development of higher quality IEs?

There is a consensus among all those involved in this project that having a list of well-defined functional and quality requirements with target baselines helped throughout the project. The developer highlighted that the baselines helped track progress during the development stages. However, fixing some of the design and coding issues alone was very challenging and time consuming for the developer. A significant amount of time was spent learning and testing new coding concepts to meet some quality expectations. The stakeholders agreed that the application of UCIEDP2 resulted in the development of a higher quality system compared to the previous version, as evidenced in Table 11. They were also able to choose the solution which delivered the best value in terms of quality, cost and development effort during design, and to monitor its subsequent implementation.

Proposition 3

An IE specific quality-in-use model is beneficial to evaluate quality of IEs.

  • How relevant is the proposed list of mandatory quality characteristics?

    The stakeholders were particularly interested in privacy, context coverage and user control out of the nine mandatory quality characteristics. In addition, they chose trust based on the objectives of the project. Nevertheless, the developer evaluated the existing system using the quality-in-use model and recorded data for every mandatory quality characteristic in the IET. In the second version developed using UCIEDP2, priority was given towards improving the selected four quality characteristics. Overall, the developer and consultant agreed that the measures were relevant to measure each quality characteristic. They acknowledged the importance of the remaining mandatory requirements towards developing more acceptable systems. However, due to time and resource constraints these would be improved in future versions. The radar map in Figure 8 captures the degree to which each quality characteristic has been realized using the two methodologies.

  • Does the quality-in-use model provide more visibility about the quality of the system to stakeholders?

Both the consultant and the programmer revealed that the quality-in-use model was instrumental in gaining a better appreciation of the system quality. They agreed that the data helped to evaluate the system more effectively and objectively from a user perspective. In the previous version, user acceptance testing and TAM were only performed after development of the system. The quality-in-use model also allowed for iterative development, using the collected data as feedback to make incremental progress towards the quality targets.

  • How effective is the quality-in-use model to developers?

According to the developer, the quality-in-use model was relatively easy to use. More importantly, it was useful as a planning tool to manage the complexity of these types of systems and to better manage conflicting quality requirements.

5.2 Case study II: planning solution

The second case study was carried out in a mid-sized local company with a workforce of 30 employees, including nine developers and three testers. The company delivers telematics solutions including geographical platforms for measuring asset performance in real time, fleet management and IoT solutions. Its customer base includes public and private sector authorities, including organisations within the African subcontinent. It transitioned from a waterfall methodology to Agile Scrum in 2016. However, the management of the company was keen to explore alternative methodologies which would improve their time to market, allow earlier identification and correction of defects, and reduce the volume of regressions and reworks post deployment.

The project consisted of developing a planning solution for a new customer. Its purpose was to optimize the daily operation of vehicles, spot route deviations, optimize fuel efficiency and ensure customer satisfaction. The application provides real-time information for connected assets. At the core of the platform is a tracking engine. GPS trackers, fuel and machine sensors are installed on clients’ vehicles. Positional and engine-related information is then transferred over the mobile networks to the tracking engine, which corrects any positional discrepancies and false coordinates that might occur due to dropped network connections during transmission. The low-level information is then translated to a database format capable of being used by other applications. The information is rendered in real time using graphical representations that allow users to monitor the performance of the asset. Figure 9 shows the high-level architecture of the solution. This project was selected by the company since a similar one had been delivered in March 2019 to a different customer and would thus serve as a baseline for comparisons.

Fig. 9
figure 9

High level Architecture Diagram of Planning Solution

5.2.1 Application of agile scrum

A kick-off meeting was held in May 2021 with key stakeholders such as the general manager, account manager, chief programmer, and lead tester. It was decided that the study would be conducted in two phases. The first phase consisted of interviewing each team member to obtain background information on current system development practices at the company. Due to time constraints and availability, the interviews were scheduled over a period of one month, and each lasted approximately one hour. Following this initial round of interviews, the main observations were summarised as follows. The company has adopted Agile Scrum as its development methodology and the team engages in daily stand-up meetings. Customers, normally a project sponsor and an IT manager or technical person, are involved during every stage of the project from the start. Requirements gathering consists of capturing the needs of the clients first; these are translated to functional requirements by the consultants. Once all requirements have been obtained, the project manager defines all the fields, process flows, data capture and screen flows in a software design document, which is then validated by the customer. Development starts following approval from the customer. Test scripts are also written by a test manager for each screen. When development is completed, testing is carried out by the testers. Following this, the system is deployed for user acceptance testing by the actual users. Quality is essentially assessed through the testing process; the company does not enforce any kind of best practice with regard to quality, so it all comes down to the individual expertise of the development team. However, although customers are satisfied with the support and engagement provided, the main limitation is the volume of regressions occurring post deployment, which affects the timely delivery of other projects. Bugs are tracked in the ClickUp (2022) bug tracking software.
A decision was also made to assign the same Scrum team to the new project: two senior developers (5+ years of experience), three junior developers (1–3 years) and three testers.

5.2.2 Application of UCIEDP2

The second phase of the study started in mid-June 2021 and was completed by the end of September 2021. It focused on the application of UCIEDP2 to a pilot project of similar complexity and size to the previous one. A half-day virtual meeting was conducted to explain UCIEDP2, the new quality-in-use model and the ISO/IEC 25000 (2021) standards to the key stakeholders and the development team. They were then tasked to propose quality characteristics based on the ISO/IEC 25000 (2021) standards. Upon consultation of the bug reports of the existing project, performance efficiency was identified as a key quality characteristic. Customers had also complained that the performance of some queries in the first project degraded over time. Therefore, the measures selected unanimously by the team were mean response time and response time adequacy. Out of the 75 functional requirements, 28 were identified as critical, and the two measures were defined for each of these 28 requirements. Below is an example for one critical requirement:

  • Functional requirement: User shall be able to insert planning request either on screen or by mass upload.

  • Quality sub characteristic: Performance Efficiency.Time Behaviour

  • Measure: Mean response time

  • Measurement function: X = (∑ Ai)/n, for i = 1 to n

  • Ai = Time taken by the system to respond to a specific user task or system task at the i-th measurement

  • n = Number of responses measured

  • Measure: Response time adequacy

  • Measurement function: X = A/B

  • A = Mean response time

  • B = Target response time specified

  • 0 ≤ X ≤ 1
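The two measures above can be computed directly from timing samples. The following is a minimal sketch with hypothetical timings against an assumed 5 s target; function names are illustrative:

```python
def mean_response_time(samples):
    """X = (sum of A_i) / n over measured response times, in seconds."""
    return sum(samples) / len(samples)

def response_time_adequacy(mean_rt, target_rt):
    """X = A / B; values at or below 1 indicate the target is being met,
    while values above 1 indicate the target is missed."""
    return mean_rt / target_rt

# Hypothetical timings (seconds) for the planning-request requirement.
samples = [4.2, 5.1, 4.8, 4.9]
mrt = mean_response_time(samples)          # 4.75 s
adequacy = response_time_adequacy(mrt, 5.0)  # 0.95, within the 5 s target
```

In the case study these figures would come from instrumented production measurements rather than a fixed sample list.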

The project team consulted the previous product backlog but prioritised the 28 critical requirements first. However, all the developers were first assigned to record baseline values for the two measures for every functional requirement from the first planning project, supported by the two testers. Data was recorded in an IET (Table 12) and shared among the entire team. A second meeting, led by the senior developers, was carried out a week later to reach a consensus on the quality targets for each functional requirement, including the mandatory requirements for the quality-in-use model. They presented the recorded baseline values and proposed targets for the two measures for each functional requirement. There were some divergences of opinion with regard to the targets among the team members. However, these were resolved through discussion of the discrepancies and identification of any trade-offs that were deemed necessary. The team members also explored compromises and collectively arrived at a consensus on the quality targets. That concluded the initial scoping phase of UCIEDP2. The team decided to adopt a combined iterative and incremental approach to development. The 15 highest-priority requirements were implemented first during the main development phase. The artefact produced was then evaluated using the quality-in-use model in a production environment. The IET was updated with the data collected, and inconsistencies were easily identified, which led to further improvement. Feedback was then sought from the general manager and account manager before work started on the next increment. Overall, after four cycles during development, a complete version of the system was ready for system testing. The two testers then proceeded with release testing of the system using the proposed quality-in-use model. Data on the number of regressions and reworks was collected for three releases.
Figure 10 shows a comparison between the two planning systems developed using Agile Scrum and UCIEDP2 respectively.

Table 12 Partial Impact Estimation Table for Planning Solution
Fig. 10
figure 10

Evaluation of the Two Planning Solutions Using the Quality-in-use model
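The IET rows described above pair each requirement and measure with baseline, goal, and measured values. A simple record type can capture this; field names are illustrative (the paper's actual layout is in Table 12), and the figures mirror the mean-response-time example reported in the findings (10 s baseline, 5 s goal, 2 s achieved):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class IETRow:
    """One Impact Estimation Table entry for a requirement/measure pair.

    Field names are illustrative; the paper's actual IET layout is in Table 12.
    """
    requirement: str
    measure: str
    baseline: float
    goal: float
    actual: Optional[float] = None

    def goal_met(self):
        """For response times, lower is better: met once actual <= goal."""
        return self.actual is not None and self.actual <= self.goal

# Mirrors the mean-response-time figures reported in the findings:
# 10 s baseline under Agile Scrum, a 5 s goal, 2 s measured after refactoring.
row = IETRow("Insert planning request", "Mean response time (s)",
             baseline=10.0, goal=5.0, actual=2.0)
```

A full IET would hold one such row per critical requirement per measure, giving the team a single shared artefact for spotting deviations between increments.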

5.2.3 Findings

The project team members were individually interviewed at the completion of the project to gather their feedback on UCIEDP2 and the quality-in-use model. In this section, we discuss the findings of this case study against the research propositions.

Proposition 1

Current projects in IE domains do not capture quality requirements adequately in their specifications.

  • Are quality requirements captured prior to the case study?

    No

  • How are quality requirements specified in the current system specifications?

    Not specified. However, performance testing of the application is done if time permits. Usability is validated during the user acceptance testing. The lead tester provides test scripts to the customers and assists them whenever required.

  • Is the stakeholders’ vision taken into consideration when specifying the quality requirements?

    No

  • How is quality tracked during the development process?

    Quality is not tracked during the development process.

  • What is the strategy for managing conflicting quality requirements?

    No strategy.

  • Was any previous benchmark data available on quality aspects of the system?

    No data was previously available.

  • How is the system evaluated during and post development?

    User acceptance testing post development only.

Proposition 2

A quality enhanced methodology (UCIEDP2) leads to development of higher quality IEs.

  • How are stakeholders’ feedback captured during the development process?

The general manager confirmed that customers are involved during every stage of a project. The team generates the requirements specification, design document and test scripts after capturing the full customised demands of the clients. With the adoption of UCIEDP2, the requirements were quantified, and quality targets were defined following consultation with the project team. The researcher had to schedule a session to explain the ISO/IEC 25010 (2021) and ISO/IEC 25012 (2021) quality models. The account manager recorded these metrics in an IET, which was then used as a basis for assessing which potential design solutions were more likely to meet the quality targets, along with estimated development effort, cost, and time. This session was facilitated by the team leaders, senior developers, and testers. The potential design options were then presented to the general manager and customers, and a consensus was reached on the solution which would likely provide the best value in terms of quality, time, and cost. Following this phase, all the fields, process flows, data capture and screen flows were captured in a design document, which was again validated by the general manager and the customers. Customers’ feedback was instrumental in driving the implementation of the new system. After each increment, the artefact was demonstrated and the resulting quality-related data was used as a basis for decision making.

  • What is the impact of specifying quality requirement(s) for every functional requirement on development time, cost, and overall quality?

Looking at Table 12, the functional requirement is placed in the left-hand column; Goal represents the expected target value while Actual captures the value which was measured. For instance, the mean response time for this functionality was noted to be 10s for the system developed using Agile Scrum. A 50% improvement was proposed and the target was set to 5s. After refactoring the code, the response time improved further to 2s. However, the number of system tests increased considerably from 334 to 725 with the introduction of quality characteristics, since new test cases were defined. Total effort also increased from 21 person-days to 32 person-days for the new project, and the estimated development cost rose by Rs 75K. Table 13 summarises the two projects on key characteristics.

    Table 13 Comparison Between Planning Projects
  • How effective is UCIEDP2 to developers as a methodology?

The senior developers agreed that while using Agile Scrum, the focus was more on delivering system functionalities after every sprint, with testing always performed by the quality assurance team at the end. However, with the adoption of UCIEDP2, quality measures were now incorporated with every functional requirement. They reflected that this encouraged them to perform tests early and to check that quality targets were being met during development, which was also beneficial since it allowed errors to be detected early on. On the other hand, since the team was composed of junior and senior developers, there were some divergences in the effort and time required to complete tasks of similar complexity. The junior developers needed more consultation and support from seniors to resolve technical issues. The two testers reflected that previously they only ran test scripts to validate the functional requirements; with UCIEDP2, they also had to measure the quality characteristics for each functional requirement, which led to a significant increase in effort and time. Nevertheless, the general perception was that UCIEDP2 was very effective because everybody had a clearer picture of the project’s expectations. On the downside, significant time and effort were required and it was tricky to manage multiple projects.

  • What is the impact on development cost and time using UCIEDP2?

Total effort increased from 21 person-days to 32 person-days, and the estimated development cost was also higher, as summarised in Table 13. The general manager highlighted that additional time was required to apply UCIEDP2 and to cover the comparatively higher number of system tests. However, on the positive side, bugs were discovered early during development and there was a decrease in the number of reworks over three releases. The number of regressions had also fallen, especially on issues related to the response times of queries. According to the general manager, UCIEDP2 offered a better return on investment than Agile Scrum on this planning project.

  • How are conflicting quality requirements managed?

Managing performance (response time) and usability in the planning solution was challenging previously, as the solution requires the availability of data in real time. Performance-related issues increased as the user interfaces grew more complex. Unfortunately, many of these problems emerged during testing and led to reworks or quick fixes to ship the system on time. With the introduction of UCIEDP2, since performance is now quantified, the senior developers were able to find problems earlier during development. Crucially, they had collected critical data to explain to customers what was technically feasible.

  • Does application of UCIEDP2 result in development of higher quality IEs?

The consensus within the team is that they had more confidence in the quality of the system that was developed. By measuring quality throughout the development process, all five developers agreed that quality was embedded into the system. This is also supported by a reduction in major reworks and regressions post deployment for the new planning project (Table 13). The general manager and account manager highlighted that UCIEDP2 gave a sense of direction in terms of design and quality choices, and they were able to give more objective feedback to customers. They were thus able to choose the solution which would deliver the best value in terms of quality, cost, and development effort.

Proposition 3

An IE specific quality-in-use model is beneficial to evaluate quality of IEs.

  • How relevant is the proposed list of mandatory quality characteristics?

    The team acknowledged that evaluation of systems was always performed using user acceptance testing at the end of development. The introduction of mandatory quality characteristics served as a reminder of the critical attributes that systems of this nature should embody. To ensure a comprehensive assessment without resource dispersion across numerous quality dimensions, the team deliberately chose to concentrate on these nine mandatory quality characteristics for the current project. The quality-in-use model was first applied to the existing planning solution by the developers and testers. The results provided useful insights into the quality of the system. The team noted that quality targets for three out of the nine characteristics were already met.

The developers tried to improve the remaining six characteristics and were successful to some extent. The radar maps in Figure 10 show the degree to which each mandatory quality characteristic has been realized using the two methodologies for the planning solutions.

  • Does the quality-in-use model provide more visibility about the quality of the system to stakeholders?

    The senior developers pointed out that the quality characteristics revealed critical insights into the quality of their first planning solution. Previously, quality was always left to the quality assurance team. Testing during development was sporadic. However, the quality-in-use model offered tangible and measurable indicators by quantifying the quality aspects. They were also able to assess more objectively their implementation choices, allowing more iterative refinement of the system as development progressed. Moreover, they argued that metrics enabled a comparative analysis over time or across different projects. By establishing benchmarks and comparing metrics across various iterations or projects, teams can identify trends, assess the impact of process improvements, and make informed decisions for future development efforts.

  • How effective is the quality-in-use model to developers?

The implementation of the quality requirements, using the quality-in-use model, allowed the developers to perform a more comprehensive evaluation of the system. The senior developers agreed that this level of granularity was particularly beneficial as it allowed them to trace problems back to the functional level, gaining insights into the root causes of issues.

The comprehensive evaluation made possible by the quality-in-use model empowered them to take targeted and effective measures to enhance the overall quality of the system. By understanding the specific aspects that required attention, they were able to prioritise their efforts and implement changes that had a meaningful impact on the system’s performance. However, the two novice developers highlighted that they faced some challenges in applying the quality-in-use model concretely. While they appreciated the level of support provided by the senior developers, they expressed a need for more explicit guidelines and practical exercises to enhance their understanding and application of the quality-in-use model. This feedback highlights the importance of providing adequate resources and guidance, especially for less experienced team members, to maximize the model's effectiveness across the entire development team.

6 Discussions

In this section, we aggregate the findings from the two case studies.

Proposition 1

Current projects in IE domains do not capture quality requirements adequately in their specifications.

Ensuring a common understanding of system quality requirements among stakeholders is crucial, given the various interpretations of quality (Jones & Bonsignour, 2011). Both case studies highlight shortcomings in effectively addressing quality requirements. They were expressed quite vaguely as non-functional requirements encompassing the entire system, which is a common practice (Ali et al., 2022; Ruiz-López et al., 2013; Werner, 2022). However, a "one definition fits all" approach does not aptly apply to non-functional requirements (Chung et al., 2009). Each IE system has certain unique characteristics and stakeholder expectations which may require a more tailored approach to defining and implementing quality requirements. Unfortunately, we note that the stakeholders’ vision was often disregarded in both case studies. Werner (2022) further argues against applying quality requirements uniformly across the entire system due to potentially higher costs. This again highlights the need to adapt quality requirements to the specific characteristics and demands of each system.

Chung and do Prado Leite (2009) further draw attention to a disproportionate emphasis on functional requirements, with the risk that non-functional requirements may be neglected or overlooked. The study by Oriol et al. (2020) adds weight to this concern by highlighting that in Agile-based developments, quality requirements are often given less consideration than functional requirements. This again raises concerns about the effectiveness of development methodologies in ensuring a balanced focus on both functional and non-functional aspects of a system.

The argument presented by Brodie and Woodman (2009) against separating non-functional or quality requirements from functional requirements further reinforces the idea that these two types of requirements should be interrelated. Integrating quality characteristics with functional requirements, as advocated by Edward Deming (Ghobadian & Speller, 1994), ensures that quality becomes an inherent aspect of the system. A system may align with developer specifications, but customer expectations must also be met (Jones & Bonsignour, 2011). This approach discourages the dissociation of quality and functional aspects, promoting a holistic perspective for effective system development. Both case studies also revealed a lack of quality monitoring during the development process, and the strategic management of conflicting requirements was insufficient, exacerbated by the absence of benchmark data and evaluation methods.

Thus, there is a need to improve how quality requirements are currently addressed in IE domain projects. The highlighted deficiencies, namely the disregard of stakeholders' vision and the imbalance in emphasis between functional and non-functional requirements, collectively support the proposition that current projects in IE domains do not capture quality requirements adequately in their specifications.

Proposition 2

A quality enhanced methodology (UCIEDP2) leads to development of higher quality IEs.

In the first case study, Prototyping was initially used as the methodology (Sommerville, 2011). According to the developer, it gave him the flexibility to explore ideas and seek feedback from the consultant and supervisor. In the second case study, the company had adopted Scrum, a prevalent Agile method (Hron et al., 2022). The general manager pointed out that this offered several advantages over conventional approaches, such as prioritisation of requirements based on business value and delivery of potentially shippable product increments after each sprint (Hanslo et al., 2019). In addition, Scrum metrics and practices such as velocity, sprint length, and retrospective meetings were used to improve on project delivery objectives (Greening, 2015; Hanslo et al., 2019). The Scrum team members agreed that they were working very closely with customers to refine the requirements and were following recommendations from the Scrum Body of Knowledge (SCRUMstudy™, 2016) to ensure early value delivery. However, according to the general manager, the main issues were related to quality even though projects were being delivered on time. Surprisingly, a quantitative analysis of the Scrum framework found no significant positive correlation between quality and Scrum adoption (Hanslo et al., 2019).

In both case studies, testing was the predominant method to evaluate the systems post development. However, testing can only reveal the presence of errors, not their absence. Consequently, some requirements and design bugs persisted even after testing. For example, usability issues lingered in the first case study. In the second case study, reworks and regressions occurred after each release, primarily due to performance-related problems, incurring substantial overhead in costs, time, and resources. The company's management expressed concerns about the excessive time and resources spent on post-development bug fixing and regression handling, which impacted other projects through resource unavailability.

The introduction of the UCIEDP2 methodology mitigated these shortcomings by prioritising stakeholders' feedback during development. Most importantly, the vision of the stakeholders was translated into measurable quality characteristics from the ISO/IEC 25000 (2021) standards. Despite challenges in selecting appropriate characteristics and targets, the use of the ISO/IEC 25000 (2021) standards facilitated the specification of quality requirements. Agreed baseline and target values for each functional requirement were recorded in an Impact Estimation Table (IET). This also influenced how the most optimal solution was selected, based on cost, time, and quality expectations rather than subjective judgment. All participants agreed that UCIEDP2 successfully managed quality requirements, aided by the IET for progress tracking. Moreover, the systematic incorporation of stakeholders' feedback during development facilitated the integration of quality measures into each functional requirement, enabling early testing and tracking of quality targets. This was a departure from the conventional end-of-development testing which had served as the predominant assessment method in both case studies.
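The baseline-to-target tracking described above can be illustrated with a minimal sketch. The requirement names, quality characteristics, and values below are hypothetical, and the table structure is a simplified reading of impact estimation rather than the exact IET used in the case studies.

```python
from dataclasses import dataclass

@dataclass
class QualityTarget:
    """One IET row: a quality measure attached to a functional requirement."""
    requirement: str     # functional requirement the measure is tied to
    characteristic: str  # ISO/IEC 25000 characteristic, e.g. "time behaviour"
    baseline: float      # value measured before development started
    target: float        # value agreed with stakeholders
    current: float       # latest measured value

    def progress(self) -> float:
        """Percentage of the baseline-to-target gap closed so far.

        The sign of the gap handles both lower-is-better measures
        (e.g. response time) and higher-is-better ones (e.g. completion rate).
        """
        gap = self.target - self.baseline
        if gap == 0:
            return 100.0
        return 100.0 * (self.current - self.baseline) / gap

# Hypothetical IET rows: response time in seconds, task completion in percent.
iet = [
    QualityTarget("Show room occupancy", "time behaviour",
                  baseline=4.0, target=1.0, current=2.5),
    QualityTarget("Adjust lighting", "task completion",
                  baseline=70.0, target=95.0, current=90.0),
]

for row in iet:
    print(f"{row.requirement}: {row.progress():.0f}% of target gap closed")
```

Recomputing `current` after each increment and reviewing the percentages with stakeholders is one way such a table could support the progress tracking the participants described.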

UCIEDP2 departs from the inflexible nature of plan-based methods and addresses the absence of metrics for measuring quality in Scrum (Sommerville, 2011). It facilitates progress tracking by assessing quality measures during development, encouraging immediate problem resolution rather than deferring it to the end of development. Additionally, it offers the flexibility to accommodate various lifecycle approaches, including incremental, prototyping, iterative, or hybrid methodologies. The recommended iterative approach uses feedback for incremental development towards the established targets (Humble & Farley, 2010). However, a key challenge is shortening the development cycle to obtain early and regular feedback from customers. Both case studies highlighted that novice developers required more time and support to enhance code than their senior counterparts, leading to an increase in testing effort that could potentially impact development schedules (Gordon & Bieman, 1993). Despite these challenges, the methodology demonstrated advantages such as early bug detection during development and a reduction in rework across releases. These benefits allowed for improved planning when dealing with multiple projects concurrently.

These arguments provide a compelling case for the second proposition: the UCIEDP2 methodology contributes to the development of higher quality IEs than the initially employed methodologies (Prototyping and Scrum) by prioritising stakeholder feedback during development, specifying quality requirements, tracking progress throughout development, and testing early. While the methodology increased development cost and time owing to the additional tests and effort, it generally proved effective for developers, offering a comprehensive evaluation of the system, targeted improvements, and a clearer understanding of project expectations.

Proposition 3

An IE-specific quality-in-use model is beneficial for evaluating the quality of IEs.

As already discussed, quality was initially overlooked in both case studies; efforts were mostly concentrated on implementing the functional requirements. IEs are personalised and user-centric systems that depend on the specific physical environments in which they operate, since different users will have different requirements (Banijamali et al., 2020). However, quality aspects related to the personalised nature of IEs, such as user control, context coverage, and privacy, were not explicitly considered in the early stages. Testing primarily revealed functional errors but lacked comprehensive insights, leading to a discrepancy between user expectations and system capabilities.

The application of the adapted quality-in-use model in the two case studies proved instrumental in guiding the development processes. The model explicitly outlines quality characteristics relevant to IEs. Developers reported that the defined quality characteristics and metrics facilitated the selection of optimal design ideas, considering factors such as cost, time, and quality, and allowed them to weigh the trade-offs of adapting the system to diverse physical environments. The stakeholders expressed appreciation for the informed decision-making enabled by the UCIEDP2 methodology at each phase. Developers in both case studies acknowledged that the quality-in-use model provided a more objective perspective on the anticipated capabilities of the systems and allowed for system development tailored to specific customer contexts. Stakeholders prioritised mandatory quality characteristics, such as privacy, context coverage, user control, and trust. The quality-in-use model provided transparency, enabling informed decisions and incremental improvements throughout the development process.
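One way such an evaluation could be operationalised is sketched below. The characteristics are those the stakeholders prioritised in the text; the 0-to-1 scores, the weights, the acceptance threshold, and the weighted-average aggregation are illustrative assumptions, not the model's actual scoring scheme.

```python
def quality_in_use_score(scores: dict[str, float],
                         weights: dict[str, float],
                         mandatory: set[str],
                         threshold: float = 0.7) -> tuple[float, list[str]]:
    """Aggregate per-characteristic scores into one weighted average and
    flag any mandatory characteristic that falls below the threshold."""
    total_weight = sum(weights.values())
    overall = sum(scores[c] * weights[c] for c in scores) / total_weight
    failing = [c for c in mandatory if scores[c] < threshold]
    return overall, failing

# Hypothetical evaluation of one release against the prioritised characteristics.
scores  = {"privacy": 0.9, "context coverage": 0.6,
           "user control": 0.8, "trust": 0.75}
weights = {"privacy": 3.0, "context coverage": 2.0,
           "user control": 2.0, "trust": 3.0}

overall, failing = quality_in_use_score(scores, weights, mandatory=set(scores))
print(f"overall: {overall:.2f}; mandatory characteristics below threshold: {failing}")
```

Flagging mandatory characteristics separately, rather than letting a high average mask them, mirrors the text's point that characteristics such as privacy cannot simply be traded away against the others.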

Furthermore, the collection of critical benchmark data represented a notable shift in the company's approach, providing more accurate insights than their prior method of obtaining feedback through user interface demonstrations. Although this change meant increased effort for developers and testers, involving more efficient code writing and extensive testing throughout development, it proved beneficial for meeting and managing quality levels. Constant monitoring of the selected quality attributes allowed developers in both projects to navigate quality trade-offs and minimise the risk of accruing technical debt. This observation aligns with the findings of Sas and Avgeriou (2020), who, interviewing developers across various companies developing embedded systems, noted the importance of actively managing quality attributes to mitigate technical debt.

The analysis supports the third proposition, suggesting that an IE-specific quality-in-use model is indeed beneficial for evaluating the quality of IEs. The tailored approach addresses the unique challenges posed by these personalised and user-centric systems and guides both developers and stakeholders in making informed decisions throughout the development process. However, it required a substantial investment of time and effort, presenting challenges in managing multiple projects simultaneously.

7 Threats to validity

Yin (2018) highlighted several common criticisms of case study research, including lack of rigor, bias, difficulty in generalisation, and extensive, cumbersome documentation. In response to these concerns, it is emphasised that improving the quality of a case study involves adhering to four empirical research tests: construct validity, internal validity, external validity, and reliability (Yin, 2018). To enhance the robustness and credibility of the research findings, a systematic scrutiny of potential validity threats was conducted at every stage of the case study, drawing on the validity perspectives proposed by Gibbert et al. (2008) and Runeson and Höst (2009). The adoption of these perspectives facilitated a comprehensive evaluation of the study's design, execution, and the generalisability of its findings.

The formulation of the main research question and propositions was informed by a thorough analysis of the literature, although there is a possibility that the literature review was not exhaustive. Data collection, stemming from the research questions for each proposition, involved multiple sources in both case studies, including interviews, meetings, project reports, system specification documents, bug reports, and test reports. One threat to construct validity is the potential misinterpretation of interview questions by participants. To mitigate this concern, the main author conducted a pilot test with the two co-authors to identify ambiguous questions or potential misinterpretations (Creswell & Creswell, 2017). The interview questions were then further refined and are detailed in Appendices D and E, respectively. In both case studies, the main author also carried out training and information clarification meetings to ensure that the participants were adequately trained and briefed on the interview protocol (Rubin & Rubin, 2012). After completing each interview, the primary author emailed a summary of the findings to each participant, following the approach recommended by Morse (1994), to ensure an accurate representation of their perspectives.

Additionally, an established coding scheme and a systematic data analysis plan were implemented to ensure objective and thorough data analysis. The validity perspectives proposed by Gibbert et al. (2008) and Runeson and Höst (2009) were applied to the development of the coding scheme and to the data analysis. Gibbert et al. (2008) offer a structured approach to addressing threats to construct validity, emphasising the importance of ensuring that selected variables accurately represent the underlying theoretical constructs, which was particularly crucial in the context of a case study. The coding scheme for each proposition was assessed to ensure that it aligned with the theoretical constructs underlying it. To validate the relevance and completeness of the coding categories, the main author conducted a pilot project and sought feedback from the two co-authors, who are domain experts. Meanwhile, Runeson and Höst (2009) contribute valuable insights into addressing concerns related to internal and external validity in empirical research, guiding the examination of potential biases, confounding variables, and the generalisability of findings beyond the specific case study context. Despite the qualitative nature of the study, quantitative analysis was incorporated, with results interpreted in consideration of internal validity threats. However, we acknowledge that the metrics proposed for the quality-in-use model have undergone limited empirical testing, primarily through initial pilot studies. While these studies provided valuable insights, they fall short of the comprehensive empirical validation needed to firmly establish the metrics' reliability and validity across diverse IE contexts. As a first step, the metrics could be re-evaluated and refined using the Goal-Question-Metric (GQM) approach: clearly defining specific goals for each guiding principle, formulating precise questions that reflect these goals, and ensuring the metrics directly answer these questions. More comprehensive empirical validation across a broader range of IE domains then needs to be conducted, engaging a larger and more diverse group of domain experts in the review process to obtain their feedback on the relevance, clarity, and comprehensiveness of each metric.
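The GQM refinement suggested above can be organised as a simple hierarchy, which makes gaps in the metric set easy to spot during expert review. The goal, questions, and metrics below are hypothetical examples for a single guiding principle (privacy), not the study's actual GQM plan.

```python
# Hypothetical Goal-Question-Metric hierarchy for one guiding principle.
# Each metric answers exactly one question; each question refines the goal.
gqm = {
    "goal": "Assess how well the IE preserves occupants' privacy in use",
    "questions": {
        "Q1: Is personal data exposed beyond the agreed scope?": [
            "number of data flows leaving the local network per day",
            "proportion of stored data items covered by a consent record",
        ],
        "Q2: Can users inspect and revoke data collection?": [
            "mean number of steps to reach the data-sharing settings",
            "percentage of sensors with a visible opt-out control",
        ],
    },
}

# Print the hierarchy so reviewers can check each metric against its question.
print(gqm["goal"])
for question, metrics in gqm["questions"].items():
    print(f"  {question}")
    for metric in metrics:
        print(f"    metric: {metric}")
```

Walking domain experts through such a printed hierarchy is one way the feedback on relevance, clarity, and comprehensiveness of each metric could be collected systematically.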

The choice of a multiple case study methodology aimed to explore the applicability of UCIEDP2 in two distinct settings and to assess the similarity of findings. One case study was conducted in a real-world industrial context to reduce threats to external validity. However, threats to the generalisability of the results to companies beyond the scope of this research cannot be ignored: owing to the novelty of IE technologies, only one small-to-medium-sized local company participated in the industrial case study, so our findings predominantly reflect its perspectives. In both case studies, the main researcher explained UCIEDP2, the proposed quality-in-use model, and the ISO/IEC 25000 (2021) quality characteristics. Data recorded in the impact estimation tables were collected through tests designed and written by the developers/testers; these were validated by the consultant in the first case study and by the chief developer in the second. Multiple data collection techniques, such as interviews, observation, and document analysis, were utilised to bolster data reliability (Yin, 2018). Triangulation, involving the collection of information from various sources, including interviews with different stakeholders, was employed to cross-verify findings. UCIEDP2 was also compared against two methodologies, namely Prototyping and Scrum, through the two case studies respectively. Ethical considerations were addressed by conducting all study procedures in accordance with Middlesex University's Ethics Framework, and prior consent was obtained from individual participants, safeguarding their rights throughout the study. These measures collectively aimed to improve the overall reliability of the study.

8 Conclusion and future work

Engineering higher quality IEs is challenging. This study highlights the prevailing issues, including the absence of suitable methodologies. These challenges are further compounded by:

  • Insufficient guidance on tracking and measuring quality throughout the development process.

  • Lack of a dedicated quality model for assessing the quality of IEs.

  • Separation of quality characteristics from functional requirements.

  • Limited empirical research focusing on quality aspects for IEs.

To tackle these issues, our study introduces the UCIEDP2 methodology, designed to define, measure, and monitor quality aspects for IEs during their development. This approach involves specifying quality characteristics and measures, drawn from the ISO/IEC 25000 (2021) family of standards, for every functional requirement. Additionally, a novel quality-in-use model, adapted from the generic ISO/IEC 25010 (2021) model and aligned with the nine guiding principles of IEs, is presented for IE evaluation.

Through a multiple case study involving projects from different domains, our proposed methodologies were investigated, revealing that integrating quality characteristics from the ISO/IEC 25000 (2021) standards with functional requirements builds quality into systems. Stakeholder collaboration in defining targets using quality standards' measures and metrics enables developers to proactively address deviations from quality targets during development.

The effectiveness of the proposed quality-in-use model was evident in guiding system development by offering an objective perspective on expected capabilities. Involving stakeholders throughout the process ensures the delivery of systems that provide optimal value. However, challenges remain, such as the need to shorten development cycles for more frequent stakeholder feedback and managing the increased number of system tests. Current efforts focus on applying the methodology to a broader range of industrial projects, aiming to refine and adapt our approaches for wider applicability.