Static detection of equivalent mutants in real-time model-based mutation testing

Basile, Davide; Beek, Maurice H. ter; Lazreg, Sami; Cordy, Maxime; Legay, Axel

doi:10.1007/s10664-022-10149-y


An Empirical Evaluation

Open access
Published: 20 September 2022

Volume 27, article number 160, (2022)

Empirical Software Engineering


Davide Basile ORCID: orcid.org/0000-0002-7196-6609¹,
Maurice H. ter Beek¹,
Sami Lazreg²,
Maxime Cordy² &
…
Axel Legay³


Abstract

Model-based mutation testing has the potential to effectively drive test generation to reveal faults in software systems. However, it faces a typical efficiency issue since it could produce many mutants that are equivalent to the original system model, making it impossible to generate test cases from them. We consider this problem when model-based mutation testing is applied to real-time system product lines, represented as timed automata. We define novel, time-specific mutation operators and formulate the equivalent mutant problem in the frame of timed refinement relations. Further, we study in which cases a mutation yields an equivalent mutant. Our theoretical results provide guidance to system engineers, allowing them to eliminate mutations from which no test case can be produced. Our empirical evaluation, based on a proof-of-concept implementation and a set of benchmarks from the literature, confirms the validity of our theory and demonstrates that in general our approach can avoid the generation of a significant amount of the equivalent mutants.


1 Introduction

Testing a real-time system against safety-critical requirements is a difficult problem due to the time-sensitiveness of its behaviour. To help in this task, model-based testing methods automate the generation of test cases by using a formal model of the system (Utting et al. 2012). The model drives the generation of test cases according to different criteria, such as classical state or branch coverage (Masri and Zaraket 2016), or feature combination coverage in the specific context of software product lines (Lee et al. 2020). Testing a formal model rather than source code makes it possible to detect, among others, misinterpretations of requirements or systemic issues arising from time-dependent interactions of the system with its environment. Such issues would be harder to detect at source code level.

Mutation testing (Aichernig et al. 2015; Brillout et al. 2009) is a technique commonly used to evaluate the thoroughness of test cases or to support their generation (Andrews et al. 2006; Offutt 2011). It can be applied both to the implementation (source code) and to the specification (model). A set of mutation operators, simulating possible faults in the system, is applied to the model, obtaining a so-called mutant. Thus, given a set of mutants, the effectiveness of a set of test cases can be evaluated according to the number of mutants it detects (i.e., mutants that produce different output than the original system). Test cases generated from a mutant are capable of detecting the bugs mimicked by that mutation. The fundamental underlying assumption is the existence of a coupling effect, i.e., the fact that “simple faults are coupled to more complex faults in such a way that a test suite that detects simple faults is sensitive enough to likely detect complex faults as well” (Petrovic et al. 2021). It has been shown (Andrews et al. 2006) that mutation-based testing is more effective in finding real faults than other techniques (Offutt 2011; Baker and Habli 2013; Aichernig et al. 2013).

Scalability of this approach is of paramount importance, because a large number of mutations is required in order to build effective test cases. However, many of these mutations are useless because they generate mutants that exhibit no behaviour beyond that of the original system. In such cases, no test case can be generated to differentiate the mutant from the original system, leading to useless analyses and a waste of computational resources. Code-based mutation testing research has worked on methods to detect and avoid equivalent mutants, i.e., mutants that are semantically equivalent to the original program (Madeyski et al. 2014). In model-based mutation testing, this problem generalizes to that of detecting subsumed mutants, which have less (or equal) behaviour than the original system model.

One viable method is to organise the mutants as a product line of mutations, in the featured mutant model (Devroey et al. 2016). Such a product line enables the effective generation and validation of mutants against given test cases. However, an efficient featured mutant model should be built upon a set of effective mutations (i.e., those producing useful mutants), rather than from random mutations. This constitutes an important contribution to avoiding the equivalent/subsumed mutant problem.

In this paper, we tackle the problem of testing real-time systems effectively and efficiently. We adopt the model-based mutation testing approach for real-time systems presented in Larsen et al. (2017). We augment the set of existing mutations with a few mutation operators that affect the timing of the system behaviour (e.g., one such operator delays the execution of an action by the system), first introduced in Basile et al. (2020a). Then, we address the subsumed mutant problem: we formally prove the conditions under which mutations inherently (i.e., by construction) produce subsumed mutants. We achieve this on the basis of refinement relations, which can be used to show that a model (the system) subsumes another (the mutant). Our endeavour yields clear guidelines for real-time system engineers, which they can follow in order to reduce their testing effort by ignoring equivalent mutants.

This paper builds on results from Basile et al. (2020a); more precisely, we extend it in the following way. We prove novel auxiliary theoretical results concerning non-subsumed mutants (in Section 4.2), while we refer to Basile et al. (2020a) for proofs of earlier theoretical results reported here for the sake of completeness. Moreover, we add a thorough empirical evaluation (in Section 5), considering more case studies retrieved from the literature, and extending the experiments to consider second-order mutations, for all case studies. Finally, the experiments protocol has been fully automatised (cf. Section 5.2) and the implementation of mutant generation and checking is open source and available online, thus allowing reproducibility of the experiments.

Summarising, our contributions are as follows.

1. We propose novel time-specific mutation operators for real-time models.
2. We study and formally prove under which conditions mutation operators, including the time-specific ones, yield equivalent (or subsumed) mutants, from which no test case can be generated, and we provide guidelines that can be used to prevent the generation of such mutants.
3. We study and formally prove under which conditions mutation operators, including the time-specific ones, yield non-equivalent (or non-subsumed) mutants. These results must also consider when a mutation produces a non-redundant mutant, and auxiliary results not presented before address this issue formally.
4. We formalise our theoretical results using product lines of mutants. We use the featured mutant model (Devroey et al. 2016) to model the variability of the different mutations that can be applied, using a feature-aware extension of timed graphs (the mathematical structure used to check refinement relations), in a similar way that other formalisms have been extended with variability (Cordy et al. 2012b; Cordy et al. 2012a; Classen et al. 2013; Ter Beek et al. 2016; Basile et al. 2020b; Ter Beek et al. 2020; Ter Beek et al. 2021).
5. We implement our approach in a proof-of-concept tool and validate the soundness and effectiveness of the guidelines, based on an industrial system from the automotive domain and several other case studies from the literature, for first-order as well as second-order mutants.
6. The experiments protocol is completely automatised based on software for (i) mutant generation, (ii) mutant checking with the provided tool, and (iii) automatic refinement checking using the off-the-shelf tools Uppaal TIGA (Behrmann et al. 2007) and Ecdar (David et al. 2010b), to validate the proposed approach empirically.

Outline

In Section 2, we discuss related work, followed by background material on (featured) timed games and the featured mutant model in Section 3, where we also introduce the novel mutation operators. Our main contributions are presented in Section 4, where we classify mutation operators and present guidelines for selecting effective mutations, and in Section 5, where we report the results of an empirical evaluation of our guidelines for both first- and second-order mutants. In Section 6, finally, we conclude the paper and provide some ideas for future work. Due to their size, the results of the aforementioned empirical evaluation for second-order mutants are reported in Appendix A.

2 Related Work

This paper, as an extension of Basile et al. (2020a), mainly builds upon two recent results on mutation-based testing (Devroey et al. 2016; Larsen et al. 2017). Featured mutant models were introduced in Devroey et al. (2016) for efficiently validating test cases against different possible mutations. Indeed, a single execution on the generated featured transition system (Classen et al. 2013) suffices to check all mutants at once. However, in contrast to our approach, no guidelines are provided on how to select the mutations to generate the featured mutant model, that is, the mutations are selected randomly.

While Devroey et al. (2016) studies the problem of checking given test cases, Larsen et al. (2017) considers the problem of generating valid test cases for real-time system models. Basically, a test case generated through mutation-based testing is guaranteed by construction to distinguish certain mutants from the system model.

Compared to Devroey et al. (2016), in Larsen et al. (2017) the mutants are not organised as a product line and thus have to be checked one by one to generate the test cases. Moreover, both approaches generate random mutations that may turn out to be ineffective for generating/validating the test cases. Our approach improves on this by providing clear guidelines that make it possible to establish upfront which mutations can safely be ignored since no test case can be produced from them.

In Luthmann et al. (2017) and Luthmann et al. (2019), an approach to the generation of non-subsumed mutants is proposed, using Configurable Parametric Timed Automata (CoPTA) models, which analyses constraints of the generated zone graph. We do not generate zone graphs but instead statically identify mutations to be discarded based on the fact that we know from our theoretical results that they will generate subsumed mutants.

Earlier, in Aichernig et al. (2013), mutation-based testing for timed automata was introduced, extending standard mutation operators presented in Fabbri et al. (1999) with new mutations tailored for timed automata. We use some of those mutations, but also some of the new ones we introduced in Basile et al. (2020a). Compared to Larsen et al. (2017), a k-bounded language inclusion test between the mutant and the system model is used rather than refinement checking with Ecdar.

In Aichernig et al. (2013), a Car Alarm System (CAS) model of Ford is used as case study for experiments and evaluation. In Basile et al. (2020a), we used the same case study; in this paper, we consider five further case studies from the literature (Hune et al. 2001; Feo-Arenis et al. 2014; Hoxha et al. 2015; André et al. 2019; Basile et al. 2020c). Similar to Larsen et al. (2017), the approach in Aichernig et al. (2013) comes without a procedure or guidelines for selecting effective mutations, and no product line is used either. In particular, 471 out of a total of 1099 generated mutants are tested and subsequently discarded, because they cannot be used for generating test cases. We present a technique that makes it possible to avoid the generation of ineffective mutants.

Mutation-based test-case generation is also discussed in Aichernig et al. (2015), for the case of UML state machine diagrams. The technique for comparing the mutant with the system model is similar to the one in Aichernig et al. (2013), and the same CAS case study is used for experiments. Mutations are applied randomly and ineffective mutants (i.e., mutants subsumed by the system model) are discarded subsequent to their generation.

Finally, the survey in Jia and Harman (2011) points out that “one barrier to wider application of mutation testing centers on the problems associated with equivalent mutants”. Our paper is an effort in the direction of reducing the generation of ineffective mutants upfront, within the framework proposed by Larsen et al. (2017) and adopting the featured mutant model construction of Devroey et al. (2016).

3 Background

In this section, we provide some background needed for the sequel.

3.1 Timed Games

Timed games (TG) are transition systems which can remain in a certain state or location only for a specific amount of time, can execute a transition only within a certain time interval, and distinguish between controllable and uncontrollable actions. TG are based on timed (game) automata (Alur and Dill 1994; Asarin et al. 1998) and form the underlying behavioural structure of featured timed game (automata) (Cordy et al. 2012b; Cordy et al. 2013).

In reactive systems, one usually distinguishes between uncontrollable and controllable actions, that are assigned to inputs and outputs, respectively, if the environment is uncontrollable and vice versa otherwise.

Time is represented by clocks whose values evolve continuously. Clocks can be regarded as chronometers: their value can be inspected and reset, but not modified arbitrarily. Conditions over clock values are called clock constraints.

Definition 1 (Clock constraints)

A clock constraint over a set C of clocks is formed according to the grammar $g ::= \top \mid n \sim c \mid g \land g$, with $n \in \mathbb {N}$, c ∈ C, and $\sim \in \{<, \leq , \geq , >\}$.

We denote by CC(C) the set of clock constraints over C. In TG, a clock constraint can label either a state or a transition. In case it labels a state, the constraint is a location invariant, which defines the interval of time in which the system can be in the state. In case it labels a transition, it is a transition guard specifying the interval of time during which the system can execute the transition. Note that the domain of the numeric constants in clock constraints is limited to natural numbers. Without loss of generality, we could use real numbers. However, natural numbers facilitate the implementation of clock constraints by allowing efficient data structures.
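As a rough illustration of Definition 1, the following Python sketch encodes a clock constraint as a conjunction of atomic comparisons and evaluates it against a clock valuation. The names `Atom`, `satisfies`, `invariant_s1`, and `guard_soda` are hypothetical and not part of the paper; atoms are written in the form "clock, operator, constant" (as in x ≤ 5), and the concrete constants are only assumed for illustration.

```python
from dataclasses import dataclass
import operator

# Comparison operators allowed by the grammar of Definition 1.
OPS = {"<": operator.lt, "<=": operator.le, ">=": operator.ge, ">": operator.gt}


@dataclass(frozen=True)
class Atom:
    """Atomic constraint relating one clock to a natural-number constant."""
    clock: str
    op: str
    const: int


def satisfies(valuation, constraint):
    """Return True if the valuation satisfies every atom of the conjunction.

    A constraint is a tuple of atoms; the empty tuple plays the role of 'true'.
    """
    return all(OPS[a.op](valuation[a.clock], a.const) for a in constraint)


# Assumed example constraints: a location invariant and a transition guard.
invariant_s1 = (Atom("x", "<=", 5),)
guard_soda = (Atom("x", "<=", 2),)

print(satisfies({"x": 1.5}, guard_soda))
print(satisfies({"x": 3.0}, guard_soda))
print(satisfies({"x": 3.0}, invariant_s1))
```

Keeping the constants as integers mirrors the restriction to natural numbers discussed above, while valuations remain real-valued.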

Definition 2 (Timed games)

A timed game (TG) is a tuple (Loc, Act, C, Trans, ℓ₀, Inv, AP, L) where

Loc is a finite set of locations;
Act is a finite set of actions, partitioned into controllable actions Act^c and uncontrollable actions Act^u;
C is a finite set of clocks;
$\textit {Trans} \subseteq \textit {Loc} \times \textit {CC}(C) \times \textit {Act} \times 2^{C} \times \textit {Loc}$ is a transition relation;
ℓ₀ ∈Loc is the initial location;
$\textit {Inv} : \textit {Loc} \rightarrow \textit {CC}(C)$ is a total function associating locations with invariants;
AP is a set of atomic propositions; and
$L: \textit {Loc} \rightarrow 2^{\textit {AP}}$ is a total function associating locations with atomic propositions satisfied in those locations.

For a transition $t = (\ell , g, \alpha , R, \ell ^{\prime })$, ℓ is the starting location, g is the transition guard, α is the action triggering the transition, R is the subset of clocks to reset, and $\ell ^{\prime }$ is the target location. We may also write t as $\ell \xrightarrow {g,\alpha ,R} \ell ^{\prime }$ and omit g and/or R when immaterial, and instead of {x} for a reset of clock x, we may also write x := 0.

Example 1

In Fig. 1(left), a TG model of a soda vending machine is depicted. From its initial state s₀, the insertion of a euro coin (€) results in the clock being (re)set to zero and a move to state s₁. This input action is modelled as a controllable transition (drawn as a solid arc). The vending machine can remain in this state for at most 5 time units, but it can deliver a soda bottle, returning to its initial state, only within the first 2 time units. The latter action is modelled as an uncontrollable transition (drawn as a dotted arc). Note that we may speak of (un)controllable transitions when their action labels are (un)controllable. A TG model of a tea vending machine is depicted in Fig. 1(right).

The semantics of a TG is commonly defined as an infinite transition system (TS) whose states consist of a location and a valuation of the clocks. The transitions can be categorised into two types. Delay transitions do not change the location of the system, but only represent the passing of time. They may occur only if the invariant of the current location is still satisfied after the delay modelled by the transition. Discrete transitions instead occur when the system moves from one location to another. They may occur only if the current clock values satisfy both the guard of the executed transition and the invariant of the target location. After the execution of such transitions, clock values can be reset.
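The two kinds of transitions can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the encoding of the soda machine (`INV`, `COIN`, `SODA`), the action names, and the guard x ≤ 2 are assumptions made for the example. The sketch checks the invariant of the target location on the valuation obtained after the resets, which is the usual timed-automata convention and is likewise an assumption here.

```python
from dataclasses import dataclass

OPS = {"<": lambda a, b: a < b, "<=": lambda a, b: a <= b,
       ">=": lambda a, b: a >= b, ">": lambda a, b: a > b}


def holds(valuation, constraint):
    """A constraint is a tuple of (clock, op, constant) atoms, read as a conjunction."""
    return all(OPS[op](valuation[clock], n) for (clock, op, n) in constraint)


@dataclass(frozen=True)
class Transition:
    source: str
    guard: tuple
    action: str
    resets: frozenset
    target: str


# Hypothetical encoding of a soda machine with one clock x.
INV = {"s0": (), "s1": (("x", "<=", 5),)}
COIN = Transition("s0", (), "coin", frozenset({"x"}), "s1")
SODA = Transition("s1", (("x", "<=", 2),), "soda", frozenset(), "s0")


def delay(state, d):
    """Delay transition: keep the location if its invariant still holds after d."""
    loc, val = state
    later = {c: v + d for c, v in val.items()}
    return (loc, later) if holds(later, INV[loc]) else None


def fire(state, t):
    """Discrete transition: check the guard, apply resets, check the target invariant."""
    loc, val = state
    if loc != t.source or not holds(val, t.guard):
        return None
    after = {c: (0.0 if c in t.resets else v) for c, v in val.items()}
    return (t.target, after) if holds(after, INV[t.target]) else None


state = fire(("s0", {"x": 0.0}), COIN)   # insert a coin, clock x is reset
state = delay(state, 1.5)                # wait 1.5 time units in s1
print(fire(state, SODA))                 # soda is still allowed at x = 1.5
print(delay(("s1", {"x": 0.0}), 6.0))    # waiting 6 units violates x <= 5
```

Returning `None` for a disabled step keeps the sketch short; a fuller model would enumerate all enabled transitions from a state.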

Definition 3 (TG semantics)

We define the semantics of a timed game $tg = (\textit{Loc}, \textit{Act}, C, \textit{Trans}, \ell_{0}, \textit{Inv}, \textit{AP}, L)$ as the semantics of the TS $(\textit{Loc} \times \textit{Val}(C), \textit{Act} \cup \mathbb{R}_{\geq 0}, \textit{Trans}^{\prime}, (\ell_{0}, v_{0}), \textit{AP} \cup \textit{CC}(C), L^{\prime})$, denoted by $[\![tg]\!]_{\textit{TG}}$, and such that $\textit{Val}(C)$ is the set of clock evaluations, i.e., the set of total functions $v : C \rightarrow \mathbb{R}^{+}$ that assign a non-negative real value to every clock; $v_{0} = \{ v_{0}(c) = 0 \mid c \in C \}$; $L^{\prime}(\ell, v) = L(\ell) \cup \{ cc \in \textit{CC}(C) \mid v \models cc \}$; and

$$[\![tg]\!]_{\textit{TG}} = \{ L(\ell_{0}), L(\ell_{1}), \ldots \in (2^{\textit{AP} \cup \textit{CC}(C)}) \mid \forall i \in \mathbb{N} {\scriptstyle\ \bullet\ } \exists \alpha_{i} \in \textit{Act} \cup \mathbb{R}_{\geq 0} {\scriptstyle\ \bullet\ } ((\ell_{i},v) \xrightarrow{\alpha_{i}} (\ell_{i+1},v^{\prime})) \}$$

3.2 Featured Timed Games

Featured timed games (FTG) extend TG with variability in the same way that featured transition systems (FTS) (Classen et al. 2013) extend (labelled) transition systems (LTS). FTS concisely model the behaviour of all products of a product line in a single superimposed LTS through the annotation of transitions with feature expressions, i.e., conditions expressing their existence in products, based on a feature model.

We assume products to be represented by sets of Boolean features and a feature model to be defined as a pair $(F,P\subseteq 2^{F})$, where F is a set of features and P is the set of valid products. The semantics of a feature model φ, denoted by $[\![\varphi]\!]_{\textit{FM}}$, is then its set of valid products. It can be represented either by a propositional formula or by the usual feature diagram. Let $\mathbb {B} = \{\top ,\bot \}$ denote the Boolean constants true (⊤) and false (⊥), and let $\mathbb {B}(F)$ denote the set of Boolean expressions over F (i.e., using features as propositional variables). The elements of $\mathbb {B}(F)$ are also called feature expressions. Formally, a feature expression χ is a total function $\{\top ,\bot \}^{|F|} \rightarrow \{\top ,\bot \}$ that associates every combination of features with a truth value. A feature expression can be interpreted as a set of products $[\![\chi ]\!]\subseteq 2^{F}$ defined as all products p for which the induced truth assignment (⊤ for f ∈ p, ⊥ for f ∉ p, for features f ∈ F) validates χ. Feature expressions and clock constraints allow modelling the behaviour of real-time variability-intensive systems.
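The interpretation of a feature expression as a set of products can be sketched by brute-force enumeration. The code below is only an illustration under assumptions: the feature names `s` and `t` and the expression s ∨ t are taken from the vending machine product line used in the examples of this section, while `all_products` and `products_of` are hypothetical helper names.

```python
from itertools import combinations

FEATURES = ("s", "t")


def all_products(features):
    """Enumerate every subset of the feature set (the candidate products)."""
    return [frozenset(c) for r in range(len(features) + 1)
            for c in combinations(features, r)]


def products_of(expr, features=FEATURES):
    """Set of products whose induced truth assignment validates expr."""
    return {p for p in all_products(features)
            if expr({f: (f in p) for f in features})}


def soda_or_tea(assignment):
    """Feature expression s OR t over a truth assignment of the features."""
    return assignment["s"] or assignment["t"]


valid = products_of(soda_or_tea)
print(sorted(sorted(p) for p in valid))
```

Enumeration is exponential in the number of features, so it only serves to make the definition concrete; practical tools typically rely on SAT solvers or decision diagrams instead.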

Definition 4 (Featured timed games)

Given a timed game (Loc,Act,C, Trans,Loc₀,Inv, AP,L), the decuple (Loc,Act,C,Trans,Loc₀,Inv,AP,L,φ,γ) is a featured timed game (FTG) where

φ is a feature model over a finite set F of features; and
$\gamma : (\textit {Trans} \cup (\textit {Loc} \rightarrow \textit {CC}(C))) \rightarrow \mathbb {B}(F)$ is a total function associating feature expressions to transitions and invariants.

As for FTS, the function γ associates a feature expression χ to some transition $t = (\ell , g, \alpha , R, \ell ^{\prime })$ such that γ(t) = χ encodes the set of products able to execute t. We may also write t as $\ell \xrightarrow {[\chi ]g,\alpha ,R} \ell ^{\prime }$ and omit g and/or R when immaterial. The function γ moreover associates a feature expression χ to a location invariant Inv(ℓ) = g, for some ℓ ∈Loc, such that γ(g) = χ, which we may also write as [χ]g, encodes the set of products with the invariant g in location ℓ. Note that [⊤] stands for a feature expression that is always satisfied (by any product).

Example 2

In Fig. 2(left), an FTG ftg of a product line of vending machines is depicted. The feature model is s ∨ t, with features s for soda and t for tea. From the initial state s₀, the insertion of a euro coin (€), which is always possible (the feature expression is always true) and which results in the clock being (re)set to zero, leads to state s₁. This is a controllable (input) action. A vending machine can remain in this state for at most 5 time units. Vending machines with feature s can deliver a soda bottle before 2 time units have passed. Vending machines with feature t can deliver a cup of tea after at least 2 time units have passed (producing tea takes more time). Note that in the presence of both features, after precisely 2 time units have passed, a choice occurs. Both (output) actions are uncontrollable.

FTG model real-time behaviour of a product line. Moreover, from an FTG we can derive TG modelling behaviour of specific products. This is achieved by projection of an FTG onto a product p obtained in much the same way as an LTS is obtained from an FTS (Classen et al. 2013): all transitions and invariants unavailable in product p are removed.

Definition 5 (FTG projections)

The projection of an FTG $\textit{ftg} = (\textit{Loc}, \textit{Act}, C, \textit{Trans}, \textit{Loc}_{0}, \textit{Inv}, \textit{AP}, L, \varphi, \gamma)$ onto a valid product $p \in [\![\varphi]\!]_{\textit{FM}}$ is the TG $\textit{ftg}_{|p} = (\textit{Loc}, \textit{Act}, C, \textit{Trans}^{\prime}, \textit{Loc}_{0}, \textit{Inv}^{\prime}, \textit{AP}, L)$ where

$$\textit{Trans}^{\prime} = \{ t = (\ell, g, \alpha, R, \ell^{\prime}) \mid t \in \textit{Trans} \land p \models \gamma(t) \}; \text{ and}$$

$$\textit{Inv}^{\prime}(\ell) = \textit{Inv}(\ell)_{|p}, \ \forall \ell \in \textit{Loc},$$

and the projection of an invariant $g$ onto a product $p$ is recursively defined as

$$g_{|p} = \left \lbrace \begin{array}{ll} (g_{1})_{|p} \land (g_{2})_{|p} &\text{ if } g = g_{1} \land g_{2} \\ g^{\prime} &\text{ if } (g = [\chi]g^{\prime}) \land p\in[\![\chi]\!] \\ \top &\text{ if } (g = [\chi]g^{\prime}) \land p\not\in[\![\chi]\!] \end{array} \right .$$
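Definition 5 can be sketched in Python as a filter over transitions and featured invariants. The dictionary layout, the action names, and the guards of the `FTG` below are assumptions chosen to resemble the vending machine product line; clock resets are omitted for brevity, and `project` is a hypothetical helper, not the authors' tool.

```python
def top(product):
    """Feature expression that is satisfied by every product."""
    return True


# Transitions: (source, feature expression, guard, action, target).
# Invariants: per location, a conjunction given as a list of (feature expression, constraint).
FTG = {
    "trans": [
        ("s0", top, None, "coin", "s1"),
        ("s1", lambda p: "s" in p, ("x", "<=", 2), "soda", "s0"),
        ("s1", lambda p: "t" in p, ("x", ">=", 2), "tea", "s0"),
    ],
    "inv": {"s0": [], "s1": [(top, ("x", "<=", 5))]},
}


def project(ftg, product):
    """Keep only the transitions and invariant conjuncts available in the product.

    Dropping a conjunct from the list corresponds to replacing it by 'true'.
    """
    product = frozenset(product)
    trans = [(src, g, a, dst)
             for (src, chi, g, a, dst) in ftg["trans"] if chi(product)]
    inv = {loc: [g for (chi, g) in conj if chi(product)]
           for loc, conj in ftg["inv"].items()}
    return {"trans": trans, "inv": inv}


soda_only = project(FTG, {"s"})
print([a for (_, _, a, _) in soda_only["trans"]])
print(soda_only["inv"]["s1"])
```

Representing an invariant as a list of conjuncts makes the three cases of the recursive definition collapse into a single list comprehension.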

Example 3

In Fig. 2(right), products $\textit{ftg}_{|\{s\}}$ and $\textit{ftg}_{|\{t\}}$ of the FTG ftg are depicted. The TG $\textit{ftg}_{|\{s\}}$ in Fig. 2(bottom-right) is a model of the vending machine that can only deliver soda bottles, whereas the TG $\textit{ftg}_{|\{t\}}$ in Fig. 2(top-right) is a model of the vending machine that can only deliver tea. Product $\textit{ftg}_{|\{s,t\}}$ is not shown.

The semantics of an FTG model of a product line is defined as a function that associates every valid product with the semantics of its projection.

Definition 6 (FTG semantics)

The semantics of an FTG ftg = (Loc, Act, C, Trans, Loc₀, Inv, AP, L, φ, γ) is defined as the function $[\![\textit{ftg}]\!]_{\textit{FTG}}$ such that

$$\forall p \in [\![\varphi]\!]_{\textit{FM}} {\scriptstyle\ \bullet\ } [\![\textit{ftg}]\!]_{\textit{FTG}}(p) = [\![\textit{ftg} \! _{|p}]\!]_{\textit{TG}}$$

3.3 Featured Mutant Model

The idea underlying model-based mutation testing is to guide the test-case generation by mutants, which are typically obtained through random mutations of the original model. Organising the mutants as a product line of mutations, a family of variations of the system under test (SUT), coined the featured mutant model (FMM) in Devroey et al. (2016), enables the efficient generation, configuration, and execution of mutants. Each feature in the FMM corresponds to a single application of one mutation operator on the original model.

Like Devroey et al. (2016), we use a selection of the operators proposed by Fabbri et al. (1999), based on Chow (1978) and Weyuker et al. (1994), to generate mutants from a TS:

TMI :: Transition MIssing operator removes a transition;
TAD :: Transition ADd operator adds a transition between two states;
SMI :: State MIssing operator removes a state (other than the initial state) and all its incoming/outgoing transitions.

Additionally, we introduce the following operators specific to timed models, which change the constant in clock constraints, which we recall to be either a transition guard or a location invariant:

CXL :: Constant eXchange L operator increases the constant of a clock constraint;
CXS :: Constant eXchange S operator decreases the constant of a clock constraint;
CCN :: Clock Constraint Negation operator negates a clock constraint.
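The three time-specific operators can be sketched on atomic clock constraints as follows. This is an illustrative reading only: the tuple encoding, the `delta` parameter (the amount by which the constant changes), and the restriction of negation to a single atom are assumptions of the sketch, since negating a conjunction would yield a disjunction that the grammar of Definition 1 does not express.

```python
# Negation of each comparison operator allowed in clock constraints.
NEGATE = {"<": ">=", "<=": ">", ">=": "<", ">": "<="}


def cxl(atom, delta=1):
    """CXL: increase the constant of an atomic clock constraint."""
    clock, op, n = atom
    return (clock, op, n + delta)


def cxs(atom, delta=1):
    """CXS: decrease the constant, keeping it a natural number."""
    clock, op, n = atom
    if n - delta < 0:
        raise ValueError("constants must remain natural numbers")
    return (clock, op, n - delta)


def ccn(atom):
    """CCN: negate an atomic clock constraint."""
    clock, op, n = atom
    return (clock, NEGATE[op], n)


print(cxl(("x", ">=", 2), delta=2))
print(cxs(("x", "<=", 5)))
print(ccn(("x", "<=", 2)))
```

Because each function returns a new tuple, the original constraint stays untouched, which is convenient when many mutants are derived from the same model.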

The CCN operator is inspired by the μ_ng operator from Aichernig et al. (2013), where only clock constraints appearing as transition guards are negated.

Each operator can be used to generate mutants using either the enumerative approach or the FMM approach. In the enumerative approach, each mutation transforms an FTG model ftg, representing the SUT behaviour, into a mutant ftg_m.

Example 4

The FTG in Fig. 3(right) has been obtained from the FTG in Fig. 3(left) by applying the mutation operators TMI, CXL, and CXS. The transition labelled with a soda bottle was removed (TMI). Moreover, constant 2 in the clock constraint that acts as transition guard was increased to 4 to model that producing a tea takes more time (CXL). Instead, constant 5 in the clock constraint that acts as location invariant was decreased to 4 to model that the vending machine takes less time to produce a drink (CXS). Thus, the transition from s₁ to s₀ that models the delivery of a cup of tea now occurs (instantaneously) precisely when x = 4. The feature model was not changed.

In the FMM approach, each mutation operator is added as a feature to the existing feature model. When considering first-order mutation (only one mutation can be applied to the original system), the features/mutations are mutually exclusive. For higher order mutations, disjunction is used instead.
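The difference between first-order and higher-order configurations can be sketched by enumerating the sets of active mutation features. The feature names and the helper `mutant_configurations` are hypothetical; reading "mutually exclusive" as "exactly one active mutation per mutant" and bounding the order by `max_order` are assumptions of this sketch.

```python
from itertools import combinations

MUTATIONS = ("tmi", "cxl", "cxs")


def mutant_configurations(mutations, max_order=1):
    """Enumerate the sets of active mutation features with 1..max_order mutations."""
    return [frozenset(c)
            for k in range(1, max_order + 1)
            for c in combinations(mutations, k)]


first_order = mutant_configurations(MUTATIONS)
up_to_second_order = mutant_configurations(MUTATIONS, max_order=2)
print(len(first_order), len(up_to_second_order))
```

The number of configurations grows quickly with the order, which is one reason why discarding ineffective mutations before generating mutants pays off.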

Example 5

Adding the TMI, CXL, and CXS operators to the FTG in Fig. 4(left) results in the FTG ftg_fmm depicted in Fig. 4(right) with feature model φ_fmm depicted in Fig. 5. We now explain this.

To begin with, the TMI operator removes transition t₁ of the base model in the following way:

1. The feature expression ¬tmi is added to the feature expression of t₁, resulting in a transition with feature expression s ∧ ¬tmi, meaning that this transition may be fired only if the tmi mutation is deactivated (and if s is true);
2. The feature tmi is added to the feature model φ_fmm representing the application of the mutation operator (cf. Fig. 5).

Moreover, the CXL operator increases the constant 2 to 4 in the clock constraint that acts as guard on transition t₂ of the base model, in the following way:

1. The feature expression ¬cxl is added to the feature expression of t₂, resulting in a transition with feature expression t ∧ ¬cxl, meaning that this transition may be fired only if the cxl mutation is deactivated (and if t is true);
2. A transition with feature expression t ∧ cxl and clock constraint x ≥ 4 is added, meaning that this transition may be fired only if the cxl mutation is activated (and if t is true);
3. The feature cxl is added to the feature model φ_fmm representing the application of the mutation operator (cf. Fig. 5).

Finally, the CXS operator decreases the constant 5 to 4 in the featured clock constraint [⊤]x ≤ 5, which acts as invariant of the state s₁ of the base model, in the following way:

1. The feature expression ¬cxs is added to the featured clock constraint of state s₁, meaning that the updated featured clock constraint [¬cxs]x ≤ 5 acts as invariant x ≤ 5 of s₁ only if the cxs mutation is deactivated;
2. The feature expression cxs is added to the featured clock constraint of state s₁, meaning that the updated featured clock constraint [cxs]x ≤ 4 acts as invariant x ≤ 4 of s₁ only if the cxs mutation is activated;
3. The feature cxs is added to the feature model φ_fmm representing the application of the mutation operator (cf. Fig. 5).

Hence, mutation operators are added to the FMM under construction.
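The construction of Example 5 can be sketched as a small table of featured transitions and featured invariants that is projected onto a product extended with mutation features. The encoding is an assumption: it presumes that the base model is the vending machine product line with features s and t, uses hypothetical action names and the guard x ≤ 2 for soda, and omits clock resets. The helpers `fe` and `project` are illustrative names only.

```python
def fe(required=(), forbidden=()):
    """Feature expression: conjunction of required features and negated forbidden ones."""
    req, forb = frozenset(required), frozenset(forbidden)
    return lambda product: req <= product and not (forb & product)


# Featured transitions: (source, feature expression, guard, action, target).
FMM_TRANS = [
    ("s0", fe(), None, "coin", "s1"),
    ("s1", fe(["s"], ["tmi"]), ("x", "<=", 2), "soda", "s0"),  # [s and not tmi]
    ("s1", fe(["t"], ["cxl"]), ("x", ">=", 2), "tea", "s0"),   # [t and not cxl]
    ("s1", fe(["t", "cxl"]), ("x", ">=", 4), "tea", "s0"),     # [t and cxl]
]

# Featured invariant of s1: [not cxs] x <= 5 and [cxs] x <= 4.
FMM_INV_S1 = [
    (fe(forbidden=["cxs"]), ("x", "<=", 5)),
    (fe(["cxs"]), ("x", "<=", 4)),
]


def project(product):
    """Derive the transitions and the invariant of s1 for one configured mutant."""
    product = frozenset(product)
    trans = [(src, g, a, dst)
             for (src, chi, g, a, dst) in FMM_TRANS if chi(product)]
    inv_s1 = [g for (chi, g) in FMM_INV_S1 if chi(product)]
    return trans, inv_s1


# Tea machine with the cxl mutation activated: the tea guard becomes x >= 4.
print(project({"t", "cxl"}))
# Soda machine with the tmi mutation activated: the soda transition disappears.
print(project({"s", "tmi"}))
```

A single featured model thus encodes the original system and all its mutants, and each mutant is obtained by selecting the corresponding mutation features.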

4 Classifying Mutations

Our main theoretical contribution is a classification of mutations to identify those that are effective (i.e., can be used to generate test cases). Our key idea from Basile et al. (2020a) is that, by construction, some mutations produce mutants that have the same (or a subset of the) behaviour of the SUT. Discarding them will speed up the mutation testing process, as we would avoid fruitless attempts to generate test cases. Thus, we aim to characterise these mutations by formally proving under which conditions (i.e., mutation operator and the elements of the model to which it is applied) the produced mutant is subsumed by the SUT.

Recall that a test case generated from a mutant provides a sequence of inputs that makes the mutant behave differently than the SUT (in terms of accepted inputs, produced outputs, or execution time). Thus, the goal of the test case is to distinguish whether the system on which it is executed is the original one or the mutant. For a mutant to remain “live” (as it is named in the jargon), there must be no test case that can distinguish it from the SUT. This is equivalent to proving that the mutant is a refinement of the SUT (Larsen et al. 2017). Refinement checking is solved as a two-player timed game, where one player (playing the “whenever” transitions of the forthcoming Definition 7) wins if the mutant is not a refinement of the system (the mutant is killed) and the other player (playing the “then” transitions of Definition 7) wins if the mutant is a refinement (the mutant is alive). If the mutant is not a refinement, then the counterexample represents the test case that distinguishes the mutant from the SUT.

In what follows, we consider the mutation operators mentioned in Section 3.3 and state under which conditions their application results in a refinement of the original model. First-order mutations were shown to offer a higher fault-revealing ability (Papadakis and Malevris 2010). Our theoretical results hold for first-order and also higher order mutations. As such, when proving refinement relations, we consider the general case where mutations are applied to mutants of the SUT (either subsumed or not). Similarly, our work generalises to the case where the original model represents the behaviour of not only one system, but of a whole product line of systems. Thus, our theoretical developments are defined over FTG rather than single TG. To summarise, all results described hereafter apply to (1) any-order mutations and (2) families of systems.

4.1 Subsumed Mutants

To begin with, we formalise the notion of refinement between TG, adapted from David et al. (2010a) and Larsen et al. (2017). In Larsen et al. (2017), real-time systems are modelled as timed I/O automata, in which input actions are defined as controllable and output actions are defined as uncontrollable. The main idea is to perform a refinement check between the mutant and the system model, using Ecdar (David et al. 2010b), which is a tool built on top of Uppaal TIGA (Behrmann et al. 2007) that implements the timed interface theory from David et al. (2010a). Basically, a refinement model (i.e., a live mutant) must be able to mimic all controllable transitions of the original system model, while the original model must be able to mimic all uncontrollable transitions of the refinement. In our case, controllable transitions correspond to inputs (since a live mutant must accept all inputs that the original system accepts), whereas uncontrollable transitions correspond to outputs and delays (since a live mutant should not exhibit any behaviour that does not belong to the system). Note that this is the opposite of the standard notion of modal refinement, where the inputs are seen as sent by an uncontrolled environment (Larsen et al. 2007). In other words, here the viewpoint is switched to the environment (David et al. 2010a; Larsen et al. 2017).

Definition 7 (Refinement)

A TG tg₁ = (Loc₁,Act₁,C₁,Trans₁,ℓ₀₁,Inv₁,AP₁,L₁) is a refinement of a TG tg₂ = (Loc₂,Act₂,C₂,Trans₂,ℓ₀₂,Inv₂,AP₂,L₂), denoted as tg₁ ≼ tg₂, if there exists a binary relation $R \subseteq (\textit {Loc}_{1}, \textit {Val}(C_{1})) \times (\textit {Loc}_{2}, \textit {Val}(C_{2}))$ that contains s = ((ℓ₀₁,v₀₁),(ℓ₀₂,v₀₂)) and is such that for each pair of locations and clock valuations ((ℓ₁,v₁),(ℓ₂,v₂)) ∈ R, it holds:

1. whenever $({\ell }_{2},{v}_{2}) \xrightarrow {\alpha } (\ell ^{\prime }_{2},{v}_{2})$ for some $\ell ^{\prime }_{2}$ and $\alpha \in \textit {Act}^{c}_{2}$, then $({\ell }_{1},{v}_{1}) {\xrightarrow {\alpha }} (\ell ^{\prime }_{1},{v}_{1})$ for some $\ell ^{\prime }_{1}$, $\alpha \in \textit {Act}^{c}_{1}$ and $((\ell ^{\prime }_{1},{v}_{1}) ,(\ell ^{\prime }_{2},{v}_{2})) \in R$
2. whenever $({\ell }_{1},{v}_{1}) {\xrightarrow {\alpha }} (\ell ^{\prime }_{1},{v}_{1})$ for some $\ell ^{\prime }_{1}$ and $\alpha \in \textit {Act}^{u}_{1}$, then $({\ell }_{2},{v}_{2}) {\xrightarrow {\alpha }} (\ell ^{\prime }_{2},{v}_{2})$ for some $\ell ^{\prime }_{2}$, $\alpha \in \textit {Act}^{u}_{2}$ and $((\ell ^{\prime }_{1},{v}_{1}) ,(\ell ^{\prime }_{2},{v}_{2})) \in R$
3. whenever $({\ell }_{1},{v}_{1}) {\xrightarrow {\delta }} ({\ell }_{1},v^{\prime }_{1})$ for some $v^{\prime }_{1}$ and $\delta \in \mathbb {R}_{\geq 0}$, then $({\ell }_{2},{v}_{2}) {\xrightarrow {\delta }} ({\ell }_{2},v^{\prime }_{2})$ for some $v^{\prime }_{2}$ and $(({\ell }_{1},v^{\prime }_{1}) ,({\ell }_{2},v^{\prime }_{2})) \in R$
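
To make the alternation in Definition 7 concrete, the following Python sketch checks an untimed analogue of its first two clauses on finite deterministic transition systems. It is an illustration only and not the timed-game algorithm implemented in Ecdar: clocks, invariants and the delay clause are ignored. The function name `refines`, the dictionary encoding and the coffee-machine actions are assumptions introduced here.

```python
from itertools import product

def refines(t1, init1, t2, init2, controllable):
    """Untimed analogue of the action clauses of Definition 7.

    t1, t2: dict mapping each state to a dict {action: successor}
    (deterministic; states without outgoing transitions map to {}).
    Returns True iff system 1 refines system 2.
    """
    # Greatest fixed point: start from all pairs and drop violating ones.
    rel = set(product(t1, t2))
    changed = True
    while changed:
        changed = False
        for (s1, s2) in list(rel):
            ok = True
            # Controllable moves of system 2 must be matched by system 1.
            for a, n2 in t2[s2].items():
                if a in controllable:
                    n1 = t1[s1].get(a)
                    if n1 is None or (n1, n2) not in rel:
                        ok = False
            # Uncontrollable moves of system 1 must be matched by system 2.
            for a, n1 in t1[s1].items():
                if a not in controllable:
                    n2 = t2[s2].get(a)
                    if n2 is None or (n1, n2) not in rel:
                        ok = False
            if not ok:
                rel.discard((s1, s2))
                changed = True
    return (init1, init2) in rel

# Hypothetical original model and two mutants of it.
inputs = {"coin?", "cancel?"}
orig = {"s0": {"coin?": "s1"}, "s1": {"coffee!": "s0"}}
no_output = {"s0": {"coin?": "s1"}, "s1": {}}  # output removed
extra_input = {"s0": {"coin?": "s1"},
               "s1": {"coffee!": "s0", "cancel?": "s0"}}  # input added

print(refines(no_output, "s0", orig, "s0", inputs))
print(refines(extra_input, "s0", orig, "s0", inputs))
print(refines(orig, "s0", no_output, "s0", inputs))
```

In this toy setting the first two calls return `True` and the third returns `False`: removing an output or adding an input yields a refinement, whereas the original model does not refine the mutant that lost its output.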

We now provide a definition of subsumed mutant, where Op_fmm is the set of mutations. Basically, after applying an additional mutation the resulting mutant is a refinement of the former one on which the additional mutation was not applied.

Definition 8 (Subsumed mutant)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$. We say that m differs from $m^{\prime }$ by op iff $m=m^{\prime } \cup \textit {op}$ for some op ∈ Op_fmm. Moreover, we say that m is subsumed by $m^{\prime }$ iff $\textit {ftg}\! _{|m} \preceq \textit {ftg}\! _{|m^{\prime }}$, and we say that it is non-subsumed otherwise.

Example 6

Consider the FTG ftg_fmm of Example 5, reproduced in Fig. 6(left), and its mutants $m_{1} = \{s,\textit {tmi}_{t_{1}}\}$ and $m^{\prime }_{1} = \{s\}$, depicted in Figs. 6(top-right) and 6(bottom-right), respectively.

In this case, m₁ differs from $m^{\prime }_{1}$ by $\textit {tmi}_{t_{1}}$. Moreover, let $tg_{1} = \textit {ftg}_{\textit {fmm}}\! _{|m_{1}}$ and $tg_{2} = \textit {ftg}_{\textit {fmm}}\! _{|m^{\prime }_{1}}$. It holds that tg₁ ≼ tg₂, i.e., tg₁ is subsumed by tg₂. Indeed, for all values v of x in the interval [0,5], the three points of Definition 7 hold for $(({s_{1}}_{tg_{1}},{v}_{tg_{1}}),({s_{1}}_{tg_{2}},{v}_{tg_{2}}))$, and there is no configuration (s₁,v) with v > 5 because it would violate the invariant of s₁.

We only consider deterministic TG, as usual (Larsen et al. 2017; Aichernig et al. 2015; Aichernig et al. 2013). The following proposition identifies conditions under which a mutant is subsumed.

Proposition 1

(Basile et al. 2020a) Let ftg be an FTG, let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$, and let m differ from $m^{\prime }$ by op. Then m is a subsumed mutant of $m^{\prime }$ iff op has introduced either less uncontrollable or more controllable behaviour (or trivially if the behaviour is unchanged).

In the remainder of this section, we present several results for identifying mutations that generate subsumed mutants by construction. Proof (sketches) can be found in Basile et al. (2020a). We start with those operations that were proposed by Fabbri et al. (1999), followed by the novel ones introduced in this paper.

TMI mutation

The TMI mutation is used to remove a transition from the system. The following lemma shows that, when an uncontrollable transition is removed from a mutant, the resulting mutant is by construction subsumed by the original one.

Lemma 1 (TMI Subsumed)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {tmi}_{t}\} \cup m^{\prime } $ for some $t \in \textit {Trans}_{\textit {ftg}\! _{|m^{\prime }}}$ with action in Act^u. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.

Example 7

We illustrate the usefulness of this result. Recall that test-case generation is more effective if the number of subsumed mutants is minimised. Continuing the previous example, since t₁ is an uncontrollable transition, Lemma 1 implies that $\textit {ftg}_{\textit {fmm}}\! _{|m_{1}}$ is subsumed by $\textit {ftg}_{\textit {fmm}}\! _{|m^{\prime }_{1}}$, i.e., this is not a good candidate mutation for the configuration $m^{\prime }_{1}$.

TAD mutation

The TAD mutation is used to add a transition to the system. The next lemma shows that adding a controllable transition to a mutant yields a mutant that is subsumed by the original one.

Lemma 2 (TAD Subsumed)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {tad}_{t}\} \cup m^{\prime } $ for some t with action in Act^c. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.
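
Lemmas 1 and 2 depend only on the kind of mutation and on whether the affected action is controllable, so they can be checked without exploring the model. The sketch below shows one possible way to filter candidate mutations accordingly. The function name, the string encoding of mutation kinds and the action names are hypothetical and not taken from the implementation evaluated in this paper.

```python
def statically_subsumed(kind, action, controllable_actions):
    """Return True if Lemma 1 or Lemma 2 guarantees a subsumed mutant.

    kind: "tmi" (transition removed) or "tad" (transition added).
    A result of False only means that these two lemmas give no guarantee;
    it does not prove that the mutant is non-subsumed.
    """
    is_controllable = action in controllable_actions
    if kind == "tmi":
        return not is_controllable  # Lemma 1: uncontrollable transition removed
    if kind == "tad":
        return is_controllable      # Lemma 2: controllable transition added
    raise ValueError(f"unsupported mutation kind: {kind}")

# Hypothetical action names: inputs end with "?", outputs with "!".
inputs = {"coin?", "cancel?"}
candidates = [("tmi", "coffee!"), ("tmi", "coin?"),
              ("tad", "cancel?"), ("tad", "beep!")]

# Keep only the mutations that are not known to be subsumed.
kept = [c for c in candidates if not statically_subsumed(c[0], c[1], inputs)]
print(kept)
```

Here only the removal of the input `coin?` and the addition of the output `beep!` are kept as candidates for test-case generation.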

SMI mutation

The state-missing (SMI) mutation removes a location other than the initial one from the system. This is equivalent to making the location unreachable, i.e., removing all its incoming transitions. Hence, the results on TMI can be applied. The following lemma shows when this mutation produces a subsumed mutant.

Lemma 3 (SMI Subsumed)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {smi}_{\ell }\} \cup m^{\prime } $ for some $\ell \in \textit {Loc}_{\textit {ftg}\! _{|m^{\prime }}}$. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$ if there exists no transition t with target location ℓ and action α ∈ Act^c.
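
The condition of Lemma 3 amounts to a scan over the incoming transitions of the removed location. A minimal sketch, assuming transitions are stored as (source, action, target) triples and that the caller has already excluded the initial location; the `Transition` type and all names in the example are illustrative assumptions.

```python
from typing import NamedTuple

class Transition(NamedTuple):
    source: str
    action: str
    target: str

def smi_subsumed(location, transitions, controllable_actions):
    """Lemma 3: removing `location` yields a subsumed mutant if no
    transition with a controllable action has `location` as target."""
    return not any(
        t.target == location and t.action in controllable_actions
        for t in transitions
    )

# Hypothetical model: s1 is entered by an input, s2 by an output.
inputs = {"coin?"}
transitions = [
    Transition("s0", "coin?", "s1"),
    Transition("s1", "coffee!", "s2"),
    Transition("s2", "reset!", "s0"),
]

print(smi_subsumed("s1", transitions, inputs))
print(smi_subsumed("s2", transitions, inputs))
```

Removing `s2` is covered by the lemma because it is only entered through the output `coffee!`, whereas removing `s1` is not, since the input `coin?` targets it.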

We continue with the mutation operators that were first introduced in Basile et al. (2020a).

CXL mutation

We first turn our attention to the mutation CXL, which increases the constant of a clock constraint. The next lemma shows when the mutation operator CXL applied to a transition produces a mutant that is subsumed.

Lemma 4 (CXL Subsumed Transitions)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {cxl}_{t}\} \cup m^{\prime } $ for some $t \in \textit {Trans}_{\textit {ftg}\! _{|m^{\prime }}}$ with source ℓ and either (i) action in Act^c and guard g = x ≤ k; or (ii) action in Act^u, guard g = x == k and Inv(ℓ) = x ≤ k; or (iii) action in Act^u and guard g = x ≥ k. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.

Example 8

Recall from Example 5 the mutation operator CXL applied to the FTG ftg_fmm that is reproduced in Fig. 7(left), and now consider its mutants $m_{1} = \{t,\textit {cxl}_{t_{2}},\textit {cxs}_{s_{1}}\}$ and $m^{\prime }_{1} = \{t,\textit {cxs}_{s_{1}}\}$, depicted in Figs. 7(top-right) and 7(bottom-right), respectively. Since t₂ is an uncontrollable transition, Lemma 4(iii) implies that $\textit {ftg}_{\textit {fmm}}{\! _{|{m_{1}}}}$ is subsumed by $\textit {ftg}_{\textit {fmm}}{\! _{|{m^{\prime }_{1}}}}$, i.e., this is not a good candidate mutation for the configuration $m^{\prime }_{1}$.
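
The three cases of Lemma 4 can be encoded as a small predicate over the guard of the mutated transition and the invariant of its source location. The sketch below assumes a single clock x, constraints given as (operator, constant) pairs, and reads case (ii) as an equality guard x == k whose constant coincides with that of the invariant x ≤ k. The function name and the encoding are assumptions made for illustration.

```python
def cxl_transition_subsumed(controllable, guard, source_invariant=None):
    """Sufficient conditions of Lemma 4 for CXL applied to a transition.

    guard, source_invariant: (operator, constant) pairs on one clock x,
    e.g. ("<=", 5). False means the lemma gives no guarantee.
    """
    op, k = guard
    if controllable:
        return op == "<="                      # case (i)
    if op == "==":
        return source_invariant == ("<=", k)   # case (ii)
    return op == ">="                          # case (iii)

print(cxl_transition_subsumed(True, ("<=", 5)))              # case (i)
print(cxl_transition_subsumed(False, ("==", 5), ("<=", 5)))  # case (ii)
print(cxl_transition_subsumed(False, (">=", 3)))             # case (iii)
print(cxl_transition_subsumed(False, ("<=", 5)))             # not covered
```

The first three calls return `True` and the last one `False`. In case (ii), increasing the constant of the equality guard beyond the invariant bound makes the uncontrollable transition impossible to fire, which is why the mutant exhibits less uncontrollable behaviour.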

Finally, the next lemma identifies the conditions under which applying CXL on an invariant yields a subsumed mutant.

Lemma 5 (CXL Subsumed Invariants)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {cxl}_{\ell }\} \cup m^{\prime } $ for some location $\ell \in \textit {Loc}_{\textit {ftg}\! _{|m^{\prime }}}$ with Inv(ℓ) = x ≥ k and for all valuations v of clock x such that $k < v \leq k^{\prime }$, where $k^{\prime }$ is the mutated constant, (ℓ,v) can only be reached through a transition with action in Act^u and target ℓ. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.

Example 9

Consider Fig. 8 and assume that tg₁ = $\textit {ftg}\! _{|m}$ and $tg_{2}=\textit {ftg}\! _{|m^{\prime }}$, for some ftg and $m = \{cxl_{s_{1}}\} \cup m^{\prime }$, where cxl increases by one unit the constant of the clock constraint of location s₁. By Lemma 5, it holds that tg₁ is subsumed by tg₂.

CXS mutation

We now turn our attention to the mutation CXS that decreases the constant of a clock constraint. The next lemma shows that the mutation operator CXS applied to a guard of the form x ≤ k of an uncontrollable transition or to a guard of the form x ≥ k of a controllable transition produces a mutant that is subsumed.

Lemma 6 (CXS Subsumed Transitions)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{\textit {cxs}_{t}\} \cup m^{\prime } $ for some $t \in \textit {Trans}_{\textit {ftg}\! _{|m^{\prime }}}$ with either (i) action in Act^u and guard g = x ≤ k; or (ii) action in Act^c and guard g = x ≥ k. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.
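
One way to see why Lemma 6 follows from Proposition 1 is to compare the set of clock values that enable the guard before and after the mutation. The sketch below is an informal aid under simplifying assumptions (one clock, guards of the form x ≤ k or x ≥ k, and no interaction with invariants beyond possibly leaving the behaviour unchanged). The function names are hypothetical.

```python
def guard_change(op, k, k_new):
    """Compare the clock values satisfying `x op k` and `x op k_new`."""
    if k_new == k:
        return "same"
    if op == "<=":
        return "grows" if k_new > k else "shrinks"
    if op == ">=":
        return "shrinks" if k_new > k else "grows"
    raise ValueError(f"unsupported operator: {op}")

def guarantees_subsumption(controllable, change):
    """Sufficient reading of Proposition 1: the guard enables more
    controllable or less uncontrollable behaviour (or nothing changes).
    False means no guarantee is obtained this way."""
    if change == "same":
        return True
    return (controllable and change == "grows") or (
        not controllable and change == "shrinks"
    )

# CXS decreases the constant, here from 5 to 4.
for controllable, op in [(False, "<="), (True, ">="), (True, "<="), (False, ">=")]:
    change = guard_change(op, 5, 4)
    print(controllable, op, change, guarantees_subsumption(controllable, change))
```

The first two combinations correspond to cases (i) and (ii) of Lemma 6 and are reported as subsumed, while the last two are not covered. Calling `guard_change` with an increased constant reproduces cases (i) and (iii) of Lemma 4 in the same way.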

Finally, the next lemma identifies the conditions under which the application of CXS on an invariant yields a subsumed mutant.

Lemma 7 (CXS Subsumed Invariants)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and $m=\{cxs_{\ell }\} \cup m^{\prime } $ for some location $\ell \in \textit {Loc}_{\textit {ftg}\! _{|m^{\prime }}}$ and either (i) Inv(ℓ) = x ≤ k and for all valuations v of clock x such that $k^{\prime } \leq v < k$, where $k^{\prime }$ is the mutated constant, (ℓ,v) can only be reached through a transition with action in Act^u and target ℓ; or (ii) Inv(ℓ) = x ≥ k and for all valuations v of clock x such that $k^{\prime } \leq v < k$, where $k^{\prime }$ is the mutated constant, (ℓ,v) can only be reached through a transition with action in Act^c and target ℓ. Then $\textit {ftg}\! _{|m}$ is subsumed by $\textit {ftg}\! _{|m^{\prime }}$.

Example 10

Consider again Fig. 8, now assuming that $tg_{3} = \textit {ftg}\! _{|m^{\prime }}$ and tg₄ = $\textit {ftg}\! _{|m}$, for some ftg and $m = \{cxs_{s_{1}}\} \cup m^{\prime }$, where cxs decreases by one unit the constant of the clock constraint of location s₁. By Lemma 7, it holds that tg₄ is subsumed by tg₃.

4.2 Auxiliary Results on Non-subsumed Mutants

Although our focus is on detecting subsumed mutants, the developed theory is also helpful in spotting when a specific mutation yields by construction a non-subsumed mutant. The following auxiliary results target non-subsumed mutants and complement the results stated so far (from Basile et al. 2020a). At this point, it is important to note that the experiments in Section 5 will only exploit results on subsumed mutants to discard ineffective mutations. However, the auxiliary results presented in this section could be exploited to perform a different evaluation: instead of discarding subsumed mutants, only generate (statically known) non-subsumed ones. This evaluation is harder, because it is possible only in specific cases to statically detect when a mutation is non-redundant. This is left for future work. A TG is said to be non-redundant if every location ℓ is reachable in at least one trace, it is not time-locked (i.e., delay is possible), and every transition is executable in at least one trace. For the non-subsumed lemmata, we will only consider non-redundant TG. Indeed, mutating a redundant element may produce a subsumed mutant. Note that redundant specifications are ill-defined and should be amended prior to any application, be it testing, model checking or any other.

We introduce the auxiliary definition of non-redundancy preserving mutation. This will be useful for the forthcoming results about non-redundant higher order mutations, whose hypothesis is that the mutated mutant is non-redundant.

Proof (sketches) of the results already reported earlier can be found in Basile et al. (2020a).

Definition 9 (Non-redundancy preserving mutation)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$ and with m that differs from $m^{\prime }$ by the application of some mutation op ∈ Op_fmm. Then, m is a non-redundancy preserving mutation of $m^{\prime }$ iff $\textit {ftg}{\! _{|{m^{\prime }}}}$ is non-redundant implies $\textit {ftg}\! _{|m}$ is non-redundant.

Below follow two generic results on the preservation of non-redundancy by any mutation applied to a location or to a transition. Note that these results operate at the syntactic level (i.e., statically). This means that the information on the specific values of clocks is missing. This information is only known at the semantic level (i.e., during the execution), where states are pairs of locations and clock valuations. However, under specific hypotheses, it is possible to infer (statically) the values of clocks. Intuitively, if all clocks are reset when entering a location, the clock valuation when entering the location is statically known to be zero, and it is possible to predict whether guards and invariants will be satisfied. This allows us to provide a statically checkable result on non-redundancy preservation for a generic mutation involving a location.

Proposition 2 (Non-redundancy preserving mutation on location)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$, and with m that differs from $m^{\prime }$ by the application of some mutation op ∈ Op_fmm mutating a location ℓ, with invariant Inv(ℓ).

If there exists a transition $t^{\prime }=(\ell _{t^{\prime }}, g_{t^{\prime }}, \alpha _{t^{\prime }}, R_{t^{\prime }}, \ell )$, for some $\ell _{t^{\prime }},g_{t^{\prime }},\alpha _{t^{\prime }}, R_{t^{\prime }}$ and with $R_{t^{\prime }}=C$, such that v₀ ⊧ Inv(ℓ), and for all transitions $\hat t$ with source ℓ, it holds that $\textit {Inv}(\ell ) \wedge g_{\hat t} \not \models \textit {false}$ and $R_{\hat t}=C$, then $\textit {ftg}\! _{|m^{\prime }}$ is non-redundant implies $\textit {ftg}\! _{|m}$ is non-redundant.

Proof

(sketch) By assumption $t^{\prime }$ is non-redundant, so ℓ is reachable through $t^{\prime }$. By assumption, when reaching ℓ through $t^{\prime }$, Inv(ℓ) is satisfied. Moreover, all guards of outgoing transitions of ℓ at some point are satisfied by hypothesis. By the fact that all variables are reset in each transition, it holds that the behaviour of the underlying transition system is unchanged, thus $\textit {ftg}\! _{|m}$ is non-redundant. □

We also provide a statically checkable result on non-redundancy preservation for a generic mutation involving a transition.

Proposition 3 (Non-redundancy preserving mutation on transition)

Let ftg be an FTG and let $[\![\varphi ]\!]$ be the set of mutants with $m,m^{\prime } \in [\![\varphi ]\!]$, and with m that differs from $m^{\prime }$ by the application of some mutation op ∈ Op_fmm mutating a transition $t=(\ell , g_{t}, \alpha _{t}, R_{t}, \ell ^{\prime }_{t})$ for some ℓ, g_t, α_t, R_t, $\ell ^{\prime }_{t}$ such that R_t = C. If $\textit {Inv}(\ell ) \wedge g_{t} \not \models \textit {false}$, then $\textit {ftg}\! _{|m^{\prime }}$ is non-redundant implies $\textit {ftg}\! _{|m}$ is non-redundant.

Proof

(sketch) By assumption ℓ is reachable, and at some point transition t can be fired. By the fact that all variables are reset in t, it holds that the behaviour of the underlying transition system is unchanged, thus $\textit {ftg}\! _{|m}$ is non-redundant. □
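
The premises of Proposition 3 are purely syntactic and can be checked with simple interval reasoning. The following sketch assumes a single clock x and non-strict constraints of the form x ≤ k, x ≥ k or x == k, given as (operator, constant) pairs; the guard and invariant are taken as supplied by the caller. The function names and the encoding are assumptions made for illustration.

```python
import math

def satisfiable(constraints):
    """True iff some clock value x >= 0 satisfies every (op, k) constraint."""
    low, high = 0.0, math.inf
    for op, k in constraints:
        if op == "<=":
            high = min(high, k)
        elif op == ">=":
            low = max(low, k)
        elif op == "==":
            low, high = max(low, k), min(high, k)
        else:
            raise ValueError(f"unsupported operator: {op}")
    return low <= high

def prop3_applies(invariant, guard, resets, clocks):
    """Static premises of Proposition 3: the transition resets every
    clock (R_t = C) and Inv(l) together with g_t is satisfiable."""
    return set(resets) == set(clocks) and satisfiable(invariant + guard)

clocks = {"x"}
print(prop3_applies([("<=", 5)], [(">=", 3)], {"x"}, clocks))  # premises hold
print(prop3_applies([("<=", 5)], [(">=", 6)], {"x"}, clocks))  # guard excluded by invariant
print(prop3_applies([("<=", 5)], [(">=", 3)], set(), clocks))  # clock not reset
```

Only the first call returns `True`. In the other two cases the proposition gives no guarantee, and non-redundancy of the mutant would have to be established by other means.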