Introduction

Statistical process monitoring (SPM) schemes are used from monitoring production and manufacturing processes (e.g. Gupta et al. 2018) to monitoring project performance (e.g. Mortaji et al. 2017) to monitoring profiles (e.g. Zakour and Taleb 2017). In SPM, two sources of variation are distinguished. On the one hand, we have chance (or common) causes of variation and on the other, special (or assignable) causes of variation. Common causes of variation are unavoidable and can be found in any process. A quality process that runs in the occurrence of common causes only is said to be in-control (IC). However, when the quality process runs in the occurrence of special causes of variation the quality process is said to be out-of-control (OOC). In this case, the causes of variation must be identified and removed as soon as possible. The faster a scheme is in detecting an OOC state, the more efficient it is (cf. Montgomery 2013).

A basic Shewhart \(\bar{X}\) scheme is known to be more efficient (or sensitive) in unmasking large shifts (i.e. changes) in the location process parameter. However, it is relatively insensitive in unmasking small and moderate shifts. This popular scheme gives a signal if a single sample mean (or point) falls beyond the upper or lower control limits (UCL and LCL) defined by

$${\textit{UCL/LCL}} = \mu_{0} \pm k\sigma_{0} ,$$
(1)

where \(\mu_{0}\) and \(\sigma_{0,}\) are the IC process mean and standard deviation, respectively, and \(k\) is a charting constant that is typically found such that some IC metric [such as the average run-length (ARL)] is equal to a pre-specified value. In order to improve the sensitivity of the basic \(\bar{X}\) scheme in detecting smaller shifts, Wu and Spedding (2000a) proposed a synthetic \(\bar{X}\) scheme for monitoring the location (or mean) process parameter which consists of two sub-charts, one, a basic \(\bar{X}\) sub-chart and a second, a conforming run-length (hereafter CRL) sub-chart. For a basic synthetic scheme, an OOC signal is not based on a single charting statistic (i.e. point) plotting beyond the threshold values given by Eq. (1). However, when a point plots beyond the threshold values defined in Eq. (1), the corresponding sample is marked as a “nonconforming sample” and the monitoring procedure moves to the second sub-chart where an OOC signal is obtained depending on the outcome of the CRL sub-chart. Note that whenever a point falls between LCL and UCL, the corresponding sample is marked as a “conforming sample” (cf. Wu and Spedding 2000a, b). Bourke (1991) defines a CRL as the number of conforming samples (or points) between two successive nonconforming points, including the nonconforming point at the end. Figure 1 illustrates an example with CRL = 2, CRL = 5 and CRL = 3.

Fig. 1
figure 1

\({\textit{CRL}}\) values

Note that whenever we do not get any conforming point between two nonconforming points, the CRL value is equal to one (i.e. CRL = 1). The control limit of the CRL sub-chart is denoted by H (where H is a positive integer greater or equal to 1). Thus, the CRL sub-chart gives a signal whenever the CRL value is less than or equal to H (cf. Huang and Chen 2005; Wu et al. 2010; Guo et al. 2015). To make the computation of the run-length distribution of the synthetic scheme easier, Davis and Woodall (2002) showed that “a synthetic chart is a special case of a run-rule scheme, i.e. a 2-of-(H + 1) rule with a head-start (HS) feature”. The standard 2-of-(H + 1) rule gives an OOC signal when two consecutive plotting statistics, out of \(H + 1\) consecutive plotting statistics, plot above (below) the UCL (LCL) where H is a positive integer greater or equal to 1. The HS feature implies that at time 0 the first sample is assumed to be nonconforming; therefore, at least one other nonconforming sample is needed within the following H sampling points, for a 2-of-(H + 1) runs-rules scheme to issue an OOC signal (cf. Shongwe and Graham 2016).

Before proceeding any further, let us acknowledge that synthetic charts have received a lot of criticism in the literature (Knoth 2016). Knoth (2016) advised against the use of synthetic charts, however, Knoth (2016) only considered one type of synthetic chart and, it has been shown in Shongwe and Graham (2017a), that there are actually four types of synthetic charts and that the other three types outperform the type considered by Knoth (2016). It is highly recommended that the use of synthetic charts be investigated further, i.e. a thorough investigation of the other three types of synthetic charts should be done and compared to Knoth (2016)’s findings. Thus, it is of our opinion that synthetic charts should not yet be discarded, as recommended by Knoth (2016), and the abovementioned reasons are motivation to continue developing synthetic monitoring schemes even after Knoth (2016)’s warning not to do so.

Besides the basic design of the synthetic schemes, synthetic schemes that are based on the sub-chart limits in Eq. (1) can be classified into four principal types, which are given as follows:

  1. 1.

    the NSS synthetic scheme gives an OOC signal when two nonconforming points, out of \(H + 1\) consecutive points, plot beyond the threshold values given in Eq. (1) no matter whether one (or both) of the nonconforming points lie(s) above the UCL and the other (or both) lie(s) below the LCL, which are separated by at most \(H - 1\) conforming points that plot between the LCL and the UCL (Wu and Spedding 2000a). The control charting regions of the NSS scheme are shown in Fig. 2a. From the operation of the NSS synthetic scheme, the \({\textit{CRL}}\) value can be defined as the number of conforming points that plot between the \({\textit{LCL}}\) and UCL in Fig. 2a that are plotted in between the two successive nonconforming points, irrespective of whether one (or both) fall above the UCL and the other (or both) below the LCL.

    Fig. 2
    figure 2

    Different regions of the Burr-type XII \(\bar{X}\) sub-chart

  2. 2.

    The standard side-sensitive (SSS) synthetic scheme gives an OOC signal when two nonconforming points, out of \(H + 1\) consecutive points, plot above (below) the UCL (LCL) which are separated by at most \(H - 1\) points that plot below (above) the UCL (LCL), respectively (Davis and Woodall 2002). The control charting zones (or regions) of the SSS scheme are shown in Fig. 2b. From the operation of the SSS synthetic scheme, two different types of \({\textit{CRL}}\)s denoted \({\textit{CRL}}_{{{\bar{\text{L}}}}}\) and \({\textit{CRL}}_{{{\bar{\text{U}}}}}\) can be defined. The \({\textit{CRL}}_{{{\bar{\text{L}}}}}\) value is the number of conforming samples that fall above the \({\textit{LCL}}\) in Fig. 2b that are plotted in between the two consecutive nonconforming points below the LCL (i.e. in region L), including the nonconforming point at the end, whereas the \({\textit{CRL}}_{{{\bar{\text{U}}}}}\) value is the number of conforming samples that fall below the \({\textit{UCL}}\) in Fig. 2b, that are plotted in between the two consecutive nonconforming points above the UCL (i.e. in region U), including the nonconforming point at the end.

  3. 3.

    The revised side-sensitive (RSS) synthetic scheme gives an OOC signal when two nonconforming points, out of \(H + 1\) successive points, plot above (below) the UCL (LCL) which are separated by at most \(H - 1\) conforming points that plot between the LCL and the UCL (Machado and Costa 2014). The control charting regions of the RSS scheme are shown in Fig. 2b. From the operation of the RSS synthetic scheme, two different types of \({\textit{CRL}}\)s denoted \({\textit{CRL}}_{{\check{L}}}\) and \({\textit{CRL}}_{{\check{\text{U}}}}\) can also be defined. The \({\textit{CRL}}_{{\check{\text{L}}}}\) is the number of conforming samples that fall within region I in Fig. 2b that are plotted in between the two consecutive nonconforming points below the LCL (i.e. in region L), including the nonconforming point at the end, whereas the \({\textit{CRL}}_{{\check{\text{U}}}}\) is the number of conforming points that within region I in Fig. 2b that are plotted in between the two consecutive nonconforming points above the UCL (i.e. in region U), including the nonconforming point at the end.

  4. 4.

    The modified side-sensitive (MSS) synthetic scheme gives an OOC signal when two nonconforming points, out of \(H + 1\) successive points, plot above (below) the UCL (LCL) which are separated by at most \(H - 1\) conforming points that plot between the CL and the UCL (LCL), respectively (Shongwe and Graham 2016, 2018). The control charting regions of the MSS scheme are shown in Fig. 2c. From the operation of a MSS scheme, two types of \({\textit{CRLs}}\) which are: the lower \({\textit{CRL}}\) (denoted as \({\textit{CRL}}_{\text{L}}\)) and the upper (denoted as \({\textit{CRL}}_{\text{U}}\)) are needed. A \({\textit{CRL}}_{\text{L}}\) is the number of lower conforming points (i.e. conforming points that fall within region 3 in Fig. 2c) that are plotted in between the two consecutive lower nonconforming points (i.e. nonconforming points that fall below the LCL, including the lower nonconforming point at the end). However, a \({\textit{CRL}}_{\text{U}}\) is the number of upper conforming points (i.e. conforming points that fall within region 2 in Fig. 2c) that are plotted between the two consecutive upper nonconforming points (i.e. nonconforming points that fall above the UCL, including the nonconforming point at the end). Note that the absence of a conforming point implies that either the \({\textit{CRL}}_{\text{U}}\) or \({\textit{CRL}}_{\text{L}}\) equals one.

The classical NSS and SSS \(\bar{X}\) synthetic schemes (i.e. NSS and SSS \(\bar{X}\) synthetic schemes for  normal data) were first proposed by Wu and Spedding (2000a, b) and Davis and Woodall (2002), respectively. Later on, Machado and Costa (2014) proposed a classical RSS \(\bar{X}\) synthetic scheme. More recently, several authors have pointed out the need to develop synthetic schemes (Lee and Khoo 2017; Shongwe and Graham 2017b, c, 2018). Lee and Khoo (2017) investigated the performance of the synthetic double sampling S scheme, which was found to perform better than the existing double sampling S scheme for a wide range of shifts. Shongwe and Graham (2017b, c) studied the zero-state and steady-state run-length characteristics of synthetic and runs-rules \(\bar{X}\) schemes, respectively. Later on, Shongwe and Graham (2018) proposed the MSS synthetic scheme for monitoring the location parameter. The above-mentioned schemes are called parametric (or classical) schemes since they are based on the normality assumption. It is well known that parametric schemes are not IC robust and they are relatively inefficient under the violation of the normality assumption. Therefore, there is a need of developing nonparametric schemes and adaptive schemes based on flexible probability distributions. The Burr-type XII distribution can be used for this purpose since it can represent any type of unimodal distribution (Malela-Majika et al. 2018b; Wooluru et al. 2016).

In these last few decades, an important discussion amongst SPM researchers is whether to monitor process shifts using traditional monitoring schemes (in the form of traditional control charts) or using support vector machines (Du et al. 2012, 2013; Du and Lv 2013). Du and Lv (2013) stated that “Support vector machine (SVM) has recently become a new generation learning system based on recent advances on statistical learning theory for solving a variety of learning, classification and prediction problems”. They proposed an enhanced minimal Euclidean distance scheme for monitoring process mean shifts of auto-correlated processes and made use of support vector regression (SVR) to predict the values of a variable in time series. SVR is an extension of SVM, and it is a regression method by introduction of an alternative loss function. SVMs have been shown to be effective in minimising both Type I and Type II errors for detecting shifts in auto-correlated processes (Chinnam 2002). SVMs are also very useful as classifiers to identify the source of a change in multivariate processes (Cheng and Cheng 2008). However, since the focus of this paper is not on multivariate or auto-correlated processes, SVMs are not explore further in this paper.

In this paper, NSS, SSS, RSS and MSS \(\bar{X}\) synthetic schemes for non-normal data are introduced in the SPM context. The Burr-type XII (BTXII) distribution is used in the design of the proposed synthetic schemes because of its simplicity and flexibility.

The remainder of this paper is organized as follows: Sect. 2 introduces the basic design of the synthetic BTXII \(\bar{X}\) scheme. The proposed NSS and side-sensitive synthetic BTXII \(\bar{X}\) schemes are introduced in Sect. 3. The zero-state and steady-state characteristics of the run-length distribution are derived using the Markov chain approach. The IC and OOC performances of the proposed schemes are discussed in Sect. 4. The proposed schemes are also compared to their parametric (or classical) counterparts. Section 5 presents a real-life example demonstrating the design and implementation of the proposed synthetic schemes. A summary and some concluding remarks are given in Sect. 6.

Operation and basic design of a BTXII \(\bar{X}\) synthetic scheme for non-normal data

Assume that \(\{ X_{ij} ;\;i \ge 1\}_{j = 1}^{n}\) is a sequence of independent and identically distributed (iid) samples from a normal distribution with IC process mean \(\mu_{0}\) and IC process standard deviation \(\sigma_{0}\). The cumulative distribution function (cdf) of the BTXII distribution is given by Burr (1973), Malela-Majika et al. (2018a)

$$F\left( y \right) = 1 - \frac{1}{{(1 + y^{c} )^{q} }}\;{\text{for}}\;y \ge 0$$
(2)

where \(c\) and \(q\) are greater than one and represent the skewness and kurtosis of the Burr distribution, respectively. There is a relationship between a Burr variable, Y, and any random variable X. For more details, see for example, Burr (1942, 1973) and Chen (2003). Assuming that the random variables \(X\) and \(Y\) have the same skewness and kurtosis, the sample mean can be defined by

$$\bar{X} = \mu_{0} + \left( {Y - M} \right)\frac{{\sigma_{0} }}{S\sqrt n },$$
(3)

where \(\bar{X}\) and \(s_{x}\) represent the sample mean and standard deviation of the data set, respectively, and \(M\) and \(S\) represent the mean and standard deviation of the corresponding BTXII distribution with different shapes. Tables of the expected mean, standard deviation, skewness coefficient and kurtosis coefficient of the Burr distribution for various combinations of BTXII parameters \(c\) and \(q\) are given in Burr (1942, 1973).

The basic synthetic BTXII \(\bar{X}\) scheme signals when a nonconforming sample plots above (or below) the UCL (LCL) of the BTXII \(\bar{X}\) sub-chart and \(CRL \le H\).

The basic synthetic BTXII \(\bar{X}\) scheme operates as follows:

  1. 1.

    At the ith sampling time, take a sample of size n and compute \(\bar{X}_{i}\).

  2. 2.

    If LCL\(< \;\bar{X}_{i}\; < \;{\textit{UCL}}\) then return to Step (1).

  3. 3.

    However, if \(\bar{X}_{i} \; \le \; {\textit{CL}}\) or if \(\bar{X}_{i} \; \ge \;{\textit{UCL}}\) go to Step (4).

  4. 4.

    If CRL ≤ H go to Step (5), otherwise return to Step (1).

  5. 5.

    Issue an OOC signal, and then take necessary corrective actions to find and remove the special cause(s). Then return to Step (1).

Thus, the CRL decreases as p increases, and increases as the fraction nonconforming in a process, p, decreases. Note that the CRL is a geometric random variable. Therefore, the expected value of the CRL, i.e. \(E\;({\textit{CRL}})\), and cdf of the CRL, \(F\;({\textit{CRL}})\), are given by

$$\begin{aligned} & E\;({\textit{CRL}}) = \frac{1}{p} \\ & {\text{and}} \\ & F_{\text{p}} ({\textit{CRL}}) = 1 - \left( {1 - p} \right)^{\textit{CRL}} ,\quad {\textit{CRL}} = 1, 2, 3, \ldots \\ \end{aligned}$$
(4)

respectively. To detect an upward shift in p, it is recommended to set a LCL, say H, for the CRL. If \({\textit{CRL}} \le H\), then there is sufficient evidence that p has increased. Therefore, the CRL sub-chart gives an OOC signal when \({\textit{CRL}} \le H\). At this stage, the average number of CRL required to detect an OOC fraction nonconforming p is given by

$${\textit{ARL}}_{H} = \frac{1}{{F_{p} \left( H \right)}} = \frac{1}{{1 - \left( {1 - p} \right)^{H} }}$$
(5)

where p is the probability of declaring a sample nonconforming, which is given by

$$p = 1 - \frac{1}{{\left[ {1 + \left( {M - S\left( {k - \delta \sqrt n } \right)} \right)^{c} } \right]^{q} }} + \frac{1}{{\left[ {1 + \left( {M + S\left( {k + \delta \sqrt n } \right)} \right)^{c} } \right]^{q} }} .$$
(6)

When δ = 0, the process is in-control.

Thus, the \({\textit{ARL}}\) of the basic synthetic scheme is computed as follows

$${\textit{ARL}}\left( \delta \right) = \frac{1}{{p \times \left[ {1 - \left( {1 - p} \right)^{H} } \right]}}$$
(7)

where p is given by Eq. (6).

To measure the overall performance of the basic synthetic scheme, the average extra quadratic loss (AEQL) is used. Therefore, using Eqs. (6) and (7), the AEQL of the basic synthetic chart is defined by

$${\textit{AEQL}} = \frac{1}{{\delta_{\mathrm{max} } }} \int \limits_{0}^{{\delta_{\hbox{max} } }} \delta^{2} \times \frac{1}{{p \times \left[ {1 - \left( {1 - p} \right)^{H} } \right]}}{\text{d}}\delta .$$
(8)

When comparing the overall performance of two or several monitoring schemes, the scheme with the smallest (or minimum) \({\textit{AEQL}}\) value is considered to be the best.

Operation and design consideration of the NSS and side-sensitive synthetic schemes for non-normal data

In this section, necessary notations are introduced and mathematical foundations of synthetic schemes are presented under the violation of the assumption of normality. These mathematical foundations are later on used to derive the run-length properties of the proposed synthetic schemes using a Markov chain approach.

The operation of the proposed synthetic schemes is given in Table 1.

Table 1 Operation of the different types of synthetic Shewhart-type schemes

Before we construct the transition probability matrices (TPMs) of the synthetic BTXII \(\bar{X}\) schemes, it is important to define the probability that a plotting statistic falls in a specific region. Table 2 gives the probability that a sample mean, \(\bar{X}\), falls in a specific region of two-sided NSS, SSS, RSS and MSS synthetic BTXII Shewhart \(\bar{X}\) schemes.

Table 2 Probability that a plotting statistic falls in a specific region of the synthetic schemes

TPMs for the proposed synthetic schemes

To construct the TPMs of the proposed synthetic schemes, the Markov chain approach is used to construct the compound patterns that result in an OOC event. For instance, each of the four digits 1, 2, 3 and 4 of a MSS synthetic scheme indicates the state of a test sample. The symbol ‘\(\pm\)’ indicates that at time \(t = 0\), the first charting (or plotting) statistic lies either above the UCL or below the LCL. Therefore, the sequence of charting statistics ‘\(423\)’ of a MSS synthetic scheme indicates that in a sequence of three consecutive test samples, the first is a lower nonconforming (i.e. the charting statistic of this sample falls on or below the LCL), the second is an upper conforming (i.e. the charting statistic falls between the CL and UCL) and the third is a lower conforming sample (i.e. the charting statistic falls between the LCL and CL). The sequence of charting statistics ‘\(\pm \;33\)’ indicate that the first charting statistic falls either above the UCL (region 1) or below the LCL (region 4), and the second and third fall between the LCL and CL (region 3).

The compound patterns have ω sequences (or element) having each \(H\) or \(H + 1\) states. For instance, when \(H = 2\), the absorbing state of the NSS and MSS synthetic schemes (denoted by \(\varLambda\)) are given by \(\left\{ {{\text{OO}},\;{\text{OIO}}} \right\}\) and \(\left\{ {121, \;11, \;44, \;434, \; \pm \;1, \; \pm \;4,\; \pm \;21, \; \pm \;34} \right\}\), respectively. The elements of the absorbing state are denoted by \(\varLambda_{1}\), \(\varLambda_{2}\) … and \(\left( {\varLambda_{\omega } } \right)\). To evaluate the zero-state run-length (\({\text{ZSRL}}\)) properties of the proposed synthetic schemes, we decompose the absorbing (or compound) pattern \(\varLambda\) into simple transient sub-patterns, denoted by \(\eta\), of size ς by removing the last state of each element, which means \(\eta = \{ \eta_{1} ,\;\eta_{2} , \ldots ,\eta_{\varsigma } \}\). In our example, the simple transient sub-patterns of the NSS and MSS are given by \(\left\{ {{\text{O}},{\text{OI}}} \right\}\) and \(\left\{ {12, \;1,\; 4,\; 43,\; \pm , \; \pm \;2,\; \pm \;3} \right\}\), respectively. Afterwards, create dummy states denoted \(\phi\), which are defined by \(\left\{ {\text{I}} \right\}\) and \(\left\{ {2,\;3} \right\}\) for the NSS and MSS, respectively. Finally, the state space, denoted by Ω, is the set of all the components. The state space of the NSS and MSS synthetic schemes is given by \(\{ \phi , \;\eta_{1} ,\;\eta_{2} , {\text{OOC}}\}\) and \(\{ \eta_{1} ,\;\eta_{2} ,\; \phi ,\; \eta_{3} ,\;\eta_{4} ,\;\eta_{5} , \eta_{6} ,\;\eta_{7} , \;{\text{OOC}}\}\), respectively, where \(\eta_{5} = \varphi = \{ \pm \} ,\;\eta_{6} = \varphi_{2} = \{ \pm \;2\} \;\eta_{7} = \varphi_{3} = \{ \pm \;3\}\). The state space of the SSS and RSS synthetic schemes is constructed in a similar way. Table 3 presents the decomposition of the TPMs state space of the proposed synthetic schemes.

Table 3 Decomposition of the TPMs state space of a two-sided synthetic schemes for H = 1, 2 and 3

When \(H = 1\) the TPM of the NSS synthetic scheme is given by

(9)

The TPM of the SSS, RSS and MSS synthetic schemes is given by

(10)

In Eq. (10), for the MSS scheme, the probabilities that a charting statistic falls in a specific region are defined as follows:

  • \(p_{u} = p_{1}\) = probability that a charting statistic plots on or above the \({\textit{UCL}}\),

  • \(p_{i} = p_{2} + p_{3}\) = probability that a charting statistic falls between the \({\textit{LCL}}\) and the \({\textit{UCL}}\), and

  • \(p_{l} = p_{4}\) = probability that a charting statistic plots on or below the \({\textit{LCL}}\).

Consequently, \(p_{2} = p_{3} = \frac{{p_{i} }}{2}\). Table 3 yields the TPMs in Table 4 using a look forward approach when \(H = 2\) and 3 where the probabilities are found using the equations in Table 2.

Table 4 TPMs of the proposed synthetic schemes when H = 2 and 3

The construction of the TPMs is similar for any values of H. For any H > 0, the dimension of the TPMs in Table 4 is equal to \(\varsigma + 2\) where \(\varsigma\) is the number of sub-patterns in the compound pattern. Therefore,

$$\left\{ {\begin{array}{*{20}l} {{\text{NSS}}\;{\text{scheme}}:\quad \varsigma + 2 = \left( {H + 1} \right) + 1. \quad {\text{Hence}},\;\varsigma = H} \hfill \\ {{\text{SSS}}\;{\text{scheme}}:\quad \varsigma + 2 = (H^{2} + H + 1) + H + 1. \quad {\text{Hence}},\;\varsigma = H\left( {H + 2} \right)} \hfill \\ {{\text{RSS}}\;{\text{scheme}}:\quad \varsigma + 2 = \left( {2H + 1} \right) + H + 1.\quad {\text{Hence}},\;\varsigma = 3H} \hfill \\ {{\text{MSS}}\;{\text{scheme}}:\quad \varsigma + 2 = \left( {2H + H - 1} \right) + \left( {H + 1} \right) + 1.\quad {\text{Hence}},\;\varsigma = 4H - 1} \hfill \\ \end{array} } \right.$$
(11)

For instance, when \(H = 2\), the TPMs of the NSS, SSS, RSS and MSS synthetic schemes are of size \(4 \times 4\), \(10 \times 10\), \(8 \times 8\) and \(9 \times 9\), respectively.

Table 5 gives the number of sub-patterns in the compound pattern and the dimension of the TPMs (in brackets) of the NSS, SSS, RSS and MSS synthetic schemes for H = 1, 2, 3, 4 and 5. It can be observed that when H = 1, the SSS, RSS and MSS synthetic schemes have the same number of sub-patterns in the compound pattern which means that the TPMs of the SSS, RSS and MSS synthetic schemes have the same dimension. The larger the value of H, the higher the dimension of the TPMs.

Table 5 Number of sub-patterns (ς) and dimension of the TPMs of the NSS, SSS, RSS and MSS synthetic schemes for H = 1, 2, 3, 4 and 5

Run-length characteristics of the NSS and side-sensitive synthetic schemes

Once the TPM has been formulated, we may easily calculate any of the following run-length properties (see Fu and Lou 2003). Therefore, the expected value, probability mass function, cdf and the variance of the run-length distribution are given by

$$E\left( N \right) = \xi_{1 \times \tau } \cdot \varvec{ARL}_{{\left( {\tau \times 1} \right)}} \left( \delta \right)$$
(12)
(13)
(14)
(15)

respectively, where and \(\xi_{1 \times \tau }\) is the initial probability vector that depends on whether a zero-state or a steady-state mode analysis is of interest. \({\mathbf{I}}_{{\left( {\tau \times \tau } \right)}}\) is a \(\tau \times \tau\) identity matrix and \(1_{{\left( {\tau \times 1} \right)}}\) is a \(\tau \times 1\) column vector of ones.

Note that the zero-state and steady-state modes of analysis are used to characterize the short-term and long-term run-length characteristics of a monitoring scheme. Koutras et al. (2007) analysed the run-length of the runs-rules schemes based on probability-generating functions, whereas Low et al. (2012) designed runs-rules schemes using Eq. (14). Note that the \(E\left( N \right)\) defined in Eq. (12) is typically the most used metric on the performance of a monitoring scheme in SPM, and it is denoted by \({\textit{ARL}}\) in this study.

Initial probabilities vectors

The \(\xi_{1 \times \tau } = {\mathbf{q}}_{1 \times \tau } = \left( {0\;1\;0\; \ldots \;0} \right)\) is the row vector of initial probabilities associated with the zero-state case and it has a one in the component corresponding to the state in which the monitoring scheme begins and each of the other components of the vector are equal to zero. For the SSS, RSS and MSS synthetic schemes, the initial state corresponds to the element of the TPM equal to ‘±’ (i.e. \(\varphi\)), whereas for the NSS synthetic scheme, it corresponds to the element with ‘O’.

The \(\xi_{1 \times \tau } = {\mathbf{s}}_{1 \times \tau }\) is the row vector of initial probabilities associated with the steady-state case and its elements are non-zero. There are a number of method used to compute the \({\mathbf{s}}_{1 \times \tau }\), and this study focuses on one of the steady-state probability vector (SSPV) methods proposed by Champ (1992), which is defined by

$${\mathbf{s}} = \left( {1^{\prime}{\mathbf{z}}} \right)^{ - 1} \cdot {\mathbf{z}}$$
(16)

where \({\mathbf{z}}\) is the \(\tau\) × 1 vector with \({\mathbf{z}}_{{\left( {\tau \times 1} \right)}} = \left( {{\mathbf{G}} - {\mathbf{Q^{\prime}}}} \right)^{ - 1} {\mathbf{e}}_{j}\) and the matrix \({\mathbf{G}}\) in Champ (1992) can be generalized as \({\mathbf{G}} = {\mathbf{e}}_{j} \cdot 1^{\prime} + {\mathbf{I}}_{\tau \times \tau }\) where \({\mathbf{e}}_{j}\) is the jth unit vector corresponding to \({\mathbf{e}}_{1}\) for the one-sided as well as the two-sided NSS synthetic scheme and \(j\) corresponds to the element of the TPM equal to ‘±’ for the two-sided SSS, RSS and MSS synthetic schemes. For more details, see “Appendix”.

Performance study

Performance of the two-sided NSS and side-sensitive BTXII \(\bar{X}\) synthetic schemes for different values of H

A monitoring scheme is designed such that when the process is IC, the \({\textit{ARL}}_{0}\) is set at some desirable level (or equivalently, the significance level is set at some standard value). For instance, a significance level of size 0.0027, 0.0020 and 0.0010 (or equivalently, the \({\textit{ARL}}_{0} = 370.4\), 500 and 1000), the k-sigma limits of the basic design of the two-sided BTXII \(\bar{X}\) synthetic schemes are as given in Table 6 when h = 1, 2, 3, 4 and 5. For instance, when the \(\left( {M, S, c, q} \right)\) combination is given by (0.6447, 0.162, 4.8737, 6.1576) we found \(k =\) 1.94757, 2.01131 and 2.15251 so that the basic synthetic scheme yields an attained \({\textit{ARL}}_{0}\) value of 370.4, 500 and 1000, respectively. It can be observed that the value of \(k\) increases as H value increases. Moreover, for a given H value, the value of \(k\) increases as the nominal \({\textit{ARL}}_{0}\) value increases. For a given nominal \({\textit{ARL}}_{0}\) value, the larger the value of H, the more efficient the BTXII \(\bar{X}\) synthetic scheme.

Table 6 Optimal \(k\) values and (\({\textit{AEQL}}\) in brackets) of the synthetic BTXII \(\bar{X}\) scheme when the nominal \({\textit{ARL}}_{0}\) = 370.4, 500 and 1000 for different value of H (basic design)

The design parameters found in Table 6 are used to assess the OOC performance of the proposed scheme for a nominal \({\textit{ARL}}_{0}\) of 370.4. In Tables 7, 8, 9 and 10, the results of the best scheme are in italic. When two or several columns are in italic, the schemes under consideration perform similarly. Table 7 gives the IC and OOC zero-state and steady-state performance of the proposed synthetic scheme when H = 1, 2, 3, 4 and 5 as well as the overall performance with \(\delta_{\hbox{min} } = 0\) and \(\delta_{\hbox{max} } = 2.5\). Table 6 shows that the proposed synthetic scheme is efficient for large values of H (Fig. 3a). The bigger (smaller) the magnitude of a shift, the more (less) sensitive the proposed scheme is. For large shift, the \({\textit{ARL}}\) value converges towards 1. Figure 3b shows that the performance of the proposed synthetic scheme depends on the magnitude of the shifts and many other factors such as the choice of the design parameters. The design parameters are subject to minimum\({\textit{AEQL}}\). The smaller the \({\textit{AEQL}}\), the more reliable the design parameters. Regardless of the magnitude of the shift, the higher the value of H, the more efficient the scheme becomes (see Fig. 3a, b).

Table 7 IC and OOC zero-state and steady-state performance of the synthetic BTXII \(\bar{X}\) scheme for different value of H when \({\textit{ARL}}_{0}\) = 370.4 (basic design)
Table 8 IC and OOC zero-state and steady-state performance of the synthetic BTXII \(\bar{X}\) schemes when H = 1
Table 9 IC and OOC zero-state and steady-state performance of the synthetic BTXII \(\bar{X}\) schemes when H = 2
Table 10 IC and OOC zero-state and steady-state performance of the synthetic BTXII \(\bar{X}\) schemes when H = 3
Fig. 3
figure 3

Performance of the Synthetic BTXII \(\bar{X}\) scheme for different values of H (basic design)

Tables 8, 9 and 10 present on one hand the zero-state and steady-state performance of the NSS, SSS, RSS and MSS BTXII \(\bar{X}\) synthetic schemes with \(\delta\)  = 0 (0.2) 2 for H = 1, 2 and 3, respectively, when (\(M,S,n,c,q\)) = (0.5951, 0.1801, 5, 4, 6) referred to as “design 1” and (\(M,S,n,c,q\)) = (0.6447, 0.162, 5, 4.8737, 6.1576) referred to as “design 2”. On the other hand, Tables 8, 9 and 10 give the overall performance of the proposed synthetic schemes for \(\delta_{\hbox{min} } = 0\) and \(\delta_{\hbox{max} } = 2.5\). From Table 8 it can be seen that when H = 1, the zero-state and steady-state performance of the SSS, RSS and MSS synthetic schemes are equivalent. This can also be shown by the TPMs, which are similar (see Eq. 10). For both design 1 and 2, the side-sensitive schemes perform best. In terms of the overall performance, the proposed schemes perform better under design 1. From Tables 9 and 10 it can be observed that when H = 2 and 3, for both zero-state and steady-state mode, the MSS scheme performs better from small to moderate mean shifts (\(0 < \delta < 1.5\)). However, from large shifts onwards (\(\delta \ge 1.5\)), under the zero-state mode, all four schemes are equivalent (\({\text{ZSARL}}_{\delta } = 1\)) for both designs, whereas under the steady-state mode, for all four schemes, the \({\text{ZSARL}}_{\delta }\) values are closer to 2. In terms of the \({\textit{AEQL}}\) values, in zero-state mode, the MSS scheme performs best followed by the SSS scheme for H = 2, whereas when H = 3, the MSS scheme performs best followed by the RSS scheme.

Remarks 1

  • Unlike runs-rules, synthetic schemes perform better in zero-state mode compared to steady-state mode.

  • For large shifts, in zero-state mode, the \({\text{ZSARL}}\) values converge towards 1, whereas the \({\text{SSARL}}\) values are slightly smaller than 2.

Performance comparative study

In this section, the proposed schemes, that is, the NSS and side-sensitive synthetic BTXII \(\bar{X}\) schemes, are compared to the traditional (or classical) Shewhart-type \(\bar{X}\) counterparts using similar synthetic and runs-rules schemes (cf Shongwe and Graham 2017a, 2018; Malela-Majika et al. 2018a, b). For a fair comparison, the competitive schemes are investigated under symmetric (here we use the normal) and heavy-tailed distributions with a sample of size 5, (\(\delta_{\hbox{min} }, \;\delta_{\hbox{max} }\)) = (0, 2) and H = 3. Sherill and Johnson (2009) reported that schemes based on the Box–Cox and Johnson transformations would perform better when using non-normal data. Kilinc et al. (2012) showed that the Johnson \(S_{\text{B}}\) (i.e. unbounded form) distribution presents attractive properties in building models. Therefore, the proposed BTXII \(\bar{X}\) synthetic schemes are also compared to the well-known \(\bar{X}\) schemes for non-normal data based on the Box–Cox and Johnson \(S_{\text{B}}\) transformation under both heavy-tailed and symmetric distributions when H = 3. Moreover, the proposed BTXII \(\bar{X}\) synthetic schemes are also compared to memory-type control schemes such as the cumulative sum (CUSUM) and exponentially weighted moving average (EWMA) monitoring schemes.

The comparison of the proposed synthetic schemes and the well-known classical Shewhart \(\bar{X}\), \(\bar{X}\)-CUSUM and \(\bar{X}\)-EWMA schemes as well as the BTXII \(\bar{X}\)-CUSUM and \(\bar{X}\)-EWMA schemes is displayed in Fig. 4. To challenge Knoth (2016)’s claim about the NSS synthetic scheme, the proposed NSS, SSS, RSS and MSS synthetic schemes are compared to the classical and BTXII \(\bar{X}\)-CUSUM and \(\bar{X}\)-EWMA schemes. The comparison is done under symmetric and heavy-tailed distributions. Under symmetric distributions, and more precisely under the standard normal distribution, when the smoothing parameter λ of the classical \(\bar{X}\)-EWMA scheme is equal to 0.1 and 0.5, it is found that the optimal parameter L = 2.698 and 2.977 so that the attained ZSARL0 = 369.90 and 368.90, respectively, for a nominal ZSARL0 value of 370.4. Under heavy-tailed distributions, and more specifically under the GAM (1,1) distribution, the optimal parameters 2.698 and 2.977 yield ZSARL0 values of 271.40 and 77.20 when λ = 0.1 and 0.5, respectively. These results show that the \(\bar{X}\)-EWMA chart is not IC robust (which is what we expected to find) because the attained ZSARL0 values of 271.40 and 77.20 are far different from the nominal ZSARL0 value of 370.4. For the classical \(\bar{X}\)-CUSUM scheme, we found that the UCL value is equal to 13.26 so that the attained ZSARL0 value under the N (0,1) distribution is equal to 369.5. However, under the GAM (1,1) distribution, when UCL = 13.26, the \(\bar{X}\)-CUSUM scheme yields an attained ZSARL0 value of 301.27, which shows that the classical \(\bar{X}\)-CUSUM scheme is not IC robust as well (which is what we expected to find).

Fig. 4
figure 4

Synthetic BTXII \(\bar{X}\), classical and BTXII \(\bar{X}\), \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes performance comparison: aAEQL comparison of the synthetic BTXII and classical Shewhart \(\bar{X}\) schemes under zero-state mode, bAEQL comparison of the synthetic BTXII and classical Shewhart \(\bar{X}\) schemes under steady-state mode, c ZSARL comparison of the synthetic BTXII \(\bar{X}\) with both classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes under symmetric distribution and d ZSARL comparison of the synthetic BTXII \(\bar{X}\) with both classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes under heavy-tailed distribution

Table 11 shows that in zero-state mode, under heavy-tailed distributions, both the proposed MSS BTXII \(\bar{X}\) synthetic scheme (introduced in this paper) and MSS BTXII runs-rules \(\bar{X}\) schemes [proposed by Malela-Majika et al. (2018b)] outperform all other competing charts from small to moderate shifts. For large shifts, the proposed BTXII \(\bar{X}\) synthetic scheme and BTXII \(\bar{X}\) improved runs-rules scheme as well as the Johnson \(S_{\text{B}}\)\(\bar{X}\) synthetic scheme perform better regardless of the type of design (i.e. NSS, SSS, RSS and MSS designs). In steady-state mode, from small to moderate shifts, the MSS BTXII \(\bar{X}\) synthetic and MSS BTXII \(\bar{X}\) runs-rules schemes outperform all competing charts. For large shifts, the SSS, RSS and MSS BTXII \(\bar{X}\) improved runs-rules schemes are superior to all other competing charts.

Table 11 IC and OOC zero-state and steady-state performance comparison of the synthetic and runs-rules control schemes under heavy-tailed distributions when H = 3

Under symmetric distributions (see Table 12), for both zero-state and steady-state modes, the classical MSS Shewhart \(\bar{X}\) runs-rules and MSS synthetic \(\bar{X}\) scheme combined with an \(\bar{X}\) chart [proposed by Shongwe and Graham (2016)] outperform all other charts from small to moderate shifts. For large shifts, in zero-state mode, these charts are equivalent to the proposed BTXII \(\bar{X}\) synthetic schemes, the classical synthetic \(\bar{X}\) schemes [proposed by Shongwe and Graham (2017a)], the Johnson \(S_{\text{B}}\) synthetic schemes as well as the Box–Cox \(\bar{X}\) synthetic schemes. However, in steady-state mode, the control charts proposed by Shongwe and Graham (2017a) outperform the competing charts.

Table 12 IC and OOC zero-state and steady-state performance comparison of the synthetic and runs-rules control schemes under symmetric distributions when H = 3

From Fig. 4a, b, we can draw the following conclusions:

  • The proposed synthetic BTXII \(\bar{X}\) schemes outperform the traditional \(\bar{X}\) schemes.

  • The synthetic schemes are more sensitive in zero-state (small values of the AEQL).

  • The proposed NSS scheme is less sensitive when compared to other schemes.

  • In general, when the value of H increases, the sensitivity of synthetic BTXII \(\bar{X}\) scheme increases as well. After investigating the sensitivity of the proposed synthetic schemes, it is observed that increasing the value of H does not always increase the sensitivity of the schemes. For instance, for the NSS scheme, from \(H =\) 2 to 3, the sensitivity of the proposed NSS synthetic BTXII \(\bar{X}\) scheme decreases. The latter is shown by the AEQL value increasing from 33.13 to 34.83. Therefore, it is important to investigate the optimal value of H that increases the sensitivity of synthetic schemes.

  • In zero-state mode, the proposed synthetic BTXII \(\bar{X}\) schemes perform best under the SSS and MSS schemes when H = 2. Under the steady-state mode, the MSS scheme performs best for H = 3.

Figure 4c, d yields the following findings:

  • Under symmetric distributions, when H = 1 and 2, the classical and BTXII \(\bar{X}\)-EWMA scheme outperforms the NSS synthetic scheme for small values of λ under small and moderate shifts (see for instance, Fig. 4c for λ = 0.1). When λ increases, the NSS synthetic scheme outperforms both the classical and BTXII \(\bar{X}\)-EWMA scheme regardless of the size of the mean shifts (Fig. 4c when λ = 0.1).

  • Under heavy-tailed distributions, when H = 1 and 2, both classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes outperform the NSS synthetic scheme regardless of the values of λ for small and moderate shifts (Fig. 4d). For large shifts, the NSS \(\bar{X}\) synthetic scheme performs better than classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes.

  • Under symmetric distributions, when H = 1, the SSS, RSS and MSS synthetic schemes are equivalent and perform better than the classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes regardless of the size of the shifts.

  • Under heavy-tailed distributions, the SSS, RSS and MSS synthetic schemes outperform the classical \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes for two reasons, (1) they are IC robust and (2) yield small OOC ARL values. It can also be observed that the proposed SSS, RSS and MSS synthetic schemes are more sensitive than the BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes.

  • Under symmetric and heavy-tailed distributions, when H = 2, the proposed SSS, RSS and MSS \(\bar{X}\) synthetic schemes perform better than the classical \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes. In this case, the MSS scheme performs better than the SSS scheme and slightly better than the RSS scheme.

  • The BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes perform uniformly better than the classical \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes under symmetric and heavy-tailed distributions regardless of the size of the shift in the location parameter.

Illustrative example

In this section, a real-life example is given to illustrate the design and implementation of the proposed synthetic schemes using the dataset from Mahmoud and Aufy (2013) (see Table 13). The data represent the shaft diameter which is expected to be around 7.995 millimetres (mm). To assess the production process, measurements of twenty-five samples have been taken, each consist of five items from the final production stage for which a goodness of fit test for normality is rejected.

Table 13 Measurements of the shaft diameter

When H = 1, for both zero-state and steady-state modes, the control limits of the NSS and side-sensitive synthetic BTXII \(\bar{X}\) schemes are given by (\({\textit{LCL,}}\;{\textit{UCL}}\)) = (0.374, 0.6) and (0.38, 0.59), respectively. A plot of the charting statistics for H = 1 is shown in Fig. 5 (a). It can be seen that both NSS and side-sensitive schemes signal for the first time on the fourth subgroup. When H = 2, the control limits of the NSS and MSS synthetic BTXII \(\bar{X}\) schemes are given by (\({\textit{LCL,}}\;{\textit{UCL}}\)) = (0.37, 0.61) and (0.38, 0.6), respectively. A plot of the charting statistics for H = 3 is shown in Fig. 5 (b). It can be seen that the MSS scheme signals for the first time on the seventh subgroup while the NSS scheme does not issue a signal. This shows the superiority of the MSS scheme over the NSS scheme.

Fig. 5
figure 5

NSS and MSS synthetic BTXII \(\bar{X}\) schemes of the measurements of shaft diameter for both zero-state and steady-state modes

Summary and recommendations

In this paper, synthetic \(\bar{X}\) schemes for non-normal data were proposed as alternatives to the classical Shewhart-type and synthetic \(\bar{X}\) schemes when the assumption of normality fails to hold. It was observed that the proposed schemes outperform the classical ones in many cases, and present very interesting run-length characteristics under normal and non-normal distributions. It is highly recommended that practitioners, in the industries, and researchers make use of the proposed schemes instead of the classical schemes when the process is not stable or when there are doubts about the nature (or the shape) of the underlying process distribution. For the steady-state mode, when small and moderate shifts are of interest, the recommendation is to use side-sensitive synthetic schemes regardless of the size of the sample and H value. For the zero-state mode, for small and moderate shifts, the recommendation is to use side-sensitive synthetic schemes regardless of the value of H.

It must be noted that the use of synthetic schemes for large values of H is not recommended in practice because, in most of the cases, the dimension of the TPM increases exponentially as H increases. The design (or construction) of such schemes becomes cumbersome and sometimes unrealistic. Therefore, the recommendation is to use small values of H (say \(H \le 3\)) for which the schemes perform better.

The comparison of the proposed synthetic schemes with the \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes reveals that the SSS, RSS and MSS synthetic schemes outperform both classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes regardless of the size of the shift in the location parameter. The NSS synthetic scheme is inferior when compared to the classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes for small and moderate shifts in the location parameter. However, for large shifts, the proposed NSS synthetic scheme performs better than the classical and BTXII \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes. Therefore, we do not support Knoth (2016)’s claims of discarding synthetic schemes since the three schemes, namely the SSS, RSS and MSS synthetic schemes have very interesting ARL and AEQL properties over the classical Shewhart \(\bar{X}\), \(\bar{X}\)-EWMA and \(\bar{X}\)-CUSUM schemes.

It must also be observed that the classical Shewhart \(\bar{X}\) schemes are not IC robust and present some weakness in many situations. To fix this problem, flexible schemes such as BTXII Shewhart \(\bar{X}\) and nonparametric schemes may be used.

In future, we will consider the design non-side-sensitive and side-sensitive synthetic Shewhart-type \(\bar{X}\) schemes combined with a basic \(\bar{X}\) for non-normal data using the BTXII and Weibull distributions.