Introduction

Process capability indices have been used in industries to measure the performance of the processes. A process is called capable if the product meets customer expectations. There are copious studies on process capability estimations. Kotz and Johnson (2002) reviewed the studies on process capability indices during the years 1992–2000. Yum and Kim (2010) provided a bibliography of approximately 530 journal papers and books published during the years 2000–2009. The process capability indices C p, C pk and C pm are the most commonly used indices. The process capability index C p was proposed by Juran (1974). This process capability index cannot evaluate the capability of the process properly when the process mean and target value are not equal. Kane (1986) introduced C pk index to solve this problem. Hsiang and Taguchi (1985) proposed C pm index. The C pm process capability index considers the deviation of the process mean from its target value as well as the process variance. This index shows the ability of the process to manufacture products around the target value. Pearn et al. (1992) proposed the process capability index C pkm that have a same link between C p and C pk with C pm. Boyles (1994) proposed S pk index based on the process yield of the process. Sharma and Rao (2013) employed define, measure, analysis, improvement and control (DMAIC) approach to reduce the variation of the engine connecting rod machining process and as a result to improve the capability of the process. Also, Sharma and Rao (2014) used DMAIC approach to improve the capability of the engine crankshaft manufacturing process.

The process capability indices are proposed under the assumption that each stage is independent from the other stages. While in the multistage processes, the process in each stage is affected by the processes in the previous stages. Therefore, the capability of the process in each stage is dependent on the capability of the processes in the previous stages and the process capability indices calculate the overall capability of the process in this stage. Most of the researches on multistage processes are related to control chart and multistage process capability analyses are not studied much. Zhang (1980) was the first who designed a control charts for multistage processes. The cause-selecting control charts are developed by Zhang (1984) for multistage processes. Zhang (1985a, b, 1989, 1990, 1992) developed the cause-selecting control chart. Zhang (1990) proposed two kinds of process capability indices in each stage of multistage processes including total and specific process capability indices. Total process capability index calculates the process capability in each stage when the quality characteristic in the current stage is affected by the quality characteristics of precedent processes. Specific process capability index calculates the capability of the process in the current stage when the effects of precedent processes are omitted. The total and specific process capability indices are calculated by Eq. (1).

$$\begin{aligned} C_{\text{pt}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6\sigma_{t} }}, \\ C_{\text{ps}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6\sigma_{s} }}, \end{aligned}$$
(1)

where USL and LSL are the upper and lower specification limits, respectively, \( \sigma_{t} \) is the total standard deviation of the process in each stage and \( \sigma_{s} \) is the specific standard deviation of process that is obtained from cause-selecting control chart. Wade and Woodall (1993) proposed using prediction limits to improve the statistical performance of cause-selecting control charts. Yang (1997) proposed a new approach to compute the cost model for a two-stage process. Then, he designed economic X chart and cause-selecting control chart to monitor a two-stage process. Yang and Yang (2006) proposed an approach to monitor two-stage process when data are autocorrelated. Yang and Yeh (2011) proposed a cause-selecting control chart for the two-stage process with attribute data. Ghahyazi et al. (2014) examined the effects of cascade property in multistage process in monitoring simple linear profile in Phase II. Davoodi et al. (2015) monitored multistage Poisson count processes using exponentially weighted moving average (EWMA) and C control charts and proposed a changed point estimator to find the real time of a change in the process parameters. Pirhooshyaran and Niaki (2015) proposed a double-max multivariate exponentially weighted moving average (MEWMA) control chart to monitor the parameters of the multistage processes when data are multivariate and autocorrelated. Goodarzi et al. (2016) developed two cause-selecting control charts to monitor the censored lognormal reliability data in a three-stage process. Linn et al. (2002) proposed a method to determine the priority of the process variation reduction in multistage processes to improve the overall capability index. Chen et al. (2012) proposed a process capability index CPMPCI for the complex product machining process (multistage process) based on Taguchi loss function.

In the multistage processes, the process capability indices are calculated when the process is in control. The conditions of the processes are determined using case-selecting control charts. As mentioned former, in a two-stage process, the quality characteristic in the first stage affects the quality characteristic in the second stage referred to as the cascade property. Therefore, using the traditional process capability index to measure the capability of the second stage may provide misleading results due to ignoring the cascade effect. In other words, if one is going to measure the capability of the second stage specifically, she cannot use the traditional process capability indices on the quality characteristic in the second stage and she must use a statistic independent from the quality characteristic in the first stage. This is the main point which is not considered in any studies so far.

In this paper, we propose a method to eliminate the effects of the first stage on the quality characteristic of the second stage and calculate the specific capability of the second stage in a two-stage process. In our method, the process capability indices are calculated for the residuals in the second stage and these indices represents the specific capability of that stage instead of the overall capability. The residuals in the second stage are independent from the quality characteristic in the first stage; therefore, the process capability indices for the residuals calculate the specific capability of the second stage. In addition, a method is proposed to estimate the specification limits of the residuals based on the specification limits of the quality characteristics in the first and second stages. The performance of the proposed method is evaluated for two-stage processes with different mean and standard deviation values of the quality characteristic in the first and second stages. The structure of the paper is as follows: The proposed method is outlined in the following section. A simulation study to evaluate the performance of the proposed method is presented in the next section which is followed by a section on a real case study and finally conclusion is presented in the last section.

Proposed method

In the multistage processes, the process capability indices in each stage are affected by the capability of the processes in the previous stages. Therefore, in some cases the incapability of the process is the effect of incapability of its preceding processes. Hence, we attempt to estimate the process capability of each process independent of the capability of its previous processes. In our method, the specific process capability indices for each stage are calculated by measuring the process capability indices for the residuals. In the two-stage process, assumed that the parts in stage 1 feed into stage 2 and the quality characteristic in stage 1 is x that follows a normal distribution x ~ N(μ x , σ 2 x ). The quality characteristic of the second stage is defined as:

$$ y_{i} = \beta_{0} + \beta_{1} x_{i} + \varepsilon_{i} , $$
(2)

where \( \varepsilon_{i} \) follows normal distribution \( \varepsilon_{i} \) ~ N(0,σ 2 ε ) and the parameters \( \beta_{0} \) and \( \beta_{1} \) are estimated based on the analysis on historical data using Eqs. (3) and (4)

$$ \hat{\beta }_{0} = \bar{y} - \hat{\beta }_{1} \bar{x}, $$
(3)
$$ \hat{\beta }_{1} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} y_{i} \left( {x_{i} - \bar{x}} \right)}}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {x_{i} - \bar{x}} \right)^{2} }}, $$
(4)

where \( \bar{x} \), \( \bar{y} \) are the mean values of the quality characteristics in the first and second stages, respectively.

The mean and variance of the quality characteristic in the second stage can be expressed as:

$$ \mu_{y} = \hat{\beta }_{0} + \hat{\beta }_{1} \mu_{x} . $$
(5)
$$ \sigma_{y}^{2} = \hat{\beta }_{1}^{2} \sigma_{x}^{2} + \sigma_{\varepsilon }^{2} . $$
(6)

The Eqs. (5) and (6) show that the mean and variance of the quality characteristic in the second stage are affected by the mean and variance of the quality characteristics in the first stage. Therefore, the process capability indices for the quality characteristic in the second stage are affected by the quality characteristic in the first stage and calculate the overall capability of that stage. In the multistage processes, the residuals have been used to omit the effects of the previous stages. The residuals are not affected by the previous processes. Therefore, the process capability indices for the residuals calculate the specific process capability of the process in the second stage. The residuals are obtained by:

$$ e_{i} = y_{i} - \hat{y}_{i} , $$
(7)

where \( \hat{y}_{i} \) is the prediction value for the quality characteristic \( y_{i} \) and can be obtained as follows:

$$ \hat{y}_{i} = \hat{\beta }_{0} + \hat{\beta }_{1} x_{i} . $$
(8)

The specification limits are the allowable range of the quality characteristic that meet the customers’ expectations. The lower and upper specification limits are LSL and USL, respectively, and determined by design engineers or customers. The specification limits for the quality characteristics of the stages 1 and 2 are [LSL x USL x ] and [LSL y USL y ], respectively. To estimate the process capability of the second stage based on the residuals, we require the specification limits for the residuals. We used the specification limits of the quality characteristics in the first and second stages to obtain the specification limits for residuals. In our method, the specification limits for the residuals are determined using the process yield. The process yield for the quality characteristic x can be obtained by:

$$ P_{x} = {\text{pr}}\left( {{\text{LSL}}_{x} \le x \le {\text{USL}}_{x} } \right) = \varPhi \left( {\frac{{{\text{USL}}_{x} - \mu_{x} }}{{\sigma_{x} }}} \right) - \varPhi \left( {\frac{{{\text{LSL}}_{x} - \mu_{x} }}{{\sigma_{x} }}} \right), $$
(9)

where \( \varPhi ( \cdot ) \) is the cumulative density function of the standard normal distribution.

Similarly, the process yields of the quality characteristic y can be computed. The process yield of the quality characteristic y determines the process yield of the second stage when it is affected by the quality characteristic in the first stage. Since the residual and the quality characteristic x are independent, the process yield of the quality characteristic y can be calculated by Eq. (10) in the case that the specification limits of the residuals are known.

$$ P_{y} = P_{x} P_{e} - \left[ {1 - {\text{pr}}({\text{LSL}}_{y} \le \beta_{0} + \beta_{1} x + \varepsilon \le {\text{USL}}_{y} |{\text{LSL}}_{x} \le x \le {\text{USL}}_{x} , {\text{LSL}}_{e} \le \varepsilon \le {\text{USL}}_{e} )} \right], $$
(10)

where P x and P e are the process yields of the quality characteristic x and the residuals, respectively. The process yield of the residuals is calculated by Eq. (9).

In this paper, the specification limits of the residuals are determined under the following assumptions:

  1. 1.

    The mean values of the quality characteristics in the first and second stages are equal to target values.

  2. 2.

    The term \( {\text{pr}}({\text{LSL}}_{y} \le \beta_{0} + \beta_{1} x + \varepsilon \le {\text{USL}}_{y} |{\text{LSL}}_{x} \le x \le {\text{USL}}_{x} , {\text{LSL}}_{e} \le \varepsilon \le {\text{USL}}_{e} ) \) is equal to 1 when the process yields of the quality characteristic in the first stage and the residual are equal to 0.9973. Therefore, P y can be obtained by:

$$ P_{y} = P_{x} P_{e} . $$
(11)

The second assumption represents that the quality characteristic in the second stage should be located between its specification limits if the residuals and the quality characteristic in the first stage lie between their corresponding specification limits. The proposed method for determining the specification limits of the residuals consists of two steps. The first step is calculating the variance of the residuals under the assumption that the mean of the residuals is equal to the target value. The second step is determining the specification limits of the residuals.

The process capability indices C p, C pk and S pk for the residuals can be obtained using Eqs. (12), (13) and (14), respectively

$$ C_{\text{pe}} = \frac{{{\text{USL}}_{e} - {\text{LSL}}_{e} }}{{6\sigma_{e} }}, $$
(12)
$$ C_{\text{pke}} = { \hbox{min} }\left\{ {\frac{{{\text{USL}}_{e} - \mu_{e} }}{{3\sigma_{e} }}, \frac{{\mu_{e} - {\text{LSL}}_{e} }}{{3\sigma_{e} }}} \right\}, $$
(13)
$$ S_{\text{pke}} = \frac{1}{3}\varPhi^{ - 1} \left\{ {\frac{1}{2}\varPhi \left( {\frac{{{\text{USL}}_{e} - \mu_{e} }}{{\sigma_{e} }}} \right) + \frac{1}{2}\varPhi \left( {\frac{{\mu_{e} - {\text{LSL}}_{e} }}{{\sigma_{e} }}} \right)} \right\}, $$
(14)

where, USL e and LSLe are the specification limits of the residuals, \( \mu_{e} \) and \( \sigma_{e} \) are the mean and standard deviation of the residuals, respectively. The variance of the residuals is calculated by

$$ \sigma_{e}^{2} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {Y_{i} - \hat{Y}_{i} } \right)}}{n - 2} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} e_{i}^{2} }}{n - 2}. $$
(15)

Similarly, the C p, C pk and S pk indices can be calculated for the quality characteristics x and y.

Variance of the residuals

The standard deviation values of the quality characteristics x and y are determined under the assumption that the corresponding mean values are equal to target values and the process yields of the quality characteristic in the first stage and the residuals are 0.9973. Therefore, the process yield of the quality characteristic in the second stage is calculated by Eq. (11). The standard deviation values of the quality characteristics are obtained by Eqs. (16) and (17), respectively

$$ \sigma_{x} = \frac{{{\text{USL}}_{x} - T_{x} }}{{\varPhi^{ - 1} (0.99865)}} = \frac{{{\text{LSL}}_{x} - T_{x} }}{{\varPhi^{ - 1} (0.00135)}}. $$
(16)
$$ \sigma_{y} = \frac{{{\text{USL}}_{y} - T_{y} }}{{\varPhi^{ - 1} (0.9973)}} = \frac{{{\text{LSL}}_{y} - T_{y} }}{{\varPhi^{ - 1} (0.0027)}}. $$
(17)

The variance of the residuals based on the standard deviations of the quality characteristics x and y that are calculated by Eqs. (16) and (17), respectively, and obtained as follows:

$$ \sigma_{e}^{2} = \sqrt {\sigma_{y}^{2} - \hat{\beta }_{1}^{2} \sigma_{x}^{2} } = \sqrt {\left( {\frac{{{\text{USL}}_{y} - {\text{LSL}}_{y} }}{{\varPhi^{ - 1} (0.9973)}}} \right)^{2} - \hat{\beta }_{1}^{2} \left( {\frac{{{\text{USL}}_{x} - {\text{LSL}}_{x} }}{{\varPhi^{ - 1} (0.99865)}}} \right)^{2} } . $$
(18)

The specification limits for the residuals

Under the assumptions that the mean of the residuals is equal to the target value and the process yield of the residuals is 0.9973, the specification limits of the residuals are obtained as follows:

$$ {\text{USL}}_{e} = \sigma_{e} \varPhi^{ - 1} (0.99865). $$
(19)
$$ {\text{USL}}_{e} = \sigma_{e} \varPhi^{ - 1} (0.00135). $$
(20)

The target value for the residuals is zero.

Simulation studies

In this section, simulation studies are carried out to check the accuracy of the proposed method to determine the specification limits of the residuals and the process capability indices calculated based on them. We considered the two-stage process, the quality characteristic for the stage 1 is x and generated from a normal distribution as x ~ N(μ x , σ x ) and the quality characteristic for the second stage is y that is generated from the following equation:

$$ y_{i} = 7.86 + 0.5x_{i} + \varepsilon_{i} , $$

where \( \varepsilon_{i} \) follows a normal distribution as N(μ e , σ e ).

The specification limits for the quality characteristics x and y are set to [6.56, 19.73] and [9.46, 19.32], respectively. We assume that the process yields for the residuals and the quality characteristic x are 0.9973. Then, the process yield of the quality characteristic y is calculated from Eq. (11). The standard deviation values of the quality characteristics x and y are calculated as

$$ \sigma_{x} = \frac{19.73 - 13.145}{{\varPhi^{ - 1} (0.99865)}} = \frac{19.73 - 13.145}{2.99998} = 2.195 $$
$$ \sigma_{y} = \frac{19.32 - 14.39}{{\varPhi^{ - 1} (0.9973)}} = \frac{19.32 - 14.39}{2.78215} = 1.772 $$

The specification limits of the residuals are obtained by Eqs. (19) and (20). The lower and upper specification limits of the residuals are obtained as −4.1736 and 4.1736, respectively.

We calculate the C p, C pk, S pk and the process yields for the residuals and the quality characteristics x and y under different sample sizes, mean and variance values of the residuals and the quality characteristic x. For each simulated case, the true values of the process capability indices are calculated. The true values of the process capability indices are calculated based on the true values of the mean and variance of the quality characteristics in stage 1, stage 2 and the residuals as shown in Tables 1, 2, and 3. The process capability indices for each case are calculated for different sample sizes n = 25, 50, 100, 200. The mean values and variances of the quality characteristics in the first and second stages and the residuals are estimated from the random samples and then the process capability indices are calculated. These simulations were repeated 10,000 times and the process capability indices and process yields were calculated in each replicate. The mean and the true values of the C p, C pk and S pk indices for different cases under different sample sizes are shown in Tables 1, 2 and 3, respectively.

Table 1 C p index for the residuals and the quality characteristics x and y under different sample sizes and simulated case
Table 2 C pk index for the residuals and the quality characteristics x and y under different sample sizes and simulated case
Table 3 S pk index for the residuals and the quality characteristics x and y under different sample sizes and simulated case

Tables 1, 2 and 3 show that when the sample size increases the mean values of process capability indices approach to the corresponding true values. Tables 1, 2 and 3 show the result of using the proposed method for different simulation cases. In the first two-stage process, the processes in the first and second stages are capable. In this condition, the process capability indices for residuals is greater than 1, therefore, the process in the second stage is capable. In the second process, the variance of the quality characteristic in the first stage is large and the process in the first stage is not capable and it affects the capability of the process in the second stage and makes it incapable. While the process capability indices for the residuals are greater than 1. It shows that the incapability of the process in the second stage is the effects of the incapability of the process in the first stage. In the third case, the process in the first stage is capable but the process in the second stage is not capable. The process capability indices for the residuals show that the process in the second stage is not capable. In the fourth process, the deviation of the mean value of the process in the first stage from the target value is large, therefore, the C pk index for the x and y quality characteristics show that the process in the first stage and the process in the second stage are not capable. Moreover, C pk index for the residuals are greater than 1. The other simulation cases can be interpreted similarly. Table 4 shows the accuracy of the second assumption.

Table 4 Process yield of the residuals and the quality characteristics x and y under different sample sizes and simulated case

Table 4 shows that the process yield of the quality characteristic y, that is obtained using Eq. (11), accurately estimates the true value of the process yield of the quality characteristic y properly when the process in the first stage and the residuals are capable. When the sample size increases, the gap between the estimated value of the process yield of the quality characteristic y decreases for all cases.

A real case

In this section, the proposed method is used to calculate the specific capability of the process in the second stage of the automobile brake system measurement process. This case was discussed by Constable et al. (1988) and Linn et al. (2002). In this case, the quality characteristics for the first and second stages are ROLLWT and BAKEWT, respectively. The sample size is 70 and the first 45 observations were used to estimate the parameters \( \hat{\beta }_{0} \) and \( \hat{\beta }_{1} \) using Eqs. (2) and (3), respectively. Based on the parameters \( \hat{\beta }_{0} \) and \( \hat{\beta }_{1} \), the relationship between the quality characteristics in the first and second stage is defined as:

$$ \hat{y} = 104.325 + 0.460456x, $$

where, x = RollWT and y = BAKEWT.

The specification limits for the quality characteristics in the first and second stages are [202.38, 218.10] and [194.91, 207.35], respectively. The mean and standard deviation values for the residuals and the quality characteristics in the first and second stages based on the last 25 observations are shown in Table 5.

Table 5 The mean and standard deviation values for the residuals and the quality characteristics in the first and second stages

We assumed that the process yields of the residuals and the quality characteristic x are 0.9973. Then, the process yield of the quality characteristic y is calculated by Eq. (11). The lower and upper specification limits of the residuals were obtained using Eqs. (19) and (20), respectively. The specification limits of the residuals are [−5.67, 5.67]. The process capability indices C p, C pk and S pk for the residuals and the quality characteristics in the first and second stages are shown in Table 6.

Table 6 The process capability indices C p, C pk and S pk for the residuals and the quality characteristics in the first and second stages

Table 6 shows that the processes in the first stage and whole process are capable. The process capability indices for the residuals show the specific capability of the process in the second stage. It shows that the process in the first stage is less capable than the process in the second stage. This is an import result, as the manufacturer knows which process should be invested and improved.

Conclusion and future researches

In this paper, a method was proposed to calculate the specific capability indices for the process in the second stage of a two-stage process. The process in the second stage is affected by the process in the first stage; therefore, sometimes the incapability in the second stage is a result of the incapability of the first stage. We omitted the effect of the quality characteristic of the first stage on the quality characteristic of process in the second stage using the residuals. The process capability for the residuals shows the specific process capability of the process in the second stage. In addition, we proposed a method to determine the specification limits of the residuals based on the specification limits of the quality characteristics in the first and second stages. The simulation studies and the real case study showed that the process capability indices for the residuals calculate the specific process capability of the process in the second stage properly. This method can be extended to be used for the multistage processes with more than two stages. Future studies may also include calculating specific process capability indices for the multistage process when each stage has several correlated quality characteristics. The linear relation between quality characteristics is considered in this paper. Therefore, calculating specific process capability indices when the relations among quality characteristics are non-linear can be considered as future research.