Bayesian change point estimation in Poisson-based control charts

Assareh, Hassan; Noorossana, Rassoul; L Mengersen, Kerrie

doi:10.1186/2251-712X-9-32

Original research
Open access
Published: 20 November 2013

Volume 9, article number 32, (2013)

Journal of Industrial Engineering International

Hassan Assareh¹,
Rassoul Noorossana² &
Kerrie L Mengersen³

Abstract

Precise identification of the time when a process has changed enables process engineers to search for a potential special cause more effectively. In this paper, we develop change point estimation methods for a Poisson process in a Bayesian framework. We apply Bayesian hierarchical models to formulate the change point where there exists a step change, a linear trend and a known multiple number of changes in the Poisson rate. The Markov chain Monte Carlo is used to obtain posterior distributions of the change point parameters and corresponding probabilistic intervals and inferences. The performance of the Bayesian estimator is investigated through simulations and the result shows that precise estimates can be obtained when they are used in conjunction with the well-known c-, Poisson exponentially weighted moving average (EWMA) and Poisson cumulative sum (CUSUM) control charts for different change type scenarios. We also apply the Deviance Information Criterion as a model selection criterion in the Bayesian context, to find the best change point model for a given dataset where there is no prior knowledge about the change type in the process. In comparison with built-in estimators of EWMA and CUSUM charts and ML based estimators, the Bayesian estimator performs reasonably well and remains a strong alternative. These superiorities are enhanced when probability quantification, flexibility and generalizability of the Bayesian change point detection model are also considered.

Background

Statistical process control charts are used to detect changes in a process by distinguishing between assignable causes and common causes of the process variation. When a control chart signals, process engineers initiate a search to identify and eliminate the source of variation. Knowing the time at which the process began to vary, the so-called change point, would help to conduct the search more efficiently in a tighter time-frame.

A Poisson process is often used to model the number of occurrences in an interval of time. In this regard, Poisson-based control charts have been developed and frequently applied in an industry context to monitor the number of defects and nonconformities in a product (Gardiner and Montgomery 1987; White et al. 1997) and in a health context to monitor patient mortality and spread of an infection in a hospital (Benneyan 1998; Limayea et al. 2008). The most commonly used control chart procedures adopted for Poisson-distributed data include c-charts (Shewhart 1926, 1927), cumulative sum of quality characteristic measurement (CUSUM; Page 1954, 1961; Brook and Evans 1972), and exponentially weighted moving average (EWMA; Roberts 1959; Trevanich and Bourke 1993; Borror and Rigdon 1998); see Woodall (1997) and Montgomery (2008) for more details. Furthermore, appropriate control charts and methods were developed for monitoring more complex Poisson data including correlated (Chiu and Kuo 2007; Niaki and Abbasi 2008; Niaki and Nafar 2008; Amiri et al. 2011) and auto-correlated observations (Weiß 2007; Vermaat et al. 2008).

It has been shown that Poisson CUSUM and Poisson EWMA charts are more sensitive for detecting small shifts in the process parameters whereas a c-chart still remains efficient for the detection of large shifts (Montgomery 2008). However, upon signaling, none of them provide specific information regarding the time at which the process changed and the magnitude and the type of the change.

In recent years, statistical and machine-learning methods have been employed in the development of change point estimators for a broad range of processes and change types (Amiri and Allahyari 2012; Atashgar 2013). For Poisson processes, maximum likelihood (ML) methods were applied to estimate the true time of a step change (Samuel and Pignatiello 1998; Perry 2004) and a linear trend (Perry et al. 2006) in the Poisson rate. It was shown that more precise estimates were obtained when ML estimators were used in conjunction with Poisson control charts, compared to charts’ signals and CUSUM (Page 1954) or EWMA built-in estimators (Nishina 1992). A confidence interval on the estimated change point was also constructed (Perry 2004; Perry et al. 2006). Furthermore, Perry et al. (2007a) challenged the underlying assumption of knowing the form of change types and derived an ML estimator for non-decreasing multiple step change points (unknown number of consecutive changes) using isotonic regression models. The estimator was reported to be a reasonable alternative for some magnitudes of the step and linear trend disturbances. In the presence of multiple change points, it was found to be superior. ML estimators have also been extended for step change scenarios in correlated Poisson observations (Niaki and Khedmati 2012; 2013a; Sharafi et al. 2013). Similar methods were extended to other attributes including binary data (Perry et al. 2007b; Noorossana et al. 2009; Amiri et al. 2011; Hou et al. 2013; Niaki and Khedmati 2013b).

A Bayesian formulation has recently been proposed as an alternative in change point estimation within a clinical context (Assareh et al. 2011a). It can easily capture complexity of patient mix and provide highly informative and precise estimates for the true time of a step change (Assareh et al. 2011c) or linear trend (Assareh et al. 2011b) in the odds ratio of clinical outcomes or mean survival time following a clinical procedure (Assareh and Mengersen 2012). Application of the Bayesian framework to change point estimation provides a way of making a set of inferences based on posterior distributions for the time and the magnitude of a change as well as assessing the validity of underlying assumptions in the change point model itself (Gelman et al. 2004).

In this paper, we model the change point in a Poisson process using a Bayesian framework and compare the performance of the Bayesian estimator with ML estimators. We model and estimate change points assuming that the underlying change type is known. In this scenario, the changes are in the form of a step change, a linear trend and a multiple change with known number of changes. For each model, we analyze and discuss the performance of the Bayesian change point model through posterior estimates and probability-based intervals. The three models are demonstrated and evaluated in sections ‘Bayesian change point model’, ‘Evaluation’, and ‘Performance analysis’ and then compared with respect to goodness of fit in section ‘Comparative performance and model selection’. We then compare the Bayesian estimator with ML estimators and others in section ‘Comparison of Bayesian estimator with other methods’ and summarize the study and obtained results in section ‘Conclusion’.

Bayesian change point model

Statistical inferences for a quantity of interest in a Bayesian framework are described as the modification of the uncertainty about its value in the light of evidence, and Bayes’ theorem precisely specifies how this modification should be made, as below:

$$\text{Posterior} \propto \text{Likelihood} \times \text{Prior}, \tag{1}$$

where ‘Prior’ is the state of knowledge about the quantity of interest in terms of a probability distribution before the data are observed; ‘Likelihood’ is a model underlying the data, and ‘Posterior’ is the state of knowledge about the quantity after data are observed, which is also in the form of a probability distribution. This structure is extendable to multiple levels in a hierarchical fashion, so-called Bayesian hierarchical models (BHM), which allows the model to be enriched by capturing all kinds of uncertainties for data observed as well as priors. In complicated BHMs, it is not easy to obtain the posterior distribution analytically. This analytic bottleneck has been eliminated by the emergence of Markov chain Monte Carlo (MCMC) methods. In MCMC algorithms, a Markov chain, also known as a random walk, is constructed whose stationary distribution is the posterior distribution of the parameters. Samples generated from a long run of the Markov chain using a proposal transition density are drawn from posterior distributions of interest. Some common MCMC methods for drawing samples include Metropolis-Hastings and the Gibbs sampler (see Gelman et al. (2004) for more details).

Consider a Poisson process X_t, t=1,…,T, that is initially in-control, with independent observations coming from a Poisson distribution with a known rate λ₀. At an unknown point in time, τ, the Poisson rate parameter changes from its in-control state of λ₀ to λ₁, λ₁=λ₀+δ,δ≠0. The Poisson process step change model can thus be parameterized as follows:

$$p(x_t \mid \lambda_t) = \begin{cases} \exp(-\lambda_0)\, \lambda_0^{x_t} / x_t! & \text{if } t = 1, 2, \dots, \tau \\ \exp(-\lambda_1)\, \lambda_1^{x_t} / x_t! & \text{if } t = \tau + 1, \dots, T, \end{cases} \tag{2}$$

where δ is the magnitude of the step change and τ and T are the change time and the current time, respectively.

The departure from the in-control state may occur due to a non-constant change type scenario which can be explained by a linear trend model λ_t=λ₀+β(t−τ) for t>τ. In this model, β is the magnitude of the linear trend disturbance (slope) in the process parameter and its positive value implies an increasing trend in which λ_t>λ₀, while a negative β leads to a linear reduction of the Poisson rate and λ_t<λ₀ for t=τ+1,…,T. The Poisson process linear trend change model can be modelled as follows:

$$p(x_t \mid \lambda_t) = \begin{cases} \exp(-\lambda_0)\, \lambda_0^{x_t} / x_t! & \text{if } t = 1, 2, \dots, \tau \\ \exp\bigl(-(\lambda_0 + \beta (t - \tau))\bigr)\, (\lambda_0 + \beta (t - \tau))^{x_t} / x_t! & \text{if } t = \tau + 1, \dots, T. \end{cases} \tag{3}$$

In order to address the possibility of having change types other than step and linear trend forms (Perry et al. 2007a), we introduce a multiple change point scenario where the number of change points is known. This prior knowledge might have been obtained based on awareness and past experience of process engineers in factors such as changes in operators, materials, procedures, tools, and policies which may lead to increasing or decreasing step changes in the Poisson rate. Here, we consider the case of two sequential step changes. Other cases with more than two change points can be modelled in the same way. In this scenario, at an unknown point in time, τ₁, the Poisson rate parameter changes from its in-control state of λ₀ to λ₁, λ₁=λ₀+δ₁,δ₁≠0. For a period of time, the process continues with the new parameter, λ₁, and then at an unknown point in time, τ₂, it changes to λ₂, λ₂=λ₀+δ₂,δ₂≠δ₁≠0. The Poisson process multiple change point model with two step changes can thus be parameterized as follows:

$$p(x_t \mid \lambda_t) = \begin{cases} \exp(-\lambda_0)\, \lambda_0^{x_t} / x_t! & \text{if } t = 1, 2, \dots, \tau_1 \\ \exp(-\lambda_1)\, \lambda_1^{x_t} / x_t! & \text{if } t = \tau_1 + 1, \dots, \tau_2 \\ \exp(-\lambda_2)\, \lambda_2^{x_t} / x_t! & \text{if } t = \tau_2 + 1, \dots, T. \end{cases} \tag{4}$$
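The three change models in Equations 2 to 4 differ only in the path of the Poisson rate over time. The following Python sketch builds that rate path for each model and evaluates the corresponding log-likelihood. The function names and the example values are illustrative assumptions and are not taken from the WinBUGS code referred to in the Appendix.

```python
import math


def step_rates(T, lam0, tau, delta):
    """Rate path of the step change model (Equation 2)."""
    return [lam0 if t <= tau else lam0 + delta for t in range(1, T + 1)]


def trend_rates(T, lam0, tau, beta):
    """Rate path of the linear trend model (Equation 3)."""
    return [lam0 if t <= tau else lam0 + beta * (t - tau) for t in range(1, T + 1)]


def two_step_rates(T, lam0, tau1, tau2, delta1, delta2):
    """Rate path of the model with two step changes (Equation 4)."""
    rates = []
    for t in range(1, T + 1):
        if t <= tau1:
            rates.append(lam0)
        elif t <= tau2:
            rates.append(lam0 + delta1)
        else:
            rates.append(lam0 + delta2)
    return rates


def log_likelihood(x, rates):
    """Sum of Poisson log-probabilities of counts x under the given rate path."""
    return sum(-lam + xt * math.log(lam) - math.lgamma(xt + 1)
               for xt, lam in zip(x, rates))


# Hypothetical example: slope 0.5 starting after t = 3
example_rates = trend_rates(5, 20.0, 3, 0.5)
```

Any of the three rate paths can be passed to `log_likelihood`, so candidate change times and magnitudes can be compared on the same data.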

Relating the above models to Equation 1, p(·∣·) is the likelihood that underlies the observations, and posterior distributions of the time (τ, τ₁, τ₂) and the magnitude of change (δ, β, δ₁, δ₂) will be constructed and investigated, as they are the unknown parameters of interest in the change point analysis. Assume that the process X_t is monitored by a control chart that signals at time T. We assign a zero-mean normal distribution with a standard deviation of $6 \times \sqrt{\lambda_{0}}$ as a prior distribution for all change sizes (δ, β, δ₁, δ₂). This is a reasonably informative prior for the magnitude of the change in an in-control Poisson rate as the control chart is sensitive enough to detect very large shifts and estimate associated change points. Other distributions such as uniform or Gamma might also be of interest; see Gelman et al. (2004) for more details on selection of prior distributions. We place a uniform distribution on the range of (1, T−1) as a prior for the time of the change (τ, τ₁, τ₂). To avoid obtaining a negative value for the process mean after a change, λ₁₍₂₎, within MCMC, particularly when a drop has occurred, we added a constraint such that λ₁₍₂₎ must be positive. Although other methods such as modelling the process on the log scale may be of interest, we do not pursue these here as we may lose simplicity and explicit or correct reflection of the Poisson process. See the Appendix for the change model codes in WinBUGS (Spiegelhalter et al. 2003).
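As a rough illustration of how the step change likelihood (Equation 2) combines with these priors, the following Python sketch evaluates the joint posterior of (τ, δ) on a grid instead of by MCMC. The grid approximation, the function names and the toy data are assumptions made for illustration; this is not the WinBUGS model referred to in the Appendix.

```python
import math


def poisson_logpmf(x, lam):
    """Log of the Poisson probability mass function at count x."""
    return -lam + x * math.log(lam) - math.lgamma(x + 1)


def step_change_posterior(x, lam0, delta_grid):
    """Grid approximation to the posterior of (tau, delta) for a step change.

    Priors follow the text: tau uniform on 1..T-1 and delta normal with mean 0
    and SD 6*sqrt(lam0), restricted so that lam0 + delta > 0.
    The grid over delta is an assumption of this sketch.
    """
    T = len(x)
    prior_sd = 6.0 * math.sqrt(lam0)
    # pre[t] = log-likelihood of x_1..x_t under the in-control rate lam0
    pre = [0.0]
    for xt in x:
        pre.append(pre[-1] + poisson_logpmf(xt, lam0))

    log_post = {}
    for j, delta in enumerate(delta_grid):
        lam1 = lam0 + delta
        if delta == 0 or lam1 <= 0:
            continue  # delta must be non-zero and the new rate positive
        log_prior = -0.5 * (delta / prior_sd) ** 2
        # suf[t] = log-likelihood of x_{t+1}..x_T under the shifted rate lam1
        suf = [0.0] * (T + 1)
        for t in range(T - 1, -1, -1):
            suf[t] = suf[t + 1] + poisson_logpmf(x[t], lam1)
        for tau in range(1, T):
            log_post[(tau, j)] = pre[tau] + suf[tau] + log_prior

    # Normalise after subtracting the maximum to avoid numerical underflow
    top = max(log_post.values())
    weights = {key: math.exp(v - top) for key, v in log_post.items()}
    total = sum(weights.values())
    tau_post = {tau: 0.0 for tau in range(1, T)}
    delta_post = [0.0] * len(delta_grid)
    for (tau, j), w in weights.items():
        tau_post[tau] += w / total
        delta_post[j] += w / total
    return tau_post, delta_post


# Toy data (not from the paper): counts move from 20 to 30 after t = 30
x = [20] * 30 + [30] * 20
delta_grid = [d / 2.0 for d in range(-20, 31)]  # -10.0, -9.5, ..., 15.0
tau_post, delta_post = step_change_posterior(x, lam0=20.0, delta_grid=delta_grid)
tau_mode = max(tau_post, key=tau_post.get)
delta_mode = delta_grid[max(range(len(delta_grid)), key=lambda j: delta_post[j])]
```

The marginal posteriors `tau_post` and `delta_post` play the role of the MCMC output in the paper; their modes correspond to the point estimates discussed in the performance analysis.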

Evaluation

We used Monte Carlo simulation to study the performance of the constructed BHMs in change estimation following a signal from c-, Poisson CUSUM, and Poisson EWMA control charts when a change (step, linear, multiple) is simulated to occur at τ=100. We generated 100 observations of a Poisson process with an in-control rate of λ₀=20. To investigate the behavior of the Bayesian estimators over the population for different change sizes, we replicated this simulation method 100 times. Simulated datasets that were obvious outliers were excluded. This setting gives a distribution of estimates with standard errors in the order of 10. The number of replication studies is a compromise between excessive computational time, considering MCMC iterations, and sufficiency of the achievable distributions, even for tails.

In the step change scenario, we induced step changes of sizes δ={+2,+6} as an example and δ={±2,±6,±15} for a replication study until the control charts signalled. In the linear trend model, changes of slopes β={±0.5,±1.0,±2.0} were induced until the control charts signalled. For the multiple change point case, two consecutive changes are simulated to occur at (τ₁,τ₂)=(100,110). We induced two changes of sizes (δ₁,δ₂)={(±4,±8),(±4,±12)} as part of a replication study at the determined times of change (τ₁,τ₂) until the control charts signalled. In this scenario, the replication study was limited to the c-chart, since other control charts mostly signalled prior to the induction of the second change point.

Because we know that the process is in-control, if an out-of-control observation was generated in the simulation of the first 100 in-control observations, it was taken as a false alarm and the simulation was restarted. However, in practice, a false alarm may lead to stopping the process and analyzing root causes. When no cause is found, the process would continue without adjustment. Furthermore, for the multiple change scenario, if in any simulation the charts signalled before the second change was simulated, that simulation was terminated and not followed. The simulation was also repeated for rate parameters of 5 and 10 over equivalent change scenarios; since the results were similar to those obtained for λ₀=20, they are not reported here.
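The simulation scheme for a step change monitored by a c-chart can be sketched in Python as follows. The three-sigma limits λ₀ ± 3√λ₀, the simple Poisson sampler, the seed, and all function names are assumptions of this sketch; the original study was carried out in R.

```python
import math
import random


def rpois(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def simulate_step_change(lam0=20.0, delta=6.0, tau=100, seed=1, max_len=10000):
    """Simulate tau in-control counts, then shifted counts until a c-chart signals.

    Returns the series and the signal time (None if no signal within max_len).
    """
    rng = random.Random(seed)
    ucl = lam0 + 3.0 * math.sqrt(lam0)            # assumed 3-sigma upper limit
    lcl = max(0.0, lam0 - 3.0 * math.sqrt(lam0))  # assumed 3-sigma lower limit
    # Restart the in-control run whenever it contains a false alarm
    while True:
        x = [rpois(lam0, rng) for _ in range(tau)]
        if all(lcl <= xi <= ucl for xi in x):
            break
    # Induce the step change and continue until the chart signals
    while len(x) < max_len:
        xi = rpois(lam0 + delta, rng)
        x.append(xi)
        if xi > ucl or xi < lcl:
            return x, len(x)
    return x, None


x, signal_time = simulate_step_change(lam0=20.0, delta=6.0, tau=100, seed=1)
```

The returned series up to `signal_time` is what would be passed to a change point model, with `signal_time` playing the role of T.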

To construct control charts, we applied the Shewhart (1926, 1927), Brook and Evans (1972), and Trevanich and Bourke (1993) procedures for c-, Poisson CUSUM, and Poisson EWMA control charts, respectively. A Poisson CUSUM accumulates the difference between an observed value and a reference value k through $S_{i}^{+} = \max \{0, x_{i} - k^{+} + S_{i - 1}^{+}\}$ and $S_{i}^{-} = \max \{0, k^{-} - x_{i} + S_{i - 1}^{-}\}$, where $k^{+} = (\lambda_{1}^{+} - \lambda_{0}) / (\ln \lambda_{1}^{+} - \ln \lambda_{0})$ and $k^{-} = (\lambda_{0} - \lambda_{1}^{-}) / (\ln \lambda_{0} - \ln \lambda_{1}^{-})$. If $S_{i}^{\pm}$ exceeds a specified decision interval $h^{\pm}$, then the control chart signals that an increase (a decrease) in the Poisson rate has occurred. We calibrated the charts to detect a 25% shift in Poisson rates and have an in-control average run length ($\widehat{ARL}_{0}$) of approximately 370, close to a standard c-chart (see Woodall and Adams (1993)). The resultant Poisson CUSUM charts had (k⁺,h⁺)=(22.4,22) and (k⁻,h⁻)=(17.4,14). For simplicity, the values were rounded to one decimal place.
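The reference values quoted above can be checked directly from the formula for k, and the CUSUM recursion is short enough to sketch in Python. The function names and the toy series are illustrative assumptions; the decision intervals are the ones reported in the text.

```python
import math


def cusum_reference(lam0, lam1):
    """Reference value k for detecting a shift in the Poisson rate from lam0 to lam1."""
    return (lam1 - lam0) / (math.log(lam1) - math.log(lam0))


def poisson_cusum(x, k_plus, h_plus, k_minus, h_minus):
    """Run a two-sided Poisson CUSUM and return (signal index, direction) or None."""
    s_plus = 0.0
    s_minus = 0.0
    for i, xi in enumerate(x, start=1):
        s_plus = max(0.0, xi - k_plus + s_plus)      # accumulates evidence of an increase
        s_minus = max(0.0, k_minus - xi + s_minus)   # accumulates evidence of a decrease
        if s_plus > h_plus:
            return i, "increase"
        if s_minus > h_minus:
            return i, "decrease"
    return None


# A 25% shift from lam0 = 20 in either direction
k_plus = cusum_reference(20.0, 25.0)
k_minus = cusum_reference(20.0, 15.0)

# Toy series (not from the paper): five counts of 20 followed by counts of 30
result = poisson_cusum([20] * 5 + [30] * 10, 22.4, 22.0, 17.4, 14.0)
```

Rounded to one decimal place, `k_plus` and `k_minus` reproduce the values 22.4 and 17.4 given above.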

In a Poisson EWMA, cumulative values of observations are obtained through $Z_{i} = r \times x_{i} + (1 - r) \times Z_{i - 1}$, where Z₀=λ₀, and plotted in a chart with $UCL = \lambda_{0} + A^{+} \sqrt{Var(Z_{i})}$ and $LCL = \lambda_{0} - A^{-} \sqrt{Var(Z_{i})}$. We let r=0.1 and A^±=2.67 to build a chart with an ARL₀ of 370, close to a standard c-chart.
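A minimal Python sketch of such a chart is given below, using the standard EWMA recursion Z_i = r·x_i + (1 − r)·Z_{i−1}. The text does not state the form of Var(Z_i); the sketch assumes the usual exact variance for independent observations with variance λ₀, namely λ₀·r/(2 − r)·(1 − (1 − r)^{2i}). Function and variable names are illustrative.

```python
import math


def poisson_ewma(x, lam0, r=0.1, a_plus=2.67, a_minus=2.67):
    """Run a Poisson EWMA chart and return (signal index, direction) or None."""
    z = lam0  # Z_0 = lam0
    for i, xi in enumerate(x, start=1):
        z = r * xi + (1.0 - r) * z
        # Assumed exact variance of Z_i for independent Poisson(lam0) counts
        var_z = lam0 * r / (2.0 - r) * (1.0 - (1.0 - r) ** (2 * i))
        ucl = lam0 + a_plus * math.sqrt(var_z)
        lcl = lam0 - a_minus * math.sqrt(var_z)
        if z > ucl:
            return i, "increase"
        if z < lcl:
            return i, "decrease"
    return None


# Toy series (not from the paper): a sustained jump to 30 from the first sample
result = poisson_ewma([30] * 20, lam0=20.0)
```

With a smaller r the statistic reacts more slowly but smooths more strongly, which is why the chart is better suited to small shifts than the c-chart.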

All changes and control charts were simulated in R. To obtain posterior distributions of the time and the magnitude of the changes, we used the R2WinBUGS interface (Sturtz et al. 2005) to generate 100,000 samples through MCMC iterations in WinBUGS (Spiegelhalter et al. 2003) for all change point scenarios, with the first 20,000 samples ignored as burn-in. We then analyzed the results using the CODA package in R (Plummer et al. 2010). See the Appendix for the change point model codes in WinBUGS.

Performance analysis

Step change model

The posterior distributions for the time and the magnitude of a step change of size +6 are presented in Figure 1. For all control charts, posterior distributions of the change point concentrate on the 100th sample which is the real change point. Since the posteriors are asymmetric and skewed, particularly for the time of the change, the posterior mode is used as an estimator for change point model parameters (τ,δ).

Table 1 shows the posterior estimates for increases of sizes +2 and +6 in the process mean. The c-chart detects a rise of around half a standard deviation (δ=+2) in the Poisson rate after 101 samples, where the mode of the posterior distribution reports the 101st sample as the change point. For a medium-shift size, δ=+6, around one-and-a-half standard deviations, the posterior mode concentrates on the 100th sample whereas the c-chart signals with a delay of 38 samples. The Poisson EWMA chart detects the shifts +2 and +6 after 42 and 13 samples, where the posterior distributions report the 103rd and 100th samples as the change points, respectively. This result implies that although the obtained posterior modes overestimate the change point for small shifts, they still perform relatively better than the Poisson EWMA chart. The resultant posteriors from a Poisson CUSUM are almost identical to those from Poisson EWMA. Bayesian estimates of the magnitude of the change tend to estimate small shifts almost precisely. However, the medium shift sizes are underestimated, although this slight bias must be considered in the context of their corresponding standard deviations.

Table 1 Posterior estimates (mode, SD) of step change point model parameters τ and δ following signals (RL)

Applying the Bayesian framework enables us to construct probability-based intervals around estimated parameters. A credible interval (CI) is a posterior probability-based interval which involves those values of highest probability in the posterior density of the parameter of interest. Table 2 presents 50% and 80% credible intervals for the estimated time and the magnitude of step changes in all three control charts. As expected, the CIs are affected by the dispersion and higher order behaviors of the posterior distributions. Under the same probability of 0.8 for the c-chart, the CI for the time of the step change of size δ=+2 covers 53 samples around the 100th sample whereas it decreases to 6 samples for δ=+6 due to the smaller standard deviation (see Table 1).

Table 2 Credible intervals for step change point model parameters τ and δ

The comparison of the 50% and 80% CIs for the estimated time of a step change of size δ=+6 in the Poisson EWMA chart reveals that the posterior distribution of the time is highly left skewed: increasing the probability moves the left boundary of the interval from 96.9 to 88, a larger shift than that in the right boundary. This investigation can be extended to other shift sizes and control chart scenarios for the time estimates. As shown in Table 1 and discussed above, the magnitude of the change is not estimated as precisely as the time. However, Table 2 shows that in most cases for δ=+2, the real size of the change is contained in the respective posterior 50% and 80% CIs.

Having a distribution for the time of the change enables us to make other probabilistic inferences. As an example, Table 3 shows the probability of the occurrence of the change point in the last 10, 25, and 50 observed samples prior to signalling in the control charts. For a step change of size δ=+2, since the c-chart signals very late (see Table 1), it is unlikely that the change point occurred in the last 10, 25, and even 50 samples. In contrast, in the Poisson EWMA and CUSUM charts, where they both signal earlier than the c-chart, the probabilities of occurrence in the last 10 samples are 0.55 and 0.59, then increase to 0.76 and 0.82, respectively as the next 15 samples are included. In the case of δ=+6, most of the probability density (0.98) is located between the last 25 and 50 samples for the c-chart, whereas with 0.80, it is between the last 10 and 25 samples for the Poisson EWMA chart and with probability 0.91, it is in the last 10 samples for the Poisson CUSUM chart. These kinds of probability computations and inferences can be extended to other change scenarios.
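Given posterior draws of τ, probabilities of this kind are simple proportions. The Python sketch below assumes that "the last k samples prior to signalling" means times T − k + 1, …, T, where T is the signal time; this convention, the function name and the sample values are assumptions for illustration.

```python
def prob_change_in_last(tau_samples, signal_time, k):
    """Posterior probability that the change occurred within the last k samples."""
    hits = sum(1 for tau in tau_samples if tau > signal_time - k)
    return hits / len(tau_samples)


# Hypothetical posterior draws of tau for a chart that signalled at T = 140
tau_samples = [95, 99, 100, 100, 101, 120, 130, 139]
p10 = prob_change_in_last(tau_samples, 140, 10)
p25 = prob_change_in_last(tau_samples, 140, 25)
p50 = prob_change_in_last(tau_samples, 140, 50)
```

Differences such as `p25 - p10` give the probability that the change lies between the last 10 and the last 25 samples, which is how the interval statements above can be read.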

Table 3 Probability of occurrence of change point in last 10, 25, and 50 observed samples prior to signalling

Table 4 shows the average of the estimated parameters obtained from the 100 replicated datasets. As seen, although the c-chart detects small to medium shifts with a large delay, it performs better where there exists a jump. Having a longer delay in the detection of a decrease in the Poisson rate in comparison with an increase of the same size in the c-chart is due to the equality of mean and the variance of the Poisson distribution. Therefore, a fall in the mean leads to less dispersed observations. The Poisson EWMA and CUSUM charts behave in the same manner.

Table 4 Average of posterior estimates (mode, SD) of step change point model parameters τ and δ following signals (RL)

For a step change of size around half a standard deviation (δ=±2) in the Poisson rate, the average of the modes, $E (\hat{τ})$, reports the 103rd sample as the change point in all three control charts, whereas the charts detect the changes with delays greater than 17 samples, obtained in the Poisson CUSUM. This superiority persists where a medium shift of size δ=±6 has occurred in the process mean. As expected, for large-shift sizes (δ=±15), around three standard deviations, all control charts perform well, yet the mean of modes outperforms them with a delay of less than one observation.

Table 4 reveals that in all three control charts, the variation of Bayesian estimates for time tends to reduce when the magnitude of shift in the process mean increases. However, by the nature of the Poisson distribution, for small to medium drops, δ=(−2,−6), the observed variation is less than that obtained in the estimation of jumps. The mean of the standard deviation of the posterior estimates of time, $E (σ_{\hat{τ}})$, also decreases when moving from small-shift sizes to medium and large sizes in the Poisson EWMA and CUSUM charts. In contrast, the greatest variation is obtained for a large shift of size δ=±15 in the c-chart. This is due to the early detection of such shifts by the c-chart that leads to a very short run of samples after the change which then compresses the data and hence informs the MCMC algorithm.

The average of the Bayesian estimates of the magnitude of the change, $E (\hat{δ})$ , shows that the modes of posteriors for change sizes do not perform as well as the posterior distributions of the time across different shift sizes; however, promising results are obtained where a small shift, δ=±2, has occurred in the process mean. This estimator tends to underestimate the sizes, particularly where there exists a jump. This bias increases when the shift size increases since a very short run of samples coming from the out-of-control state of the process with a high variance was used. As seen in Table 4, the best estimates are obtained in Poisson EWMA cases. Having said that, Bayesian estimates of the magnitude of the change must be studied in conjunction with their corresponding standard deviations. In this manner, analysis of credible intervals would be effective.

Linear trend model

Posterior distributions and probabilistic intervals and inferences are obtainable for the time and the slope of the linear trend, similar to the step change scenario illustrated in section ‘Step change model’. We, here, limited our results to the replication study over the simulated datasets outlined in section ‘Evaluation’.

Table 5 shows the average of the estimated parameters over trends with slope sizes of β={±0.50,±1.0,±2.0}. For a linear trend with small slopes of size β=±0.5 in the Poisson rate, the average modal value, $E (\hat{τ})$ , reports the 105th sample and less as the change point in all three control charts, whereas the charts detect the changes with delays greater than 10 samples, obtained in the Poisson CUSUM. This superiority also persists where a trend with larger slopes of size β=±1.0,±2.0 has occurred in the process mean. In these scenarios, the bias of the Bayesian estimator does not exceed two and one samples, where the minimum delays are seven and four samples, respectively.

Table 5 Average of posterior estimates (mode, SD) of linear trend change point model parameters τ and β following signals (RL)

Table 5 shows that in all three control charts, the variation of the Bayesian estimates for time tends to reduce when the magnitude of slope increases. The mean of posterior standard deviation for time, $E (σ_{\hat{τ}})$, also decreases when moving from small slope sizes to medium and large sizes in both directions. However, the observed variation for estimation of a decreasing trend is less than that obtained for an increasing trend with the same slope size.

The average of the posterior estimates for the magnitude of the change, $E (\hat{β})$ , shows that the modes of the posteriors for change sizes perform as well as the posterior estimates of the time, particularly, for the c-chart and Poisson EWMA chart. In the CUSUM chart, the posteriors tend to underestimate the slope sizes.

Multiple change model

We applied the multiple change point model following signals of the c-chart as the Poisson EWMA and CUSUM mostly signalled before simulating the second change in the process. Although results here are limited to the replication study, distribution of parameters and probabilistic inferences can easily be constructed.

As seen in Table 6 and discussed in section ‘Step change model’, the c-chart signals earlier when a larger shift, either an increase or decrease, has occurred in the second change; however, it performs better where there exists a jump, regardless of the direction of the first change. The chart alarmed after 38 samples when two consecutive drops of sizes around one and two standard deviations, δ_1,2=(−4,−8), occurred. Although this delay falls to 16 samples when the second change has happened in the opposite direction, the modes of posteriors for the time of the first change, $E (\hat{τ_{1}})$ , outperform the chart. This superiority persists when the size of the second change increases to around three standard deviations, δ₂=(±12). The same results are also obtained, where the first change is an increase in the magnitude of one standard deviation, δ₁=(+4).

Table 6 Average of posterior estimates (mode, SD) of multiple step change point model parameters τ_1,2 and δ_1,2 following signals (RL)

Table 6 reveals that the Bayesian estimator tends to underestimate the time of the first change of two monotonic changes where the second change is of size δ₂=(±12). The associated variation, within replications, increases when the second step change increases in the same direction of the first change. The minimum variations of the posterior distributions for the time of the first change, $E ({\hat{σ}}_{\hat{τ_{1}}})$ , are obtained where there exist non-monotonic changes (see δ_1,2=(−4,+8) and δ_1,2=(+4,−8)). This variation also increases when the second step change increases in the same direction of the first change.

The time of the second step change is estimated precisely by the posterior modes. Table 6 shows that the average, $E (\hat{τ_{2}})$, mostly concentrates on the 110th sample. Surprisingly, the variation between replications and also the variation of posterior distributions obtained for the time of the second change, $E ({\hat{σ}}_{\hat{τ_{2}}})$, are less than those obtained for the first step change.

The average of the posterior estimates of the magnitude of the changes, $E (\hat{δ_{1}})$ and $E (\hat{δ_{2}})$ , shows that the modes of the posteriors for change sizes do not perform as well as the posterior distributions of the time across different scenarios. The modes tend to underestimate the sizes, particularly, for jumps in either the first or the second step change. However, there still exists some gain in studying the estimated sizes and directions, particularly when the obtained standard deviations are also considered.

Comparative performance and model selection

We used Monte Carlo simulation to study the performance of the developed change point models in different change point scenarios following a signal from a c-chart. We generated 100 observations of a Poisson process with an in-control rate of λ₀=20. We then induced a step, a linear trend, and a multiple change in the Poisson rate. For each scenario the three change point models were applied and the time of the change was estimated. Based on the MCMC simulation, the Deviance Information Criterion (DIC) and related parameters, mean and variance of the posterior distribution of the deviance and the penalty value, were recorded. The DIC is a goodness of fit criterion which takes into account the deviance of the model, −2log(p(y∣θ)), and a penalty for the model complexity, p_D (Spiegelhalter et al. 2002). To allow for asymmetry in the posterior distribution, seen in Figure 1, p_V was used as an alternative to p_D, where p_V is half of the variance of the posterior distribution of the deviance (Gelman et al. 2004).
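With the deviance evaluated at each MCMC draw, the criterion based on p_V reduces to a mean plus half a variance. The Python sketch below assumes DIC_V is the posterior mean deviance plus p_V; the function name and the deviance values are hypothetical.

```python
import statistics


def dic_with_pv(deviance_samples):
    """Return (DIC_V, p_V) from posterior draws of the deviance -2*log p(y | theta)."""
    mean_deviance = statistics.mean(deviance_samples)
    p_v = statistics.variance(deviance_samples) / 2.0  # half the sample variance
    return mean_deviance + p_v, p_v


# Hypothetical deviance draws for two candidate change point models
dic_a, pv_a = dic_with_pv([100.0, 102.0, 104.0])
dic_b, pv_b = dic_with_pv([110.0, 111.0, 112.0])
preferred = "A" if dic_a < dic_b else "B"
```

The model with the smaller value is preferred, which is the comparison reported across the step, linear trend and multiple change models in Table 7.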

Table 7 indicates that the Bayesian estimate of a step change outperforms other Bayesian estimates, linear and multiple, where there is a step change in the process parameter. It estimates 101.9 and 108.3 as the times of change of sizes δ=−4 and δ=+4, respectively, whereas the linear model underestimates the time with a bias of around 55 and 24 samples and the multiple model tends to overestimate it relative to the step model. According to the reported DICs, the DIC_V supports that the step model with values of 1,167 and 845.5 is a preferable fit where there exists either an increasing or a decreasing step change.

Table 7 Performance and goodness of change point models on different change types following signal

In the case of an occurrence of a linear trend shift in the Poisson rate, the Bayesian estimate of a linear trend change outperforms other Bayesian estimates in estimating the change point. The reported DIC_V is convincing that the linear model with values of 603.7 and 630.9 is also the best fit. These results can be extended to the multiple change scenario. Table 7 shows that the Bayesian estimate of a multiple change (two changes) outperforms other Bayesian estimates, step and linear, where there are two consecutive changes in the Poisson rate. Similarly, the reported DIC_V supports that the multiple model with values of 636.4 and 791.2 is also the best fit in this case.

Comparison of Bayesian estimator with other methods

To study the performance of the proposed Bayesian estimators in comparison with those introduced in section ‘Background’, we ran the alternatives, namely the built-in estimators of the Poisson EWMA and CUSUM charts and the ML estimators, over the replications described in section ‘Evaluation’.

Table 8 shows the mean of Bayesian estimates and detected change points provided by built-in estimators of EWMA (Nishina 1992) and CUSUM (Page 1954) charts and the ML estimator (Perry 2004) for a step change in a Poisson process.

Table 8 Average of estimated time of a step change in a Poisson process obtained by Bayesian estimators, CUSUM and EWMA built-in estimators, and ML estimator following signals (RL)

Although the Bayesian estimator, $\hat{\tau}_{b}$, tends to overestimate the time of a step change of small size, δ=±2, with a delay of three samples, it outperforms the ML estimator, $\hat{\tau}_{MLE}$, which underestimates the time by six samples following a signal from the c-chart. For step sizes of one and a half and three standard deviations, the ML estimator performs slightly better than the Bayesian estimator; however, this advantage diminishes once the obtained standard deviations are considered, particularly where there is a jump in the process mean.
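
For readers unfamiliar with the ML alternative, a step change estimator for a Poisson rate with known in-control rate can be sketched by maximising the profile log-likelihood over candidate change points. This is an illustrative sketch only; the function name is hypothetical, and whether it matches the exact ML estimator used in the comparison above is an assumption. For a candidate t, the post-change rate is replaced by the mean of the observations after t.

```python
import math


def mle_step_change(xs, lam0):
    """Return the t that maximises the profile log-likelihood of a step change.

    t is the number of in-control observations, so the change is taken to
    affect observations t+1, ..., T (1-based).
    """
    total = len(xs)
    best_t, best_val = 0, -math.inf
    tail_sum = 0.0
    # Walk backwards so that tail_sum is always the sum of xs[t:].
    for t in range(total - 1, -1, -1):
        tail_sum += xs[t]
        n = total - t
        lam_hat = tail_sum / n                      # ML estimate of the new rate
        log_term = tail_sum * math.log(lam_hat / lam0) if tail_sum > 0 else 0.0
        val = log_term - n * (lam_hat - lam0)       # log-likelihood ratio vs no change
        if val > best_val:
            best_val, best_t = val, t
    return best_t


# Toy series: ten in-control counts at 20 followed by five counts at 30.
tau_hat = mle_step_change([20] * 10 + [30] * 5, lam0=20.0)
```

Unlike the Bayesian estimator, this returns a single point without an accompanying posterior distribution.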

Table 8 reveals that the EWMA estimator, $\hat{\tau}_{EWMA}$, underestimates the change point as the size of the shift increases in either direction, whereas the Bayesian estimator tends to be more precise. $\hat{\tau}_{b}$ remains the best estimator for small changes and shows acceptable performance in comparison with $\hat{\tau}_{MLE}$ over larger shifts, particularly when the standard deviations are taken into account.

The CUSUM estimator, $\hat{\tau}_{CUSUM}$, outperforms the equivalent EWMA estimator for larger shifts, δ=(±6,±15); however, it significantly overestimates the time of small shifts. As in the c-chart and EWMA cases, the Bayesian estimator following a CUSUM signal outperforms the alternatives for small shifts and offers acceptable performance for other shift sizes, considering the obtained standard deviations over replications.
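
The built-in estimators discussed above can be illustrated for upward shifts as follows. This is a hedged sketch: the function names, the reference value `k`, the decision interval `h`, the smoothing constant `r` and the limit width `L` are illustrative assumptions rather than the chart designs used in the study. The CUSUM estimate is taken as the last time the statistic was zero before the signal, and the EWMA estimate as the last time the statistic was at or below the in-control rate.

```python
import math


def cusum_change_point(xs, k, h):
    """Upper Poisson CUSUM: return (signal_index, last index with statistic at zero)."""
    s = 0.0
    last_zero = 0
    for i, x in enumerate(xs, start=1):
        s = max(0.0, s + x - k)
        if s == 0.0:
            last_zero = i
        if s > h:
            return i, last_zero
    return None, last_zero


def ewma_change_point(xs, lam0, r, L):
    """Upper Poisson EWMA: return (signal_index, last index at or below lam0)."""
    ucl = lam0 + L * math.sqrt(r * lam0 / (2.0 - r))  # asymptotic upper limit
    z = lam0
    last_below = 0
    for i, x in enumerate(xs, start=1):
        z = r * x + (1.0 - r) * z
        if z <= lam0:
            last_below = i
        if z > ucl:
            return i, last_below
    return None, last_below


# Toy series with an upward shift after the fourth observation.
counts = [18, 20, 19, 21, 30, 32, 33, 35]
cusum_signal, cusum_tau = cusum_change_point(counts, k=22.0, h=10.0)
ewma_signal, ewma_tau = ewma_change_point(counts, lam0=20.0, r=0.2, L=2.5)
```

Downward shifts would need the mirrored lower-sided statistics, which are omitted here for brevity.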

Table 9 shows the mean of the Bayesian estimates and the change points detected by the built-in estimators of the EWMA (Nishina 1992) and CUSUM (Page 1954) charts and by the ML estimator (Perry et al. 2006) for a linear trend change in a Poisson process. The ML estimator is restricted to trends with a positive slope because Newton’s method is not tractable for decreasing trends in the Poisson mean; see Perry et al. (2006) for more details.

Table 9 Average of estimated time of linear trend in Poisson process obtained by Bayesian estimator, CUSUM and EWMA built-in estimators, and ML estimator following signals (RL)

The Bayesian estimator, $\hat{\tau}_{b}$, outperforms the built-in estimator of EWMA, $\hat{\tau}_{EWMA}$, in almost all cases where there exists a decreasing trend. This superiority increases as the magnitude of the slope rises, β=−2. The CUSUM estimator, $\hat{\tau}_{CUSUM}$, estimates the change point more precisely than the EWMA; however, the Bayesian estimator, $\hat{\tau}_{b}$, remains the best alternative for detection of linear trends with negative slopes when the variation of the estimates is taken into account.

Table 9 reveals that the Bayesian estimator, $\hat{\tau}_{b}$, is slightly outperformed by the ML estimator, $\hat{\tau}_{MLE}$, across the charts when there exists an increasing linear trend in the process mean. Having said that, the Bayesian estimator can still be a reasonable alternative in light of the obtained standard deviations, which are smaller than those observed from the ML estimator over replications.

The ML estimator proposed by Perry et al. (2007a) is suitable for monotonic consecutive changes. In contrast, the Bayesian estimator for a known number of change points proposed in section ‘Multiple change model’ can also be applied where there exist non-monotonic consecutive changes in the process mean. The comparison study was therefore not pursued for the multiple change point case, as there is no appropriate ML alternative against which to evaluate the Bayesian estimator. Similarly, the built-in EWMA and CUSUM estimators cannot be studied, as they tend to signal before the second change point. When they do signal after the second change, they also fail in non-monotonic scenarios, as they tend to identify the time of the latter step change as the change point.

Apart from the accuracy and precision criteria used in the comparison study, the posterior distributions for the time and the magnitude of a change enable us to construct probabilistic intervals around the estimates and to make probabilistic inferences about the location of the change point, as discussed in section ‘Step change model’ for the step change scenario. This is a significant advantage of the proposed Bayesian approach. Although similar results may be obtained by using resampling in conjunction with ML methods, the inferential basis of that approach is more limited; see Bernardo and Smith (1994) for more details. Furthermore, the flexibility of Bayesian hierarchical models, the ease of extension to more complicated change scenarios such as combinations of steps and linear and nonlinear trends, the avoidance of analytic calculation of the likelihood function, particularly for non-tractable likelihood functions, and the ease of coding with available packages should be considered additional benefits of the proposed Bayesian change point model for monitoring purposes. The approach can readily be applied to other types of data and processes, such as Bernoulli, normal and exponential family data, while retaining these advantages.
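
As an illustration of the probabilistic summaries mentioned above, posterior draws of the change point can be turned into an interval and a window probability as sketched below. The function names, the equal-tailed interval construction and the 80% level are assumptions made for the example, and the draws are a toy stand-in for MCMC output.

```python
import math


def credible_interval(draws, level=0.8):
    """Equal-tailed credible interval taken from the sorted posterior draws."""
    ordered = sorted(draws)
    n = len(ordered)
    lo_idx = math.floor((1.0 - level) / 2.0 * (n - 1))
    hi_idx = math.ceil((1.0 + level) / 2.0 * (n - 1))
    return ordered[lo_idx], ordered[hi_idx]


def prob_change_within(draws, signal_time, window):
    """Posterior probability that the change lies in the last `window` samples before the signal."""
    hits = sum(1 for d in draws if signal_time - window <= d <= signal_time)
    return hits / len(draws)


# Toy posterior draws of the change point, for illustration only.
toy_draws = list(range(1, 101))
interval = credible_interval(toy_draws, level=0.8)
p_recent = prob_change_within(toy_draws, signal_time=100, window=10)
```

Statements of this kind ("the change occurred within the last ten samples with probability p") are what the posterior distribution makes available directly, without resampling.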

The two-step approach to change point identification described in this paper has the advantage of building on control charts that may already be in place in practice. An alternative may be to retain the two-step approach but to use a Bayesian framework in both stages. There is now a substantial literature on Bayesian formulations of control charts and extensions such as monitoring processes with varying parameters (Feltz and Shiau 2001), over-dispersed data (Bayarri and Garcia-Donato 2005), and start-up and short runs (Tsiamyrtzis and Hawkins 2005, 2008). A further alternative is to consider a fully Bayesian, one-step approach, in which both the monitoring of the in-control process and the retrospective or prospective identification of changes are undertaken in a single analysis. This is the subject of further research.

Conclusion

Identification of the time when a process has changed enables process engineers to pursue investigation of special causes more effectively. Indeed, knowing the change point restricts the search efforts to a tighter window of observations and related variables. In this paper, we modeled the change point estimation for a Poisson process in a Bayesian framework. We considered three scenarios of changes: a step change, a linear trend, and a multiple change when the number of changes is known. We constructed Bayesian hierarchical models and derived posterior distributions for change point estimates using MCMC. We compared the performance of the Bayesian estimators with c-, Poisson EWMA, and CUSUM control charts. The results showed that the Bayesian estimates outperform standard control charts in change estimation, particularly where there exists a step change (or changes) of small to medium size or a linear trend change with a small to relatively large slope. In comparison with the built-in estimators of EWMA and CUSUM and the ML-based estimators, the Bayesian estimator performs reasonably well and remains a strong alternative, particularly when other criteria such as probability quantification through credible intervals and probabilistic inferences, flexibility, generalizability, and simplicity are taken into account.

The investigation of the performance of the Bayesian estimates over different change scenarios reveals that each Bayesian change point model outperforms the other models where its underlying change type has occurred in the Poisson process. The results also support the use of DIC as a preliminary step in change point estimation, directing process engineers to the appropriate change point model before they make inferences about the underlying changes in the process.

Appendix

The following are the different change model codes:

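1. Step change model

model {
  for (i in 1:RL_c) {
    xc[i] ~ dpois(lambda2[i])
    # rate shifts by delta from the change point onwards; step(e) is 1 if e >= 0, else 0
    lambda1[i] <- lambda0 + delta * step(i - change)
    # keep the Poisson rate strictly positive
    lambda2[i] <- max(lambda1[i], 0.000001)
  }
  # the second argument of dnorm is a precision
  tau <- 1 / (6 * sqrt(lambda0))
  RL <- RL_c - 1
  delta ~ dnorm(0, tau)
  change ~ dunif(1, RL)
}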
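
2. Linear trend change model

model {
  for (i in 1:RL_c) {
    xc[i] ~ dpois(lambda2[i])
    # rate grows by beta per observation after the change point
    lambda1[i] <- lambda0 + beta * (i - change) * step(i - change)
    # keep the Poisson rate strictly positive
    lambda2[i] <- max(lambda1[i], 0.000001)
  }
  # the second argument of dnorm is a precision
  tau <- 1 / (6 * sqrt(lambda0))
  RL <- RL_c - 1
  beta ~ dnorm(0, tau)
  change ~ dunif(1, RL)
}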
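
3. Multiple change model

model {
  for (i in 1:RL_c) {
    xc[i] ~ dpois(lambda2[i])
    # delta1 applies between change1 and change2, delta2 from change2 onwards
    lambda1[i] <- lambda0 + delta1 * step(i - change1) * step(change2 - i) + delta2 * step(i - change2)
    # keep the Poisson rate strictly positive
    lambda2[i] <- max(lambda1[i], 0.000001)
  }
  # the second argument of dnorm is a precision
  tau <- 1 / (6 * sqrt(lambda0))
  RL <- RL_c - 1
  delta1 ~ dnorm(0, tau)
  delta2 ~ dnorm(0, tau)
  change1 ~ dunif(1, change2)
  change2 ~ dunif(change1, RL)
}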
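
To make the structure of the step change model concrete, the sketch below evaluates its unnormalised log-posterior in Python and locates the maximum on a coarse grid. It is an illustration only and is not the MCMC procedure used in the paper: the function names and the grid of candidate values are assumptions, while the rate floor, the uniform prior on the change point over (1, RL) and the normal prior with precision 1/(6·sqrt(lambda0)) mirror the model code above.

```python
import math


def step(e):
    """BUGS-style step function: 1 if e >= 0, else 0."""
    return 1.0 if e >= 0 else 0.0


def log_posterior_step(xs, lam0, change, delta):
    """Unnormalised log-posterior of (change, delta) for the step change model."""
    rl = len(xs) - 1                          # RL = RL_c - 1
    if not (1.0 <= change <= rl):
        return -math.inf                      # outside the uniform prior support
    tau = 1.0 / (6.0 * math.sqrt(lam0))       # prior precision of delta
    lp = -0.5 * tau * delta * delta           # normal prior on delta, up to a constant
    for i, x in enumerate(xs, start=1):
        lam = max(lam0 + delta * step(i - change), 1e-6)
        lp += x * math.log(lam) - lam         # Poisson log-likelihood, up to a constant
    return lp


def grid_map(xs, lam0, deltas):
    """Grid search for the (change, delta) pair with the highest log-posterior."""
    best = None
    for change in range(1, len(xs)):
        for delta in deltas:
            lp = log_posterior_step(xs, lam0, change, delta)
            if best is None or lp > best[0]:
                best = (lp, change, delta)
    return best[1], best[2]


# Toy series: ten counts at 20, then eight counts at 30.
change_hat, delta_hat = grid_map([20] * 10 + [30] * 8, 20.0, [-10.0, -5.0, 0.0, 5.0, 10.0])
```

Because `step(i - change)` equals 1 when `i >= change`, the value of `change` here is the first observation at the shifted rate. The linear trend and multiple change models differ only in how the rate `lam` is constructed.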

Authors’ information

HA is a research fellow at the University of New South Wales. His primary research interests include Bayesian methods and engineering statistics and its applications in statistical quality control. RN is Professor of Industrial Engineering at Iran University of Science and Technology. His primary research interests include statistical process control, process optimization, Six Sigma, and statistical analysis. KM is Professor of Statistics at Queensland University of Technology. Her research interests are Bayesian statistics, mixture models, hierarchical modelling and meta-analysis.

References

Amiri A, Allahyari S: Change point estimation methods for control chart postsignal diagnostics: a literature review. Qual Reliability Eng Int 2012,28(7):673–685. 10.1002/qre.1266
Amiri A, Koosha M, Azhdari A: Profile monitoring for Poisson responses. IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), Singapore, Singapore, 6–9 Dec 2011. 2011.
Assareh H, Mengersen K: Change point estimation in monitoring survival time. PLoS ONE 2012,7(3):e33630. 10.1371/journal.pone.0033630
Assareh H, Smith I, Mengersen K: Bayesian change point detection in monitoring cardiac surgery outcomes. Qual Manag Health Care 2011a,20(3):207–222. 10.1097/QMH.0b013e318220897e
Assareh H, Smith I, Mengersen K: Bayesian estimation of the time of a linear trend in risk-adjusted control charts. IAENG Int J Comput Sci 2011b,38(4):409–417.
Assareh H, Smith I, Mengersen K: Change point detection in risk adjusted control charts. Stat Methods Med Res 2011c. http://dx.doi.org/10.1177/0962280211426356
Atashgar K: Identification of the change point: an overview. Int J Adv Manuf Technol 2013,64(9–12):1663–1683. 10.1007/s00170-012-4131-2
Bayarri MJ, Garcia-Donato G: A Bayesian sequential look at u-control charts. Technometrics 2005,47(2):141–151.
Benneyan JC: Statistical quality control methods in infection control and hospital epidemiology, Part I: introduction and basic theory. Infect Control Hosp Epidemiol 1998,19(3):194–241. 10.1086/647795
Bernardo JM, Smith AFM: Bayesian theory. New York: Wiley; 1994.
Borror CM, Champ CW, Rigdon SE: Poisson EWMA control charts. J Qual Technol 1998,30(4):352–361.
Brook D, Evans D: An approach to the probability distribution of CUSUM run length. Biometrika 1972,59(3):539–549. 10.1093/biomet/59.3.539
Chiu JE, Kuo TI: Attribute control chart for multivariate Poisson distribution. Commun Stat Theory Methods 2007, 37: 146–158. 10.1080/03610920701648771
Feltz CJ, Shiau JJH: Statistical process monitoring using an empirical Bayes multivariate process control chart. Qual Reliability Eng Int 2001,17(2):119–124. 10.1002/qre.393
Gardiner JS, Montgomery DC: Using statistical control charts for software quality control. Qual Reliability Eng Int 1987, 3: 15–20. 10.1002/qre.4680030104
Gelman A, Carlin J, Stern H, Rubin D: Bayesian Data Analysis. London: Chapman & Hall/CRC; 2004.
Hou CD, Shao YE, Huang S: A combined MLE and generalized P chart approach to estimate the change point of a multinomial process. Appl Math Inf Sci 2013,7(4):1487–1493. 10.12785/amis/070430
Limaye SS, Mastrangelo CM, Zerr DM: A case study in monitoring hospital-associated infections with count control charts. Qual Eng 2008,20(4):404–413. 10.1080/08982110802334120
Montgomery DC: Introduction to Statistical Quality Control. New York: Wiley; 2008.
Niaki S, Khedmati M: Detecting and estimating the time of a step-change in multivariate Poisson processes. Scientia Iranica 2012,19(3):862–871. 10.1016/j.scient.2011.11.044
Niaki S, Nafar M: An artificial neural network approach to monitor and diagnose multi-attribute quality control processes. J Ind Eng Int 2008,4(7):10–24.
Niaki SA, Abbasi B: A transformation technique in designing multi-attribute C control charts. Scientia Iranica 2008, 15: 125–130.
Niaki STA, Khedmati M: Estimating the change point of the parameter vector of multivariate Poisson processes monitored by a multi-attribute T2 control chart. Int J Adv Manuf Technol 2013a,64(9–12):1625–1642. 10.1007/s00170-012-4128-x
Niaki STA, Khedmati M: Identifying the change time of multivariate binomial processes for step changes and drifts. J Ind Eng Int 2013b, 9: 1–11. 10.1186/2251-712X-9-1
Nishina K: A comparison of control charts from the viewpoint of change-point estimation. Qual Reliability Eng Int 1992,8(6):537–541. 10.1002/qre.4680080605
Noorossana R, Saghaei A, Paynabar K, Abdi S: Identifying the period of a step change in high-yield processes. Qual Reliability Eng Int 2009,25(7):875–883. 10.1002/qre.1007
Page ES: Continuous inspection schemes. Biometrika 1954,41(1/2):100–115. 10.2307/2333009
Page ES: Cumulative sum charts. Technometrics 1961, 3: 1–9. 10.1080/00401706.1961.10489922
Perry MB: Robust change detection and change point estimation for Poisson count processes. USA: PhD thesis, Florida State University; 2004.
Perry MB, Pignatiello JJ, Simpson J: Estimating the change point of a Poisson rate parameter with a linear trend disturbance. Qual Reliability Eng Int 2006,22(4):371–384. 10.1002/qre.715
Perry MB, Pignatiello JJ, Simpson J: Change point estimation for monotonically changing Poisson rates in SPC. Int J Prod Res 2007a, 45: 1791–1813. 10.1080/00207540600622449
Perry MB, Pignatiello JJ, Simpson JR: Estimating the change point of the process fraction non-conforming with a monotonic change disturbance in SPC. Qual Reliability Eng Int 2007b,23(3):327–339. 10.1002/qre.792
Plummer M, Best N, Cowles K, Vines K: The Coda package: Output Analysis and Diagnostics for MCMC, R Package Version 0.13–2. Available: CRAN.R-project.org/package=coda (April 2010). Citeseer 2010.
Roberts S: Control chart tests based on geometric moving averages. Technometrics 1959, 1: 239–250. 10.1080/00401706.1959.10489860
Samuel T, Pignatiello J: Identifying the time of a change in a poisson rate parameter. Qual Eng 1998,10(4):673–681. 10.1080/08982119808919185
Sharafi A, Aminnayeri M, Amiri A: An MLE approach for estimating the time of step changes in Poisson regression profiles. Scientia Iranica 2013,20(3):855–860.
Shewhart W: Quality control charts. Bell Syst Tech J 1926, 5: 593–602. 10.1002/j.1538-7305.1926.tb00125.x
Shewhart W: Quality control. Bell Syst Tech J 1927, 6: 722–735. 10.1002/j.1538-7305.1927.tb00215.x
Spiegelhalter D, Best N, Carlin B, Van Der Linde A: Bayesian measures of model complexity and fit. J R Stat Soc Series B (Methodological) 2002,64(4):583–639. 10.1111/1467-9868.00353
Spiegelhalter D, Thomas A, Best N: WinBUGS version 1.4. Bayesian inference using Gibbs sampling. Cambridge, United Kingdom: MRC Biostatistics Unit. Institute for Public Health; 2003.
Sturtz S, Ligges U, Gelman A: R2WinBUGS: a package for running WinBUGS from R. J Stat Softw 2005,12(3):1–16.
Trevanich A, Bourke P: EWMA control charts using attributes data. Statistician 1993,42(3):215. 10.2307/2348797
Tsiamyrtzis P, Hawkins DM: A Bayesian scheme to detect changes in the mean of a short-run process. Technometrics 2005,47(4):446–456. 10.1198/004017005000000346
Tsiamyrtzis P, Hawkins DM: A Bayesian EWMA method to detect jumps at the start-up phase of a process. Qual Reliability Eng Int 2008,24(6):721–735. 10.1002/qre.952
Vermaat MB, Does RJMM, Bisgaard S: EWMA control chart limits for first- and second-order autoregressive processes. Qual Reliability Eng Int 2008,24(5):573–584. 10.1002/qre.922
Weiß CH: Controlling correlated processes of Poisson counts. Qual Reliability Eng Int 2007,23(6):741–754. 10.1002/qre.875
White CH, Keats JB, Stanley J: Poisson CUSUM versus c-chart for defect data. Qual Eng 1997,9(4):673–679. 10.1080/08982119708919089
Woodall WH: Control charts based on attribute data: bibliography and review. J Qual Technol 1997,29(2):172–183.
Woodall WH, Adams BM: The statistical design of CUSUM charts. Qual Eng 1993,5(4):559–570. 10.1080/08982119308918998

Acknowledgements

The authors gratefully acknowledge the financial support from Queensland University of Technology and St Andrews Medical Institute through an ARC Linkage Project.

Author information

Authors and Affiliations

Simpson Centre for Health Services Research, Australian Institute of Health Innovation, Faculty of Medicine, University of New South Wales, Sydney, NSW, 2052, Australia
Hassan Assareh
School of Industrial Engineering, Iran University of Science and Technology, Tehran, Iran
Rassoul Noorossana
Discipline of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, Brisbane, QLD, 4001, Australia
Kerrie L Mengersen

Corresponding author

Correspondence to Hassan Assareh.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

HA contributed to the conception, design, and implementation of statistical analysis and writing and modification of the manuscript. RN and KM contributed to the conception, design, and modification of the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Assareh, H., Noorossana, R. & L Mengersen, K. Bayesian change point estimation in Poisson-based control charts. J Ind Eng Int 9, 32 (2013). https://doi.org/10.1186/2251-712X-9-32

Received: 05 June 2013
Accepted: 17 October 2013
Published: 20 November 2013
DOI: https://doi.org/10.1186/2251-712X-9-32
