A distance-based control chart for monitoring multivariate processes using support vector machines
Traditional control charts assume a baseline parametric model and compare new observations against it to identify significant departures. To monitor a process without a baseline model, real-time contrasts (RTC) control charts were recently proposed; they monitor the classification errors that arise when a binary classifier separates new observations from limited phase I data. In contrast to the RTC framework, the distance between an in-control dataset and a dataset of new observations can also be used to measure the shift of the process. In this paper, we propose a distance-based multivariate process control chart using support vector machines (SVM), referred to as the D-SVM chart. The SVM classifier provides a continuous score, or distance from the boundary, for each observation and allows smaller sample sizes than the earlier random-forest-based RTC charts. An extensive experimental study shows that the RTC charts with the SVM scores are more efficient than those with the random forest for detecting changes in high-dimensional and/or non-normal processes. A real-life example from a mobile phone assembly process is also considered.
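As a rough illustration of the idea of scoring observations by their distance from an SVM boundary, the sketch below trains a two-class SVM that separates a reference (in-control) sample from a window of new observations and then averages the signed scores of the window. It is a minimal sketch under stated assumptions, not the D-SVM chart as defined in the paper: the function names `window_scores` and `monitoring_statistic`, the choice of the mean score as the statistic, the RBF kernel, the balanced class weights, and the simulated data are all hypothetical choices made for illustration. It assumes NumPy and scikit-learn are installed.

```python
import numpy as np
from sklearn.svm import SVC


def window_scores(reference, window, kernel="rbf", C=1.0):
    """Signed SVM scores for the rows of `window`.

    Fits a two-class SVM (reference = 0, window = 1) and returns the
    decision_function values of the window rows. Larger values mean the
    row lies further on the window side of the boundary.
    """
    X = np.vstack([reference, window])
    y = np.concatenate([np.zeros(len(reference)), np.ones(len(window))])
    # Assumption: balanced class weights offset the small window
    # against the much larger reference sample.
    clf = SVC(kernel=kernel, C=C, gamma="scale", class_weight="balanced")
    clf.fit(X, y)
    return clf.decision_function(window)


def monitoring_statistic(reference, window, **svm_kwargs):
    """Hypothetical chart statistic: mean signed score of the window."""
    return float(np.mean(window_scores(reference, window, **svm_kwargs)))


# Simulated data (illustrative only): 5 variables, 200 reference rows,
# and two windows of 10 rows each.
rng = np.random.default_rng(0)
reference = rng.normal(size=(200, 5))
in_control = rng.normal(size=(10, 5))          # same distribution as reference
shifted = rng.normal(loc=10.0, size=(10, 5))   # large mean shift

stat_in_control = monitoring_statistic(reference, in_control)
stat_shifted = monitoring_statistic(reference, shifted)
print("in-control window:", stat_in_control)
print("shifted window:   ", stat_shifted)
```

In a monitoring setting, a statistic of this kind would be recomputed as the window slides over incoming observations and compared with a control limit. One common way to set such a limit, assumed here rather than taken from the paper, is to simulate in-control data and tune the limit until a target in-control average run length is reached.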
Keywords: Average run length; Classification; High-dimensional processes; Statistical process control; Support vector machine
The authors would like to thank the two anonymous referees for their helpful comments. Shuguang He’s research was supported by the National Natural Science Foundation of China #71472132 and #71532008; Wei Jiang’s research was supported by the Program of Shanghai Subject Chief Scientist #15XD1502000 and the National Natural Science Foundation of China #71531010, #71172131, and #71325003.