Abstract
In this paper we examine the possibilities of using popular univariate statistical tests for discovering virtual concept drift in the stream of multidimensional data. Three popular methods are evaluated with different generalization approaches both on simulated and real data and compared by the specificity and sensitivity scores.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Newman, D.J., Asuncion, A.: UCI machine learning repository (2007)
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2002, pp. 1–16. ACM, New York (2002)
Corder, G.W., Foreman, D.I.: Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach, 1st edn. Wiley (May 2009)
Dries, A., Rückert, U.: Adaptive concept drift detection. Stat. Anal. Data Min. 2(5-6), 311–327 (2009)
Fasano, G., Franceschini, A.: A multidimensional version of the Kolmogorov-Smirnov test. Monthy Notices of the Royal Astronomical Society 225, 155–170 (1987)
Friedman, J., Rafsky, L.: Multivariate generalizations of the wald-wolfowitz and smirnov two-sample tests. In: The Annals of Statistics, pp. 697–717 (1979)
Greiner, R., Grove, A.J., Roth, D.: Learning cost-sensitive active classifiers. Artif. Intell. 139(2), 137–174 (2002)
Lowry, R.: Concepts and Applications of Inferential Statistics. Online (June 2007)
Markou, M., Singh, S.: Novelty detection: a review–part 1: statistical approaches. Signal Processing 83(12), 2481–2497 (2003)
Nishida, K., Yamauchi, K.: Detecting concept drift using statistical testing. In: Corruble, V., Takeda, M., Suzuki, E. (eds.) DS 2007. LNCS (LNAI), vol. 4755, pp. 264–269. Springer, Heidelberg (2007)
Revuz, D., Yor, M.: Continuous Martingales and Brownian Motion (Grundlehren der mathematischen Wissenschaften), 3rd edn. Springer (December 2004)
Schlimmer, J.C., Granger Jr., R.H.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986)
Smirnov, N.V.: Table for estimating the goodness of fit of empirical distributions. Ann. Math. Stat. 19, 279–281 (1948)
Sugiura, N.: Multisample and multivariate nonparametric tests based on u statistics and their asymptotic efficiencies. Osaka J. Math. 2, 385–426 (1965)
Vreeken, J., van Leeuwen, M., Siebes, A.: Characterising the difference. In: Berkhin, P., Caruana, R., Wu, X. (eds.) KDD, pp. 765–774. ACM (2007)
Wald, A.: Sequential Tests of Statistical Hypotheses. The Annals of Mathematical Statistics 16(2), 117–186 (1945)
Wang, S., Schlobach, S., Klein, M.C.A.: Concept drift and how to identify it. J. Web Sem. 9(3), 247–265 (2011)
Widmer, G., Kubat, M.: Effective learning in dynamic environments by explicit context tracking. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 227–243. Springer, Heidelberg (1993)
Wilcoxon, F.: Individual Comparisons by Ranking Methods. Biometrics Bulletin 1(6), 80–83 (1945)
Wolfowitz, J.: On Waldś Proof of the Consistency of the Maximum Likelihood Estimate. The Annals of Mathematical Statistics 20, 601–602 (1949)
Žliobaitė, I.: Learning under Concept Drift: an Overview. Technical report, Vilnius University, Faculty of Mathematics and Informatic (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Sobolewski, P., Woźniak, M. (2013). Comparable Study of Statistical Tests for Virtual Concept Drift Detection. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds) Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013. Advances in Intelligent Systems and Computing, vol 226. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00969-8_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-00969-8_32
Publisher Name: Springer, Heidelberg
Print ISBN: 978-3-319-00968-1
Online ISBN: 978-3-319-00969-8
eBook Packages: EngineeringEngineering (R0)