Abstract
Concept drift is a common problem in the data streams, which makes the classifiers no longer valid. In the multidimensional data, this problem becomes difficult to tackle. This paper examines the possibilities of identifying the specific features, in which concept drift occurs. This allows to limit the scope of the necessary update in the classification system. As a tool, we select a popular Kolmogorov-Smirnov test statistic.
Chapter PDF
References
Newman, D.J., Asuncion, A.: UCI machine learning repository (2007)
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2002, pp. 1–16. ACM, New York (2002)
Dries, A., Rückert, U.: Adaptive concept drift detection. Stat. Anal. Data Min. 2(5-6), 311–327 (2009)
Greiner, R., Grove, A.J., Roth, D.: Learning cost-sensitive active classifiers. Artif. Intell. 139(2), 137–174 (2002)
Revuz, D., Yor, M.: Continuous Martingales and Brownian Motion (Grundlehren der mathematischen Wissenschaften), 3rd edn. Springer (December 2004)
Schlimmer, J.C., Granger Jr., R.H.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986)
Smirnov, N.V.: Table for estimating the goodness of fit of empirical distributions. Ann. Math. Stat. 19, 279–281 (1948)
Sobolewski, P., Wozniak, M.: Sequential Tests of Statistical Hypotheses. The Annals of Mathematical Statistics 16(2), 117–186 (1945)
Sobolewski, P., Wozniak, M.: Ldcnet: minimizing the cost of supervision for various types of concept drift. In: Proceedings of the CIDUE 2013 - IEEE Symposium on Computational Intelligence in Dynamic and Uncertain Environments, CIDUE 2013, pp. 68–75 (2013)
Sobolewski, P., Woźniak, M.: Comparable study of statistical tests for virtual concept drift detection. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds.) CORES 2013. AISC, vol. 226, pp. 333–341. Springer, Heidelberg (2013)
Wilcoxon, F.: Individual Comparisons by Ranking Methods. Biometrics Bulletin 1(6), 80–83 (1945)
Wolfowitz, J.: On Wald’s Proof of the Consistency of the Maximum Likelihood Estimate. The Annals of Mathematical Statistics 20, 601–602 (1949)
Zliobaite, I., Kuncheva, L.I.: Determining the training window for small sample size classification with concept drift. In: Proceedings of the 2009 IEEE International Conference on Data Mining Workshops, ICDMW 2009, pp. 447–452. IEEE Computer Society, Washington, DC (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 IFIP International Federation for Information Processing
About this paper
Cite this paper
Sobolewski, P., Woźniak, M. (2014). Identifying Features with Concept Drift in Multidimensional Data Using Statistical Tests. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds) Artificial Intelligence Applications and Innovations. AIAI 2014. IFIP Advances in Information and Communication Technology, vol 436. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44654-6_40
Download citation
DOI: https://doi.org/10.1007/978-3-662-44654-6_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44653-9
Online ISBN: 978-3-662-44654-6
eBook Packages: Computer ScienceComputer Science (R0)