A First Attempt to Construct Effective Concept Drift Detector Ensembles
Big data is usually described by the so-called 5Vs (Volume, Velocity, Variety, Veracity, Value). Business success in the big data era depends strongly on smart analytical software that helps to make efficient decisions (Value for the enterprise). Decision support software should therefore take into account, in particular, that we deal with massive data (Volume) and that the data usually arrive continuously in the form of a so-called data stream (Velocity). Unfortunately, most traditional data analysis methods are not ready to analyze the fast-growing number of stored records efficiently. Additionally, one should consider a phenomenon appearing in data streams called concept drift, which means that the parameters of the model in use change over time, which can dramatically decrease the quality of the analytical model. This work focuses on the classification task, which is common in practical applications such as fraud detection, network security, and medical diagnosis. We propose detecting changes in the data stream with a combined concept drift detection model. The experimental evaluation shows that this is an interesting direction, which encourages us to use it in practical applications.
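To make the idea of a combined drift detector concrete, the following minimal sketch may help. It is an illustration under stated assumptions, not the method evaluated in this work: the base detector is a simplified error-rate monitor in the spirit of DDM, the combination rule is a plain majority vote, and all names (`ErrorRateDetector`, `first_ensemble_drift`, the thresholds) are hypothetical.

```python
import math


class ErrorRateDetector:
    """Simplified error-rate monitor (DDM-like); an assumed base detector."""

    def __init__(self, drift_level=3.0, min_samples=30):
        self.drift_level = drift_level    # hypothetical sensitivity setting
        self.min_samples = min_samples    # warm-up length before any signal
        self.n = 0
        self.errors = 0
        self.best = float("inf")          # smallest p + s observed so far
        self.best_p = 0.0
        self.best_s = 0.0

    def update(self, error):
        """Feed 1 for a misclassified example, 0 otherwise; True means drift."""
        self.n += 1
        self.errors += error
        p = self.errors / self.n                 # running error rate
        s = math.sqrt(p * (1.0 - p) / self.n)    # its standard deviation
        if self.n < self.min_samples:
            return False
        if p + s < self.best:
            self.best, self.best_p, self.best_s = p + s, p, s
        return p + s > self.best_p + self.drift_level * self.best_s


def first_ensemble_drift(error_stream, detectors, min_votes):
    """Return the index at which at least min_votes detectors signal drift
    on the same example, or None if that never happens."""
    for i, err in enumerate(error_stream):
        votes = sum(d.update(err) for d in detectors)  # every detector is updated
        if votes >= min_votes:
            return i
    return None


# Synthetic 0/1 error stream: 10% errors, then 50% errors from index 500.
stream = [1 if i % 10 == 9 else 0 for i in range(500)]
stream += [1 if i % 2 == 1 else 0 for i in range(500, 1000)]

ensemble = [ErrorRateDetector(level) for level in (2.5, 3.0, 3.5)]
print("ensemble drift signalled at index:",
      first_ensemble_drift(stream, ensemble, min_votes=2))
```

Majority voting is only one possible combination rule; requiring a single vote would make the ensemble more sensitive, while requiring all votes would make it more conservative.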
Keywords: Data stream, Concept drift, Pattern classification, Drift detector
This work was supported by the statutory funds of the Department of Systems and Computer Networks, Faculty of Electronics, Wroclaw University of Science and Technology and by the Polish National Science Centre under the grant no. DEC-2013/09/B/ST6/02264. All computer experiments were carried out using computer equipment sponsored by EC under FP7, Coordination and Support Action, Grant Agreement Number 316097, ENGINE European Research Centre of Network Intelligence for Innovation Enhancement (http://engine.pwr.edu.pl/).