Overview of One-Pass and Discard-After-Learn Concepts for Classification and Clustering in Streaming Environment with Constraints
With the advancement of internet technology and sensor networks, tremendous amount of data have been generated beyond our imagination. These data contain valuable and possibly relevant information for various fields of applications. Learning these data online by using current neural learning techniques is not so simple due to many technical constraints including data overflow, uncontrollable learning epochs, arbitrary class drift, and dynamic imbalanced class ratio. Recently, we have been attempted to tackle this neural learning problem under the non-stationary environment. In this article, we summarize the new concept of One-Pass-Learning-and-Discard and also new structures, called Malleable Hyper-ellipsoid and Hyper-cylinder, of neural network recently introduced to cope with supervised as well as unsupervised learning under the constraints of data overflow, preserving polynomial time and space complexities of learning process, arbitrary class drift, life of data, and dynamic imbalanced class ratio. Both structures are rotatable, transposable, and expandable according to the distribution and location of data cluster.
This work is supported by Thailand Research Fund under grant number RTA6080013.
- 2.Aggarwal, C. C., Han, J., Wang, J., & Yu, P. S. (2003). A framework for clustering evolving data streams. In Proceedings of the 29th International Conference on Very Large Data Bases (pp. 81–92).Google Scholar
- 4.Cao, F., Ester, M., Qian, W., & Zhou, A. (2006). Density-based clustering over an evolving data stream with noise. In SIAM International Conference on Data Mining (pp. 328–339).Google Scholar
- 5.Ditzler, G., Rosen, G., & Polikar, R. (2014, July). Domain adaptation bounds for multiple expert systems under concept drift. In Proceedings of the International Joint Conference on Neural Networks (pp. 595–601).Google Scholar
- 8.Hahsler, M., & Dunham, M. H. (2010). rEMM: Extensible Markov model for data stream clustering in r. Journal of Statistical Software, 35(5).Google Scholar