Abstract
In this paper a family of algorithms for the online learning and classification is considered. These algorithms work in rounds, where at each round a new instance is given and the algorithm makes a prediction. After the true class of the instance is revealed, the learning algorithm updates its internal hypothesis. The proposed algorithms are based on fuzzy C-means clustering followed by calculation of distances between cluster centroids and the incoming instance for which the class label is to be predicted. Simple distance-based classifiers thus obtained serve as basic classifiers for the implemented rotation forest kernel. The proposed approach is validated experimentally. Experiment results show that proposed classifiers perform well against competitive approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, School of Information and Computer Science (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html
Bertini, J.R., Zhao, L., Lopes, A.: An incremental learning algorithm based on the K-associated graph for non-stationary data classification. Information Sciences 246, 52–68 (2013)
Ditzler, G., Polikar, R.: Incremental learning of concept drift from streaming imbalanced data. IEEE Transactions on Knowledge and Data Engineering 25(10), 2283–2301 (2013)
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics 3, 32–57 (1973)
Gaber, M.M., Zaslavsky, A., Krishnaswamy, S.: Mining data streams: a review. ACM SIGMOD Record 34(1), 18–26 (2005)
Gaber, M.M., Zaslavsky, A., Krishnaswamy, S.: Data Stream Mining. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, Part 6, pp. 759–787 (2010)
Gama, J., Gaber, M.M.: Learning from Data Streams. Springer, Berlin (2007)
IDA Benchmark Repository, http://mldata.org/repository/tags/data/IDA_Benchmark_Repository/ (January 12, 2013)
Jędrzejowicz, J., Jędrzejowicz, P.: Cellular GEP-Induced Classifiers. In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNCS (LNAI), vol. 6421, pp. 343–352. Springer, Heidelberg (2010)
Jędrzejowicz, J., Jędrzejowicz, P.: Rotation Forest with GEP-Induced Expression Trees. In: O’Shea, J., Nguyen, N.T., Crockett, K., Howlett, R.J., Jain, L.C. (eds.) KES-AMSTA 2011. LNCS (LNAI), vol. 6682, pp. 495–503. Springer, Heidelberg (2011)
Jędrzejowicz, J., Jędrzejowicz, P.: Online classifiers based on fuzzy C-means Clustering. In: Bǎdicǎ, C., Nguyen, N.T., Brezovan, M. (eds.) ICCCI 2013. LNCS (LNAI), vol. 8083, pp. 427–436. Springer, Heidelberg (2013)
Last, M.: Online classification of nonstationary data streams. Intelligent Data Analysis 6, 129–147 (2002)
http://moa.cms.waikato.ac.nz/datasets/ (September 03, 2013)
Murata, N., Kawanabe, N., Ziehe, A., Muller, K.R., Amari, S.: On-line learning in changing environments with application in supervised and unsupervised learning. Neural Networks 15, 743–760 (2002)
Pramod, S., Vyas, O.P.: Data Stream Mining: A Review on Windowing Approach. Global Journal of Computer Science and Technology Software & Data Engineering 12(11), 26–30 (2012)
Sung, J., Kim, D.: Adaptive acting appearance model with incremental learning. Pattern Recognition Letters 30, 359–367 (2009)
Street, W., Kim, Y.: A streaming ensemble algorithm (SEA) for large-scale classification. In: 7th International Conference on Knowledge Discovery and Data Mining, KDD 2001, San Francisco, CA, pp. 377–382 (August 2001)
Wang, L., Ji, H.-B., Jin, Y.: Fuzzy Passive-Aggressive classification: A robust and efficient algorithm for online classification problems. Information Sciences 220, 46–63 (2013)
Widmar, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Machine Learning 23, 69–101 (1996)
Wisaeng, K.: A Comparison of Different Classification Techniques for Bank Direct Marketing. International Journal of Soft Computing and Engineering 3(4), 116–119 (2013)
Žliobaitė, I.E.: Controlled Permutations for Testing Adaptive Classifiers. In: Elomaa, T., Hollmén, J., Mannila, H. (eds.) DS 2011. LNCS, vol. 6926, pp. 365–379. Springer, Heidelberg (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Jędrzejowicz, J., Jędrzejowicz, P. (2014). A Family of the Online Distance-Based Classifiers. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds) Intelligent Information and Database Systems. ACIIDS 2014. Lecture Notes in Computer Science(), vol 8398. Springer, Cham. https://doi.org/10.1007/978-3-319-05458-2_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-05458-2_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05457-5
Online ISBN: 978-3-319-05458-2
eBook Packages: Computer ScienceComputer Science (R0)