Abstract
The paper discusses an algorithm that groups the items on the basis of their attributes and then classifies the clusters. In other words, the proposed algorithms first cluster the items on the basis of property, i.e., attributes available for the dataset. The clustering is performed by K-means clustering. Then this clustered data is classified using the RepTree. In other words, the proposed algorithm is the hybrid algorithm of K-means clustering and the RepTree classification. The proposed algorithm is compared with the RepTree algorithm using the WEKA tool. The comparison is done over clothing dataset downloaded from Internet. The proposed algorithm decreases the mean absolute error as well as the root-mean-square error. The decrease in error results in accurate classification. So the proposed algorithm clusters the items and classifies them on the basis of their attributes more accurately.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Srikant, R. Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, 1995, pp. 3–14. IEEE (1995)
Cooley, R., Tan, P.N., Srivastava, J.: Discovery of interesting usage patterns from web data. In: Web Usage Analysis and User Profiling, pp. 163–182. Springer, Berlin, Heidelberg (2000)
Sanchati, R., Patidar, P.C., Kulkarni, G. Path breaking case studies in E-commerce using data mining. Int. J. Comput. Technol. Electron. Eng. 1 (2011)
Mohamed, W., Salleh, M.N.M., Omar, A.H.: A comparative study of reduced error pruning method in decision tree algorithms. In: 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE), pp. 392–397. IEEE, Nov 2012
Zontul, M., Dogan, G., Aydin, F., Sener, S., Kaynar, O.: Wind speed forecasting using Reptree and bagging methods in Kirklareli-Turkey. J. Theor. Appl. Inf. Technol. 56(1) (2013)
Witten, I.H., Frank. E.: Data mining: practical machine learning tools and techniques. 2nd ed. The United States of America, Morgan Kaufmann Series in Data Management Systems (2005)
Alpaydın, E.: Introduction to Machine Learning. The MIT Press, Printed and Bound in the United States of America. ISBN: 0-262-01211-1 (2004)
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. Adv. Knowl. Discovery Data Min. 12(1), 307–328 (1996)
Sharma (Sachdeva), R.: K-means clustering in spatial data mining using WEKA interface. In: International Conference on Advances in Communication and Computing Technologies (ICACACT) 2012 Proceedings. International Journal of Computer Applications® (IJCA) (2014)
Ismail, M., Kamel, M.: Multidimensional data clustering utilization hybrid search strategies. Pattern Recogn. 22(1), 75–89 (1989)
Pena, J.M., Lozano, J.A., Larranaga, P.: An empirical comparison of four initialization methods for the k-means algorithm. Pattern Recogn. Lett. 20(10), 1027–1040 (1999)
Jain, S.: K-means clustering using WEKA interface. In: Proceedings of the 4th National Conference; INDIACom-2010 Computing for Nation Development, 25–26 Feb 2010
Pahl, C.: Data mining technology for the evaluation of learning content interaction. Int. J. E-Learn. 3(4), 47 (2004)
Sharma, N., Bajpai, A., Litoriya, R. Comparison the various clustering algorithms of WEKA tools. Int. J. Emerg. Technol. Adv. Eng. 2(5) (2012)
Shrivastava, V., Narayan Arya, P. A study of various clustering algorithms on retail sales data. Int. J. Comput. Commun. Netw. 1(2) (2012)
Quinlan, J.R.: Simplifying decision trees. Int. J. Man Mach. Stud. 27(3), 221–234 (1987)
Bhan, N.: Comparative study of EM and K-means clustering techniques in WEKA inter-face. Int. J. Adv. Technol. Eng. Res. IJATER 3(4) (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Midha, N., Singh, V. (2018). Classification of E-commerce Products Using RepTree and K-means Hybrid Approach. In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore. https://doi.org/10.1007/978-981-10-6620-7_26
Download citation
DOI: https://doi.org/10.1007/978-981-10-6620-7_26
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6619-1
Online ISBN: 978-981-10-6620-7
eBook Packages: EngineeringEngineering (R0)