Classification of E-commerce Products Using RepTree and K-means Hybrid Approach

Midha, Neha; Singh, Vikram

doi:10.1007/978-981-10-6620-7_26

Neha Midha¹⁷ &
Vikram Singh¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 654))

4002 Accesses
4 Citations

Abstract

The paper discusses an algorithm that groups the items on the basis of their attributes and then classifies the clusters. In other words, the proposed algorithms first cluster the items on the basis of property, i.e., attributes available for the dataset. The clustering is performed by K-means clustering. Then this clustered data is classified using the RepTree. In other words, the proposed algorithm is the hybrid algorithm of K-means clustering and the RepTree classification. The proposed algorithm is compared with the RepTree algorithm using the WEKA tool. The comparison is done over clothing dataset downloaded from Internet. The proposed algorithm decreases the mean absolute error as well as the root-mean-square error. The decrease in error results in accurate classification. So the proposed algorithm clusters the items and classifies them on the basis of their attributes more accurately.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R., Srikant, R. Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, 1995, pp. 3–14. IEEE (1995)
Google Scholar
Cooley, R., Tan, P.N., Srivastava, J.: Discovery of interesting usage patterns from web data. In: Web Usage Analysis and User Profiling, pp. 163–182. Springer, Berlin, Heidelberg (2000)
Google Scholar
Sanchati, R., Patidar, P.C., Kulkarni, G. Path breaking case studies in E-commerce using data mining. Int. J. Comput. Technol. Electron. Eng. 1 (2011)
Google Scholar
Mohamed, W., Salleh, M.N.M., Omar, A.H.: A comparative study of reduced error pruning method in decision tree algorithms. In: 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE), pp. 392–397. IEEE, Nov 2012
Google Scholar
Zontul, M., Dogan, G., Aydin, F., Sener, S., Kaynar, O.: Wind speed forecasting using Reptree and bagging methods in Kirklareli-Turkey. J. Theor. Appl. Inf. Technol. 56(1) (2013)
Google Scholar
Witten, I.H., Frank. E.: Data mining: practical machine learning tools and techniques. 2nd ed. The United States of America, Morgan Kaufmann Series in Data Management Systems (2005)
Google Scholar
Alpaydın, E.: Introduction to Machine Learning. The MIT Press, Printed and Bound in the United States of America. ISBN: 0-262-01211-1 (2004)
Google Scholar
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. Adv. Knowl. Discovery Data Min. 12(1), 307–328 (1996)
Google Scholar
Sharma (Sachdeva), R.: K-means clustering in spatial data mining using WEKA interface. In: International Conference on Advances in Communication and Computing Technologies (ICACACT) 2012 Proceedings. International Journal of Computer Applications^® (IJCA) (2014)
Google Scholar
Ismail, M., Kamel, M.: Multidimensional data clustering utilization hybrid search strategies. Pattern Recogn. 22(1), 75–89 (1989)
Article MATH Google Scholar
Pena, J.M., Lozano, J.A., Larranaga, P.: An empirical comparison of four initialization methods for the k-means algorithm. Pattern Recogn. Lett. 20(10), 1027–1040 (1999)
Article Google Scholar
Jain, S.: K-means clustering using WEKA interface. In: Proceedings of the 4th National Conference; INDIACom-2010 Computing for Nation Development, 25–26 Feb 2010
Google Scholar
Pahl, C.: Data mining technology for the evaluation of learning content interaction. Int. J. E-Learn. 3(4), 47 (2004)
Google Scholar
Sharma, N., Bajpai, A., Litoriya, R. Comparison the various clustering algorithms of WEKA tools. Int. J. Emerg. Technol. Adv. Eng. 2(5) (2012)
Google Scholar
Shrivastava, V., Narayan Arya, P. A study of various clustering algorithms on retail sales data. Int. J. Comput. Commun. Netw. 1(2) (2012)
Google Scholar
Quinlan, J.R.: Simplifying decision trees. Int. J. Man Mach. Stud. 27(3), 221–234 (1987)
Article Google Scholar
Bhan, N.: Comparative study of EM and K-means clustering techniques in WEKA inter-face. Int. J. Adv. Technol. Eng. Res. IJATER 3(4) (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Chaudhary Devi Lal University, Sirsa, Haryana, India
Neha Midha
Department of Computer Science & Applications, Chaudhary Devi Lal University, Sirsa, Haryana, India
Vikram Singh

Authors

Neha Midha
View author publications
You can also search for this author in PubMed Google Scholar
Vikram Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Neha Midha .

Editor information

Editors and Affiliations

Jagan Institute of Management Studies, New Delhi, Delhi, India
V. B. Aggarwal
Department of Computer Science, University of Delhi, New Delhi, Delhi, India
Vasudha Bhatnagar
Microsoft Innovation Centre, Sri Aurobindo Institute of Technology, Indore, Madhya Pradesh, India
Durgesh Kumar Mishra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Midha, N., Singh, V. (2018). Classification of E-commerce Products Using RepTree and K-means Hybrid Approach. In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore. https://doi.org/10.1007/978-981-10-6620-7_26

Download citation

DOI: https://doi.org/10.1007/978-981-10-6620-7_26
Published: 04 October 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6619-1
Online ISBN: 978-981-10-6620-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics