Skip to main content

Classification of E-commerce Products Using RepTree and K-means Hybrid Approach

  • Conference paper
  • First Online:
Big Data Analytics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 654))

Abstract

The paper discusses an algorithm that groups the items on the basis of their attributes and then classifies the clusters. In other words, the proposed algorithms first cluster the items on the basis of property, i.e., attributes available for the dataset. The clustering is performed by K-means clustering. Then this clustered data is classified using the RepTree. In other words, the proposed algorithm is the hybrid algorithm of K-means clustering and the RepTree classification. The proposed algorithm is compared with the RepTree algorithm using the WEKA tool. The comparison is done over clothing dataset downloaded from Internet. The proposed algorithm decreases the mean absolute error as well as the root-mean-square error. The decrease in error results in accurate classification. So the proposed algorithm clusters the items and classifies them on the basis of their attributes more accurately.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal, R., Srikant, R. Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, 1995, pp. 3–14. IEEE (1995)

    Google Scholar 

  2. Cooley, R., Tan, P.N., Srivastava, J.: Discovery of interesting usage patterns from web data. In: Web Usage Analysis and User Profiling, pp. 163–182. Springer, Berlin, Heidelberg (2000)

    Google Scholar 

  3. Sanchati, R., Patidar, P.C., Kulkarni, G. Path breaking case studies in E-commerce using data mining. Int. J. Comput. Technol. Electron. Eng. 1 (2011)

    Google Scholar 

  4. Mohamed, W., Salleh, M.N.M., Omar, A.H.: A comparative study of reduced error pruning method in decision tree algorithms. In: 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE), pp. 392–397. IEEE, Nov 2012

    Google Scholar 

  5. Zontul, M., Dogan, G., Aydin, F., Sener, S., Kaynar, O.: Wind speed forecasting using Reptree and bagging methods in Kirklareli-Turkey. J. Theor. Appl. Inf. Technol. 56(1) (2013)

    Google Scholar 

  6. Witten, I.H., Frank. E.: Data mining: practical machine learning tools and techniques. 2nd ed. The United States of America, Morgan Kaufmann Series in Data Management Systems (2005)

    Google Scholar 

  7. Alpaydın, E.: Introduction to Machine Learning. The MIT Press, Printed and Bound in the United States of America. ISBN: 0-262-01211-1 (2004)

    Google Scholar 

  8. Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. Adv. Knowl. Discovery Data Min. 12(1), 307–328 (1996)

    Google Scholar 

  9. Sharma (Sachdeva), R.: K-means clustering in spatial data mining using WEKA interface. In: International Conference on Advances in Communication and Computing Technologies (ICACACT) 2012 Proceedings. International Journal of Computer Applications® (IJCA) (2014)

    Google Scholar 

  10. Ismail, M., Kamel, M.: Multidimensional data clustering utilization hybrid search strategies. Pattern Recogn. 22(1), 75–89 (1989)

    Article  MATH  Google Scholar 

  11. Pena, J.M., Lozano, J.A., Larranaga, P.: An empirical comparison of four initialization methods for the k-means algorithm. Pattern Recogn. Lett. 20(10), 1027–1040 (1999)

    Article  Google Scholar 

  12. Jain, S.: K-means clustering using WEKA interface. In: Proceedings of the 4th National Conference; INDIACom-2010 Computing for Nation Development, 25–26 Feb 2010

    Google Scholar 

  13. Pahl, C.: Data mining technology for the evaluation of learning content interaction. Int. J. E-Learn. 3(4), 47 (2004)

    Google Scholar 

  14. Sharma, N., Bajpai, A., Litoriya, R. Comparison the various clustering algorithms of WEKA tools. Int. J. Emerg. Technol. Adv. Eng. 2(5) (2012)

    Google Scholar 

  15. Shrivastava, V., Narayan Arya, P. A study of various clustering algorithms on retail sales data. Int. J. Comput. Commun. Netw. 1(2) (2012)

    Google Scholar 

  16. Quinlan, J.R.: Simplifying decision trees. Int. J. Man Mach. Stud. 27(3), 221–234 (1987)

    Article  Google Scholar 

  17. Bhan, N.: Comparative study of EM and K-means clustering techniques in WEKA inter-face. Int. J. Adv. Technol. Eng. Res. IJATER 3(4) (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Neha Midha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Midha, N., Singh, V. (2018). Classification of E-commerce Products Using RepTree and K-means Hybrid Approach. In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore. https://doi.org/10.1007/978-981-10-6620-7_26

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6620-7_26

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6619-1

  • Online ISBN: 978-981-10-6620-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics