Skip to main content
Log in

A proposed hybrid clustering algorithm using K-means and BIRCH for cluster based cab recommender system (CBCRS)

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

An efficient Cluster Based Cab Recommender System (CBCRS) assists the cab drivers with the recommendations about passenger pickup location available at the shortest distance from him. To recommend drivers about the passenger pickup location, one need to group the Global Positioning System (GPS) coordinates of several pickup points of the same geographic region. The GPS coordinates of cab pick-up points are unsupervised data. Clustering of unsupervised cab dataset is troublesome since cab dataset is a large database and clustering techniques when applied on such large datasets do not generate good clusters for GPS datapoints. Therefore, this research paper proposes an improved hybrid clustering algorithm which combines the features of Partition-based clustering and Hierarchical Based Clustering techniques. Thus, the objectives of the research paper are four folds: firstly, the research paper identifies various clustering techniques to cluster GPS Coordinates. Secondly, to design and develop an improved hybrid clustering algorithm for CBCRS. Thirdly, the research paper compares the clusters formed by the proposed algorithm with standard K-Means and Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH) using three datasets over Silhouette Coefficient and Calinski-Harabasz Score. Finally, the paper concludes and analyses the results of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Alhijawi B, Obeid N, Awajan A et al (2022) New hybrid semantic-based collaborative filtering recommender systems. Int j inf tecnol. https://doi.org/10.1007/s41870-022-01011-x

    Article  Google Scholar 

  2. Ankerst M, Breunig M, Kriegel HP, Sander J (1999) OPTICS: Ordering points to identify the clustering structure. Sigmod Record 28:49–60. https://doi.org/10.1145/304182.304187

  3. Bade, K., Nurnberger, A. (2006). Personalized hierarchical clustering. Proceedings of the IEEE/WIC/ACM international conference on web intelligence, (pp. 181–187)

  4. Berkhin P (2006) Survey of clustering data mining techniques. Grouping multidimensional data. Springer, Berlin, Heidelberg

  5. Chetty N, Shukla S (2014) A review ON K-means DATA. Clust Appr 4:1847–1860

    Google Scholar 

  6. Choudhury SS, Mohanty SN, Jagadev AK (2021) Multimodal trust based recommender system with machine learning approaches for movie recommendation. Int j inf tecnol 13:475–482. https://doi.org/10.1007/s41870-020-00553-2

    Article  Google Scholar 

  7. Drias H, Kechid A, Cherif F, Nadjib (2017) A hybrid clustering algorithm and web information foraging. Int J Hybr Int Syst 13:137–149. https://doi.org/10.3233/HIS-160231

    Article  Google Scholar 

  8. Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. International conference on knowledge discovery and data mining, Portland, OR, USA. (2: 226–231)

  9. Hartigan JA, Wong MA (1979) A k-means clustering algorithm. Appl Stat 28:100–108

    Article  MATH  Google Scholar 

  10. Hassan SI, Samad A, Ahmad O et al (2020) Partitioning and hierarchical based clustering: a comparative empirical assessment on internal and external indices, accuracy, and time. Int j inf tecnol 12:1377–1384. https://doi.org/10.1007/s41870-019-00406-7

    Article  Google Scholar 

  11. H Malik, N Laghari, DM Sangrasi and ZA Dayo (2018) Comparative analysis of hybrid clustering algorithm on different dataset. 2018 8th international conference on electronics information and emergency communication (ICEIEC) 25–30 https://doi.org/10.1109/ICEIEC.2018.8473568

  12. Jain, R. (2012) A hybrid clustering algorithm for data mining. ArXiv, abs/1205.5353

  13. Mahmood M, Ahmed AEA, Karim G, Olfa H (2021) A hybrid clustering technique to propose the countries for HELP international India. J Comp Sci Eng 12:306–314

    Google Scholar 

  14. Mann SK, Chawla S (2020) Clustering based algorithmic design for cab recommender system (CRS). In: Fong S, Dey N, Joshi A (eds) ICT Analysis and Applications. Lecture Notes in Networks and Systems, vol 93. Springer, Singapore. https://doi.org/10.1007/978-981-15-0630-7_35

  15. Manning CD, Raghavan P, Sch¨utze H (2009) Introduction to Information Retrieval (Hierarchical clustering). Cambridge university press (346–368) ISBN 978-0-521-86571-5

  16. Mazimpaka J, Timpf S (2016) Trajectory data mining: a review of methods and applications. J Spat Inform Sci 13:61–99

    Google Scholar 

  17. M Ghazizadeh-Ahsaee and A Shamsadini-Farsangi (2020) Developing of a new hybrid clustering algorithm based on density. 2020 6th international conference on web research (ICWR) 146–151 https://doi.org/10.1109/ICWR49608.2020.9122309

  18. Murtagh F (1983) A survey of recent advances in hierarchical clustering algorithms. Comp J 26(4):354–359

    Article  MATH  Google Scholar 

  19. Pedregosa et al (2011) Scikit-learn: machine learning in python. JMLR 12:2825–2830

    MathSciNet  MATH  Google Scholar 

  20. Ricci F, Rokach L, Shapira B (2011) Introduction to recommender systems handbook. In: Ricci F, Rokach L, Shapira B, Kantor P (eds) Recommender Systems Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-85820-3_1

  21. Vijayarani S, Jothi P (2016) A hybrid clustering algorithm for outlier detection in data streams. Int J Grid Distrib Comp 9:285–296

    Article  Google Scholar 

  22. Wang R, Chow C, Lyu Y, Victor C, Kwong S, Li Y, Zeng J (2017) TaxiRec: recommending road clusters to taxi drivers using ranking-based extreme learning machines. IEEE Trans Knowl Data Eng 11:101109

    Google Scholar 

  23. Liu X, Yu S, Moreau Y, De Moor B, Glänzel W, Janssens F (2009) Hybrid clustering of text mining and bibliometrics applied to journal sets. Proceedings of the 2009 SIAM international conference on data mining (SDM) 49–60. https://doi.org/10.1137/1.9781611972795.5

  24. Xu D, Tian Y (2015) A comprehensive survey of clustering algorithms. Ann Data Sci 2:165–193. https://doi.org/10.1007/s40745-015-0040-1

    Article  Google Scholar 

  25. Yadav RS (2020) Application of hybrid clustering methods for student performance evaluation. Int j inf tecnol 12:749–756. https://doi.org/10.1007/s41870-018-0192-2

    Article  Google Scholar 

  26. Yuan NJ, Zheng Y, Zhang L and Xie X (2013) T-Finder: A recommender system for finding passengers and vacant taxis. IEEE Trans Knowl Data Eng 25:2390–2403. https://doi.org/10.1109/TKDE.2012.153

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Supreet Kaur Mann.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mann, S.K., Chawla, S. A proposed hybrid clustering algorithm using K-means and BIRCH for cluster based cab recommender system (CBCRS). Int. j. inf. tecnol. 15, 219–227 (2023). https://doi.org/10.1007/s41870-022-01113-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-022-01113-6

Keywords

Navigation