Abstract
An efficient Cluster Based Cab Recommender System (CBCRS) assists the cab drivers with the recommendations about passenger pickup location available at the shortest distance from him. To recommend drivers about the passenger pickup location, one need to group the Global Positioning System (GPS) coordinates of several pickup points of the same geographic region. The GPS coordinates of cab pick-up points are unsupervised data. Clustering of unsupervised cab dataset is troublesome since cab dataset is a large database and clustering techniques when applied on such large datasets do not generate good clusters for GPS datapoints. Therefore, this research paper proposes an improved hybrid clustering algorithm which combines the features of Partition-based clustering and Hierarchical Based Clustering techniques. Thus, the objectives of the research paper are four folds: firstly, the research paper identifies various clustering techniques to cluster GPS Coordinates. Secondly, to design and develop an improved hybrid clustering algorithm for CBCRS. Thirdly, the research paper compares the clusters formed by the proposed algorithm with standard K-Means and Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH) using three datasets over Silhouette Coefficient and Calinski-Harabasz Score. Finally, the paper concludes and analyses the results of the proposed algorithm.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-022-01113-6/MediaObjects/41870_2022_1113_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-022-01113-6/MediaObjects/41870_2022_1113_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-022-01113-6/MediaObjects/41870_2022_1113_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-022-01113-6/MediaObjects/41870_2022_1113_Fig4_HTML.png)
Similar content being viewed by others
References
Alhijawi B, Obeid N, Awajan A et al (2022) New hybrid semantic-based collaborative filtering recommender systems. Int j inf tecnol. https://doi.org/10.1007/s41870-022-01011-x
Ankerst M, Breunig M, Kriegel HP, Sander J (1999) OPTICS: Ordering points to identify the clustering structure. Sigmod Record 28:49–60. https://doi.org/10.1145/304182.304187
Bade, K., Nurnberger, A. (2006). Personalized hierarchical clustering. Proceedings of the IEEE/WIC/ACM international conference on web intelligence, (pp. 181–187)
Berkhin P (2006) Survey of clustering data mining techniques. Grouping multidimensional data. Springer, Berlin, Heidelberg
Chetty N, Shukla S (2014) A review ON K-means DATA. Clust Appr 4:1847–1860
Choudhury SS, Mohanty SN, Jagadev AK (2021) Multimodal trust based recommender system with machine learning approaches for movie recommendation. Int j inf tecnol 13:475–482. https://doi.org/10.1007/s41870-020-00553-2
Drias H, Kechid A, Cherif F, Nadjib (2017) A hybrid clustering algorithm and web information foraging. Int J Hybr Int Syst 13:137–149. https://doi.org/10.3233/HIS-160231
Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. International conference on knowledge discovery and data mining, Portland, OR, USA. (2: 226–231)
Hartigan JA, Wong MA (1979) A k-means clustering algorithm. Appl Stat 28:100–108
Hassan SI, Samad A, Ahmad O et al (2020) Partitioning and hierarchical based clustering: a comparative empirical assessment on internal and external indices, accuracy, and time. Int j inf tecnol 12:1377–1384. https://doi.org/10.1007/s41870-019-00406-7
H Malik, N Laghari, DM Sangrasi and ZA Dayo (2018) Comparative analysis of hybrid clustering algorithm on different dataset. 2018 8th international conference on electronics information and emergency communication (ICEIEC) 25–30 https://doi.org/10.1109/ICEIEC.2018.8473568
Jain, R. (2012) A hybrid clustering algorithm for data mining. ArXiv, abs/1205.5353
Mahmood M, Ahmed AEA, Karim G, Olfa H (2021) A hybrid clustering technique to propose the countries for HELP international India. J Comp Sci Eng 12:306–314
Mann SK, Chawla S (2020) Clustering based algorithmic design for cab recommender system (CRS). In: Fong S, Dey N, Joshi A (eds) ICT Analysis and Applications. Lecture Notes in Networks and Systems, vol 93. Springer, Singapore. https://doi.org/10.1007/978-981-15-0630-7_35
Manning CD, Raghavan P, Sch¨utze H (2009) Introduction to Information Retrieval (Hierarchical clustering). Cambridge university press (346–368) ISBN 978-0-521-86571-5
Mazimpaka J, Timpf S (2016) Trajectory data mining: a review of methods and applications. J Spat Inform Sci 13:61–99
M Ghazizadeh-Ahsaee and A Shamsadini-Farsangi (2020) Developing of a new hybrid clustering algorithm based on density. 2020 6th international conference on web research (ICWR) 146–151 https://doi.org/10.1109/ICWR49608.2020.9122309
Murtagh F (1983) A survey of recent advances in hierarchical clustering algorithms. Comp J 26(4):354–359
Pedregosa et al (2011) Scikit-learn: machine learning in python. JMLR 12:2825–2830
Ricci F, Rokach L, Shapira B (2011) Introduction to recommender systems handbook. In: Ricci F, Rokach L, Shapira B, Kantor P (eds) Recommender Systems Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-85820-3_1
Vijayarani S, Jothi P (2016) A hybrid clustering algorithm for outlier detection in data streams. Int J Grid Distrib Comp 9:285–296
Wang R, Chow C, Lyu Y, Victor C, Kwong S, Li Y, Zeng J (2017) TaxiRec: recommending road clusters to taxi drivers using ranking-based extreme learning machines. IEEE Trans Knowl Data Eng 11:101109
Liu X, Yu S, Moreau Y, De Moor B, Glänzel W, Janssens F (2009) Hybrid clustering of text mining and bibliometrics applied to journal sets. Proceedings of the 2009 SIAM international conference on data mining (SDM) 49–60. https://doi.org/10.1137/1.9781611972795.5
Xu D, Tian Y (2015) A comprehensive survey of clustering algorithms. Ann Data Sci 2:165–193. https://doi.org/10.1007/s40745-015-0040-1
Yadav RS (2020) Application of hybrid clustering methods for student performance evaluation. Int j inf tecnol 12:749–756. https://doi.org/10.1007/s41870-018-0192-2
Yuan NJ, Zheng Y, Zhang L and Xie X (2013) T-Finder: A recommender system for finding passengers and vacant taxis. IEEE Trans Knowl Data Eng 25:2390–2403. https://doi.org/10.1109/TKDE.2012.153
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mann, S.K., Chawla, S. A proposed hybrid clustering algorithm using K-means and BIRCH for cluster based cab recommender system (CBCRS). Int. j. inf. tecnol. 15, 219–227 (2023). https://doi.org/10.1007/s41870-022-01113-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-022-01113-6