Skip to main content
Log in

Improved credit card churn prediction based on rough clustering and supervised learning techniques

  • Published:
Cluster Computing Aims and scope Submit manuscript


Every process is digitized in the current society. Transfer of money from one account holder to another has become possible in seconds because of advanced technologies in information processing. Not only this in all sectors like railways, insurance, health sector, fashion technology, education sector, sales and business sectors, and advertisement sectors every firm digitized its operations. One such sector is banking where every individual based on his or her financial status will be considered for crediting loan, and credit card etc. If the credit score of the loan availing person is high banks will be ready to provide him with the loan but the availing person can opt for any one of the banks on his or her own willing. Such scenario happens in credit card churn prediction also. Hence the banks should take healthy measures to retain the existing credit card holders without any churn. Withholding existing customers of a firm plays an important role to increase the overall revenue of the firm and retains the good name of the firm in competitive market. Hence every organization takes key measures to withhold existing customers using customer management models. Because customer retention is a crucial task as it reduces the time, money and workforce needed for adding new customers to the firm. Customers retention technique in credit card churn prediction (C3P) was done using only supervised classification techniques. But it could not end with better results. So, through many proven hybrid classification techniques we can bring better accuracy in C3P. Also C3P lags in highly efficient techniques like rough set theory. Hence in this work initially we perform data processing techniques and in second stage we propose modified rough K-means algorithm used for clustering credit card holders and in next stage hold-out method divides the cluster data into testing and training clusters. At last classification is performed using various algorithms like support vector machine, random forest, decision tree, K-nearest neighbor, and Naive Bayes. Finally we evaluate the work using precision, recall (sensitivity), specification, accuracy, and misclassification error.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others


  1. Yeh, I.C., et al.: The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Syst. Appl. 36, 2473–2480 (2009)

    Article  Google Scholar 

  2. Oyeniyi, A.O., Adeyemo, A.B., Oyeniyi, A.O., Adeyemo, A.B.: Customer churn analysis in banking sector using data mining techniques. Afr J. of Comp and ICTs. 8(3), 165–174 (2015)

    Google Scholar 

  3. Farquad, M. A., et al.: Data mining using rules extracted from SVM: an application to churn prediction in bank credit cards. In: RSFDGrC ’09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. pp. 390–397 (2009)

  4. Nie, G., et al.: Credit card churns forecasting by logistic regression and decision tree. Expert Syst. Appl. 38, 15273–15285 (2011)

    Article  Google Scholar 

  5. Wang, N., Niu, D. X.: Credit card customer churn prediction based on the RST and LS-SVM. IEEE conference on Service and systems and Service Management (2009)

  6. Zhao, J., et al.: Bank customer churn prediction based on support vector machine: taking a commercial bank’s VIP customer churn. IEEE Conference on Wireless Communication, Networking and Mobile computing (2008)

  7. Zhao, Y., et al.: Customer churn prediction using improved one-class support vector machine. DMA, LNAI 3584, pp. 300–306. Springer-Verlag Berlin Heidelberg, Berlin (2005)

    Google Scholar 

  8. Shao, H., et al.: Construction of Bayesian classifiers with GA for predicting customer retention. IEEE Conference on Natural Computation (2005)

  9. Huang, Ying, Huang, Y., Kechadi, T.: An effective hybrid learning system for telecommunication churns Prediction. J. Expert Syst. Appl. 40, 5635–5647 (2013)

    Article  Google Scholar 

  10. Bose, I., Chen, X.: Hybrid models using unsupervised clustering for prediction of customer churn. J. Organ. Comput. and Electron. Commer. 19, 133–151 (2009)

    Google Scholar 

  11. Huang, B.Q., Kechadi, M.T., Buckley, B.: Customer churns prediction in telecommunications. J. Expert Syst. Appl. 39(1), 1414–1425 (2012)

    Article  Google Scholar 

  12. Khashei, M., Hamadani, A.Z., Bijari, M.: A novel hybrid classification model of artificial neural networks and multiple linear regression models. J. Expert Syst. Appl. 39(3), 2606–2620 (2012)

    Article  Google Scholar 

  13. Tsai, C.F., Lu, Y.H.: Customer churns prediction by hybrid neural networks. J. Expert Syst. Appl. 36(10), 12547–12553 (2009)

    Article  Google Scholar 

  14. Yeshwanth, V, Raj, V.V., Saravanan, M.: Evolutionary churn prediction in mobile networks using hybrid learning. In: Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, (FLAIRS), Palm Beach, Florida, USA, May 18–20. AAAI Press (2011)

  15. Hung, S.Y., et al.: Applying data mining to telecom churn management. J. Expert Syst. Appl. 31(3), 515–524 (2006)

    Article  Google Scholar 

  16. Zhang, Y., et al .: A hybrid KNN-LR classifier and its application in customer churn prediction IEEE conference on system, man and cybernetics (2007)

  17. Lee et al,.: Customer churn prediction by hybrid model. Advance data mining and application Springer (2006)

  18. Kawale, J., Pal, A., Srivastava, J.: Churn prediction in MMORPGs: a social influence based approach. Proc. 2009 Int. Conf. Comput. Sci. Eng. 4, 423–428 (2009)

    Article  Google Scholar 

  19. Suznjevic, M., Stupar, I., Matijasevic, M.: MMORPG player behavior model based on player action categories. Proceedings of the 10th Annual Workshop on Network and Systems Support for Games. IEEE Press (2011)

  20. Chen, K. T., Lei, C. L., Network game design: hints and implications of player interaction. In: Proceedings of 5th ACM SIGCOMM Workshop on Network and System Support for Games. pp. 1–9 ( 2006)

  21. Soeini, R.A., Rodpysh, K.V.: Applying data mining to insurance customer churn management. Int. Proc. Comput. Sci. Inf. Technol. 30, 82–92 (2012)

    Google Scholar 

  22. Verbeke, W., Martens, D., Baesens, B.: Social network analysis for customer churn prediction. Appl. Soft Comput. 14, 431–446 (2014)

    Article  Google Scholar 

  23. Malik, Z.K., Hussain, A., Jonathan, W.: An online generalized eigenvalue version of Laplacian Eigenmaps for visual big data. Neurocomputing 173, 127–136 (2016)

    Article  Google Scholar 

  24. Anil Kumar, D., Ravi, V.: Predicting credit card customer churn in banks using data mining. Int. J. Data Anal. Tech. Strateg. 1(1), 4–28 (2008)

    Article  Google Scholar 

  25. Nie, G., et al.: Finding the hidden pattern of credit card holders churn: a case of China, ICCS 2009 part II. LNCS 5545(1), 561–569 (2009)

    Google Scholar 

  26. Wang, G., et al.: Predicting credit card holder churn in banks of China using data mining and MCDM. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (2010)

  27. Chiang, D.A., et al.: Goal-oriented sequential pattern for network banking churn analysis. Expert Syst. Appl. 25, 293–302 (2003)

    Article  Google Scholar 

  28. Jinbo, S., et al.: The application of AdaBoost in customer churn prediction. IEEE Conference on Service and Systems and Service Management (2007)

  29. Mutanen, T., et al.: Customer churn prediction: a case study in banking. ECML/PKDD 2006 Workshop on Practical Data Mining: Applications, Experiences and Challenges (2010)

  30. Olle, G., Hybrid, A.: Churn prediction model in mobile telecommunication industry. Int. J. e-Educ. e-Bus. e-Manag. Learn. 4(1), 55–62 (2014)

    Google Scholar 

  31. Peters, G.: Some refinements of rough k-means clustering. Pattern Recognit. 39, 1481–1491 (2006)

    Article  MATH  Google Scholar 

  32. Kim, K., Jun, C.-H., Lee, J.: Improved churn prediction in telecommunication industry by analyzing a large network. Expert Syst. Appl. 41(15), 6575–6584 (2014)

    Article  Google Scholar 

  33. Dataset.

  34. Lingras, P., West, C.: Interval set clustering of web users with rough k-means. J. Intell. Inform. Syst. 23, 5–16 (2004)

    Article  MATH  Google Scholar 

  35. Kumar, P.: Comparative study of K-means, pam and rough K-means algorithms using cancer datasets. 2009 International Symposium on Computing, Communication, and Control (ISCCC 2009)

  36. Lingras, P., West, C.: Interval set clustering of web users with rough kmeans, Technical Report 2002–002. St. Marys University, Halifax, Canada, Department of Mathematics and Computer Science (2002)

  37. Parmar, D., et al.: MMR: an algorithm for clustering categorical data using rough set theory. Data Knowl. Eng. 63(2007), 879–893 (2007)

    Article  Google Scholar 

  38. Amin, A., et al.: Churn prediction in telecommunication industry using rough set approach. New Trends in Computational Collective Intelligence, pp. 83–95. Springer International Publishing, Berlin (2015)

    Google Scholar 

  39. Amin, A., et al.: Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing 237, 242–254 (2017)

    Article  Google Scholar 

  40. Data Source Link.

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to J. Manokaran.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rajamohamed, R., Manokaran, J. Improved credit card churn prediction based on rough clustering and supervised learning techniques. Cluster Comput 21, 65–77 (2018).

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: