Cluster Computing

, Volume 21, Issue 1, pp 65–77 | Cite as

Improved credit card churn prediction based on rough clustering and supervised learning techniques

  • R. Rajamohamed
  • J. ManokaranEmail author


Every process is digitized in the current society. Transfer of money from one account holder to another has become possible in seconds because of advanced technologies in information processing. Not only this in all sectors like railways, insurance, health sector, fashion technology, education sector, sales and business sectors, and advertisement sectors every firm digitized its operations. One such sector is banking where every individual based on his or her financial status will be considered for crediting loan, and credit card etc. If the credit score of the loan availing person is high banks will be ready to provide him with the loan but the availing person can opt for any one of the banks on his or her own willing. Such scenario happens in credit card churn prediction also. Hence the banks should take healthy measures to retain the existing credit card holders without any churn. Withholding existing customers of a firm plays an important role to increase the overall revenue of the firm and retains the good name of the firm in competitive market. Hence every organization takes key measures to withhold existing customers using customer management models. Because customer retention is a crucial task as it reduces the time, money and workforce needed for adding new customers to the firm. Customers retention technique in credit card churn prediction (C3P) was done using only supervised classification techniques. But it could not end with better results. So, through many proven hybrid classification techniques we can bring better accuracy in C3P. Also C3P lags in highly efficient techniques like rough set theory. Hence in this work initially we perform data processing techniques and in second stage we propose modified rough K-means algorithm used for clustering credit card holders and in next stage hold-out method divides the cluster data into testing and training clusters. At last classification is performed using various algorithms like support vector machine, random forest, decision tree, K-nearest neighbor, and Naive Bayes. Finally we evaluate the work using precision, recall (sensitivity), specification, accuracy, and misclassification error.


Credit card C3P Churn K-means Modified rough K-means Decision tree Support vector machine Random forest Naive Bayes K-nearest neighbor 


  1. 1.
    Yeh, I.C., et al.: The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Syst. Appl. 36, 2473–2480 (2009)CrossRefGoogle Scholar
  2. 2.
    Oyeniyi, A.O., Adeyemo, A.B., Oyeniyi, A.O., Adeyemo, A.B.: Customer churn analysis in banking sector using data mining techniques. Afr J. of Comp and ICTs. 8(3), 165–174 (2015)Google Scholar
  3. 3.
    Farquad, M. A., et al.: Data mining using rules extracted from SVM: an application to churn prediction in bank credit cards. In: RSFDGrC ’09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. pp. 390–397 (2009)Google Scholar
  4. 4.
    Nie, G., et al.: Credit card churns forecasting by logistic regression and decision tree. Expert Syst. Appl. 38, 15273–15285 (2011)CrossRefGoogle Scholar
  5. 5.
    Wang, N., Niu, D. X.: Credit card customer churn prediction based on the RST and LS-SVM. IEEE conference on Service and systems and Service Management (2009)Google Scholar
  6. 6.
    Zhao, J., et al.: Bank customer churn prediction based on support vector machine: taking a commercial bank’s VIP customer churn. IEEE Conference on Wireless Communication, Networking and Mobile computing (2008)Google Scholar
  7. 7.
    Zhao, Y., et al.: Customer churn prediction using improved one-class support vector machine. DMA, LNAI 3584, pp. 300–306. Springer-Verlag Berlin Heidelberg, Berlin (2005)Google Scholar
  8. 8.
    Shao, H., et al.: Construction of Bayesian classifiers with GA for predicting customer retention. IEEE Conference on Natural Computation (2005)Google Scholar
  9. 9.
    Huang, Ying, Huang, Y., Kechadi, T.: An effective hybrid learning system for telecommunication churns Prediction. J. Expert Syst. Appl. 40, 5635–5647 (2013)CrossRefGoogle Scholar
  10. 10.
    Bose, I., Chen, X.: Hybrid models using unsupervised clustering for prediction of customer churn. J. Organ. Comput. and Electron. Commer. 19, 133–151 (2009)Google Scholar
  11. 11.
    Huang, B.Q., Kechadi, M.T., Buckley, B.: Customer churns prediction in telecommunications. J. Expert Syst. Appl. 39(1), 1414–1425 (2012)CrossRefGoogle Scholar
  12. 12.
    Khashei, M., Hamadani, A.Z., Bijari, M.: A novel hybrid classification model of artificial neural networks and multiple linear regression models. J. Expert Syst. Appl. 39(3), 2606–2620 (2012)CrossRefGoogle Scholar
  13. 13.
    Tsai, C.F., Lu, Y.H.: Customer churns prediction by hybrid neural networks. J. Expert Syst. Appl. 36(10), 12547–12553 (2009)CrossRefGoogle Scholar
  14. 14.
    Yeshwanth, V, Raj, V.V., Saravanan, M.: Evolutionary churn prediction in mobile networks using hybrid learning. In: Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, (FLAIRS), Palm Beach, Florida, USA, May 18–20. AAAI Press (2011)Google Scholar
  15. 15.
    Hung, S.Y., et al.: Applying data mining to telecom churn management. J. Expert Syst. Appl. 31(3), 515–524 (2006)CrossRefGoogle Scholar
  16. 16.
    Zhang, Y., et al .: A hybrid KNN-LR classifier and its application in customer churn prediction IEEE conference on system, man and cybernetics (2007)Google Scholar
  17. 17.
    Lee et al,.: Customer churn prediction by hybrid model. Advance data mining and application Springer (2006)Google Scholar
  18. 18.
    Kawale, J., Pal, A., Srivastava, J.: Churn prediction in MMORPGs: a social influence based approach. Proc. 2009 Int. Conf. Comput. Sci. Eng. 4, 423–428 (2009)CrossRefGoogle Scholar
  19. 19.
    Suznjevic, M., Stupar, I., Matijasevic, M.: MMORPG player behavior model based on player action categories. Proceedings of the 10th Annual Workshop on Network and Systems Support for Games. IEEE Press (2011)Google Scholar
  20. 20.
    Chen, K. T., Lei, C. L., Network game design: hints and implications of player interaction. In: Proceedings of 5th ACM SIGCOMM Workshop on Network and System Support for Games. pp. 1–9 ( 2006)Google Scholar
  21. 21.
    Soeini, R.A., Rodpysh, K.V.: Applying data mining to insurance customer churn management. Int. Proc. Comput. Sci. Inf. Technol. 30, 82–92 (2012)Google Scholar
  22. 22.
    Verbeke, W., Martens, D., Baesens, B.: Social network analysis for customer churn prediction. Appl. Soft Comput. 14, 431–446 (2014)CrossRefGoogle Scholar
  23. 23.
    Malik, Z.K., Hussain, A., Jonathan, W.: An online generalized eigenvalue version of Laplacian Eigenmaps for visual big data. Neurocomputing 173, 127–136 (2016)CrossRefGoogle Scholar
  24. 24.
    Anil Kumar, D., Ravi, V.: Predicting credit card customer churn in banks using data mining. Int. J. Data Anal. Tech. Strateg. 1(1), 4–28 (2008)CrossRefGoogle Scholar
  25. 25.
    Nie, G., et al.: Finding the hidden pattern of credit card holders churn: a case of China, ICCS 2009 part II. LNCS 5545(1), 561–569 (2009)Google Scholar
  26. 26.
    Wang, G., et al.: Predicting credit card holder churn in banks of China using data mining and MCDM. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (2010)Google Scholar
  27. 27.
    Chiang, D.A., et al.: Goal-oriented sequential pattern for network banking churn analysis. Expert Syst. Appl. 25, 293–302 (2003)CrossRefGoogle Scholar
  28. 28.
    Jinbo, S., et al.: The application of AdaBoost in customer churn prediction. IEEE Conference on Service and Systems and Service Management (2007)Google Scholar
  29. 29.
    Mutanen, T., et al.: Customer churn prediction: a case study in banking. ECML/PKDD 2006 Workshop on Practical Data Mining: Applications, Experiences and Challenges (2010)Google Scholar
  30. 30.
    Olle, G., Hybrid, A.: Churn prediction model in mobile telecommunication industry. Int. J. e-Educ. e-Bus. e-Manag. Learn. 4(1), 55–62 (2014)Google Scholar
  31. 31.
    Peters, G.: Some refinements of rough k-means clustering. Pattern Recognit. 39, 1481–1491 (2006)CrossRefzbMATHGoogle Scholar
  32. 32.
    Kim, K., Jun, C.-H., Lee, J.: Improved churn prediction in telecommunication industry by analyzing a large network. Expert Syst. Appl. 41(15), 6575–6584 (2014)CrossRefGoogle Scholar
  33. 33.
  34. 34.
    Lingras, P., West, C.: Interval set clustering of web users with rough k-means. J. Intell. Inform. Syst. 23, 5–16 (2004)CrossRefzbMATHGoogle Scholar
  35. 35.
    Kumar, P.: Comparative study of K-means, pam and rough K-means algorithms using cancer datasets. 2009 International Symposium on Computing, Communication, and Control (ISCCC 2009)Google Scholar
  36. 36.
    Lingras, P., West, C.: Interval set clustering of web users with rough kmeans, Technical Report 2002–002. St. Marys University, Halifax, Canada, Department of Mathematics and Computer Science (2002)Google Scholar
  37. 37.
    Parmar, D., et al.: MMR: an algorithm for clustering categorical data using rough set theory. Data Knowl. Eng. 63(2007), 879–893 (2007)CrossRefGoogle Scholar
  38. 38.
    Amin, A., et al.: Churn prediction in telecommunication industry using rough set approach. New Trends in Computational Collective Intelligence, pp. 83–95. Springer International Publishing, Berlin (2015)Google Scholar
  39. 39.
    Amin, A., et al.: Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing 237, 242–254 (2017)CrossRefGoogle Scholar
  40. 40.

Copyright information

© Springer Science+Business Media New York 2017

Authors and Affiliations

  1. 1.Department of Electronics and Communication EngineeringIndra Ganesan College of EngineeringTiruchirappalliIndia

Personalised recommendations