Improved Churn Prediction Based on Supervised and Unsupervised Hybrid Data Mining System

  • J. Vijaya
  • E. Sivasankar
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 9)


Retaining a customer plays a vital role in success of every sales firm. Not only it increases the profit margin of the firm but also it maintains the ranking of the firm in the competitive market. Every organization competes for the success of itself in the competitive market, so it aims to retain the existing customers through customer preservation techniques. Because preserving an existing customer is less costly than adding new customer. Hence, customer association management (CAM) and customer maintenance (CM) are two important parameters in determining the success of the organization. Also quality service is another parameter which reduces the customer churn to a greater extent. Hence, every organization conducts a customer churn forecast as a valuable step because it aims at customer maintenance and organization success. In this work instead of single classifier resulting in low efficiency, hybrid supervised and unsupervised techniques are deployed to achieve improved churn prediction. In stage one, data cleaning is carried out to eliminate deviations in data set. In stage two, clustering algorithms such as K-Means and K-Medoids are used to group customers with similar trends. In stage three, hold-out methods based training and testing data sets are obtained from the above clusters. In the next stage, the training and testing are carried out by algorithms such as decision tree (DT), k-nearest neighbor (KNN), support vector machine (SVM), linear discriminant analysis (LDA), and naive bayes (NB). In the final stage, sensitivity, specificity, and accuracy are measured to evaluate the efficiency of the hybrid system.


CAM Churn CM K-Means K-Medoids Decision tree Support vector machine Linear discriminant analysis Naive bayes K-nearest neighbor 


  1. 1.
  2. 2.
    Mahmoudi N, Duman E (2015) Detecting credit card fraud by modified Fisher discriminant analysis. J Exp Syst Appl 42:2510–2516Google Scholar
  3. 3.
    Burez J, Van den Poel D (2009) Handling class imbalance in customer churn prediction. J Exp Syst Appl 36:4626–4636Google Scholar
  4. 4.
    Huang BQ, Kechadi MT, Buckley B (2012) Customer churns prediction in telecommunications. J Exp Syst Appl 39(1):1414–1425CrossRefGoogle Scholar
  5. 5.
    Hwang H, Jung T, Suh E (2004) An LTV model and customer segmentation based on customer value: a case study on the wireless telecommunication industry. J Exp Syst Appl 26(2):181–188Google Scholar
  6. 6.
    Larivire B, Poel DVD (2014) Predicting customer retention and profitability by using random forests and regression forests techniques. J Exp Syst Appl 29:472–484Google Scholar
  7. 7.
    Neslin SA et al (2006) Defection detection: measuring and understanding the predictive accuracy of customer churn models. J Market ResGoogle Scholar
  8. 8.
    Xia GE, Jin WD (2008) Model of customer churn prediction on support vector machine. J Syst Eng Theory Pract 28(1):71–77Google Scholar
  9. 9.
    Anil Kumar D, Ravi V (2008) Predicting credit card customer churn in banks using data mining. Int J Data Anal Tech StrategGoogle Scholar
  10. 10.
    Iturriaga FJL, Sanz IP (2015) Bankruptcy visualization and prediction using neural networks: a study of U.S. commercial banks. J Exp Syst Appl 42:2857–2869Google Scholar
  11. 11.
    Tsai CF, Lu YH (2009) Customer churns prediction by hybrid neural networks. J Exp Syst Appl 36(10): 12547–12553Google Scholar
  12. 12.
    Huang Y, Kechadi T (2013) An effective hybrid learning system for telecommunication churns prediction. J Exp Syst Appl 40:5635–5647Google Scholar
  13. 13.
    Bose I, Chen X (2009) Hybrid models using unsupervised clustering for prediction of customer churn. J Organ Comput Electr Comm 19:133–151. Copyright: Taylor & Francis Group, LLCGoogle Scholar
  14. 14.
    Khashei M, Hamadani AZ, Bijari M (2012) A novel hybrid classification model of artificial neural networks and multiple linear regression models. J Exp Syst Appl 39(3):2606–2620CrossRefGoogle Scholar
  15. 15.
    Kim K, Jun C-H, Lee J (2014) Improved churn prediction in telecommunication industry by analyzing large network. J Exp Syst Appl 41(1):6575–6584Google Scholar
  16. 16.
    Gerpott TJ, Ahmadi N (2015) Regaining drifting mobile communication customers: predicting the odds of success of winback efforts with competing risks regression. J Exp Syst Appl 42:7917–7928Google Scholar
  17. 17.
    Ali OG, Arıturk U (2014) Dynamic churn prediction framework with more effective use of rare event data: the case of private banking. J Exp Syst Appl 41:7889–7903Google Scholar
  18. 18.
    Idris A, Rizwan M, Khan A (2012) Churn prediction in telecom using random forest and PSO based data balancing in combination with various feature selection strategies. J Exp Syst Appl 38:1808–1819Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.National Institute of TechnologyTiruchirapalliIndia

Personalised recommendations