Journal of Computer Science and Technology

, Volume 30, Issue 6, pp 1201–1214 | Cite as

Enhancing Telco Service Quality with Big Data Enabled Churn Analysis: Infrastructure, Model, and Deployment

  • Hui Li
  • Di Wu
  • Gao-Xiang Li
  • Yi-Hao Ke
  • Wen-Jie Liu
  • Yuan-Huan Zheng
  • Xiao-La Lin
Regular Paper


The penetration of mobile phones is nearly saturated in both developing and developed regions. In such a circumstance, how to prevent subscriber churn has become an important issue for today’s telecom operators, as the cost to acquire a new subscriber is much higher than that to retain an existing subscriber. In this paper, we propose to leverage the power of big data to mitigate the problem of subscriber churn and enhance the service quality of telecom operators. As the information hub, telecom operators have accumulated a huge volume of valuable data on subscriber behaviors, service usage, and network operations. To enable efficient big data processing, we first build a dedicated distributed cloud infrastructure that integrates both online and offline processing capabilities. Second, we develop a complete churn analysis model based on deep data mining techniques, and utilize inter-subscriber influence to improve prediction accuracy. Finally, we use real datasets obtained from a large telecom operator in China to verify the accuracy of our churn analysis models. The dataset contains the information of over 3.5 million subscribers, which generate over 600 million call detail records (CDRs) per month. The empirical results demonstrate that our proposed method can achieve around 90% accuracy for T + 1 testing periods and identify subscribers with high negative influence successfully.


subscriber churn big data cloud infrastructure telco service quality 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    Hu H, Wen Y G, Chua T S, Wang Z, Huang J, Zhu W W, Wu D. Community-based effective social video contents placement in cloud-centric CDN network. In Proc. the 2014 IEEE International Conference on Multimedia and Expo (ICME), July 2014.Google Scholar
  2. [2]
    Zhao Y, Li B, Li X, Liu W, Ren S. Customer churn prediction using improved one-class support vector machine. In Proc. the 1st Int. Conf. Advanced Data Mining and Applications, July 2005, pp.300–306.Google Scholar
  3. [3]
    Lee T, Chiu C, Chou Y, Lu C. Mining the customer credit using classification and regression tree and multivariate adaptive regression splines. Computational Statistics and Data Analysis, 2006, 50(4): 1113–1130.Google Scholar
  4. [4]
    Huang S, Yen D C, Wang H. Applying data mining to telecom churn management. Expert Systems with Applications, 2006, 31(3): 515–524.Google Scholar
  5. [5]
    Lemmens A, Croux C. Bagging and boosting classification trees to predict churn. Journal of Marketing Research, 2006, 43(2): 276–286.Google Scholar
  6. [6]
    Archaux C, Martin A, Khenchaf A. An SVM based churn detector in prepaid mobile telephony. In Proc. International Conference on Information and Communication Technologies: From Theory to Applications, April 2004, pp.459–460.Google Scholar
  7. [7]
    Au W, Chan K, Yao X. A novel evolutionary data mining algorithm with applications to churn prediction. IEEE Transactions on Evolutionary Computation, 2003, 7(6): 532–545.Google Scholar
  8. [8]
    Idris A, Khan A, Lee Y S. Intelligent churn prediction in telecom: Employing mRMR feature selection and rotboost based ensemble classification. Applied Intelligence, 2013, 39(3): 659–672.Google Scholar
  9. [9]
    Coussement K, van den Poel D. Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques. Expert Systems with Applications, 2008, 34(1): 313–327.Google Scholar
  10. [10]
    Ngonmang B, Viennet E, Tchuente M. Churn prediction in a real online social network using local community analysis. In Proc. International Conference on Advances in Social Networks Analysis and Mining, Aug. 2012, pp.282–288.Google Scholar
  11. [11]
    Dasgupta K, Singh R, Viswanathan B, Chakraborty D, Mukherjea S, Nanavati A A, Joshi A. Social ties and their relevance to churn in mobile telecom networks. In Proc. the 11th International Conference on Extending Database Technology, March 2008, pp.668–677.Google Scholar
  12. [12]
    Phadke C, Uzunalioglu H, Mendiratta V B, Kushnir D, Doran D. Prediction of subscriber churn using social network analysis. Bell Labs Technical Journal, 2013, 17(4): 63–76.Google Scholar
  13. [13]
    Zhang X, Zhu J, Xu S, Wan Y. Predicting customer churn through interpersonal influence. Knowledge-Based Systems, 2012, 28: 97–104.Google Scholar
  14. [14]
    Kusuma P D, Radosavljevik D, Takes F W, van der Putten P. Combining customer attribute and social network mining for prepaid mobile churn prediction. In Proc. the 23rd Annual Belgian Dutch Conference on Machine Learning (BENELEARN), June 2013, pp.50–58.Google Scholar
  15. [15]
    Gomez-Rodriguez M, Leskovec J, Krause A. Inferring networks of diffusion and influence. ACM Transactions on Knowledge Discovery from Data (TKDD), 2012, 5(4): 21:1–21:37.Google Scholar
  16. [16]
    Myers S, Zhu C, Leskovec J. Information diffusion and external influence in networks. In Proc. the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2012, pp.33–41.Google Scholar
  17. [17]
    Wen Y G, Zhu X Q, Rodrigues J J P C, Chen C W. Cloud mobile media: Reflections and outlook. IEEE Transactions on Multimedia (TMM), 2014, 16(4): 885–902.Google Scholar
  18. [18]
    Hu H, Wen Y G, Chua T S, Li X L. Towards scalable systems for big data analytics: A technology tutorial. IEEE Access Journal, 2014, 2: 652–687.Google Scholar
  19. [19]
    Xia W F, Wen Y G, Foh C H, Niyato D, Xie H Y. A survey on software-defined networking. IEEE Communication Surveys and Tutorials, 2015, 17(1): 27–51.Google Scholar
  20. [20]
    Magnusson J. Social network analysis utilizing big data technology [M.S. Thesis]. Uppsala University, 2012.Google Scholar
  21. [21]
    Euler T. Churn prediction in telecommunications using MiningMart. In Proc. the Workshop on Data Mining and Business, October 2005, pp.1–2.Google Scholar
  22. [22]
    Mozer M, Wolniewicz R, Grimes D B, Johnson E, Kaushanky H. Predicting subscriber dissatisfaction and improving retention in wireless telecommunications industry. IEEE Transactions on Neural Networks, 2000, 11(3): 690–696.Google Scholar
  23. [23]
    Nanavati A, Gurumurthy S, Das G, Chakraborty D, Dasgupta K, Mukherjea S, Joshi A. On the structural properties of massive telecom call graphs: Findings and implications. In Proc. ACM CIKM, November 2006, pp.435–444.Google Scholar
  24. [24]
    Motahari S, Jung T, Zang H, Janakiraman K, Li X Y, Hoo K S. Predicting the influencers on wireless subscriber churn. In Proc. IEEE Wireless Communications and Networking Conference (WCNC), April 2014, pp.3402–3407.Google Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  • Hui Li
    • 1
  • Di Wu
    • 1
  • Gao-Xiang Li
    • 1
  • Yi-Hao Ke
    • 1
  • Wen-Jie Liu
    • 1
  • Yuan-Huan Zheng
    • 1
  • Xiao-La Lin
    • 1
  1. 1.Department of Computer ScienceSun Yat-sen UniversityGuangzhouChina

Personalised recommendations