Skip to main content

Measuring the Importance of Churn Predictors in Romanian Telecommunication Industry

  • Conference paper
  • First Online:
Eurasian Economic Perspectives

Part of the book series: Eurasian Studies in Business and Economics ((EBES,volume 16/1))

  • 473 Accesses

Abstract

Telecommunication sector is a saturated market, and each client’s action affects the company profit. The most valuable asset of the telecommunication company represents its clients’ database. The Telecom industry pays special attention to the migrant clients because from an economic perspective, the cost invested by the company in the acquisition of a new customer is higher than the cost of keeping an existing client. This paper aims to determine the most important factors that influence the decision of a client to migrate from a telecom provider to another through a graphical method. We apply the churn prediction model on a dataset from Romania that has not yet been studied before. We choose to use the Balanced Random Forest technique to build the churn prediction model and the AUC coefficient to evaluate it. Permutation importance makes a classification of the most important features in the model and measures their impact through a metric called the importance score. The result proves that the most significant three factors in the churn phenomenon (client migration) are the number of months since the last change in the account, the number of minutes off the network and the invoice cost, a significant difference in score, the first the indicator being ten times more important than the next one. Therefore, we can state that we can resolve the main action by resolving the churn problem on the current dataset solved by monitoring and evaluating these variables.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Achawanantakun, R., Chen, J., Sun, Y., & Zhang, Y. (2015). LncRNA-ID: Long non-coding RNA IDentification using balanced random forests. Bioinformatics, btv480.

    Google Scholar 

  • Archer, K., & Kimes, R. (2008). Empirical characterization of random forest variable importance measures. Computational Statistics & Data Analysis, 52(4), 2249–2260.

    Article  Google Scholar 

  • Auret, L., & Aldrich, C. (2011). Empirical comparison of tree ensemble variable importance measures. Chemometrics and Intelligent Laboratory Systems, 105(2), 157–170.

    Article  Google Scholar 

  • Blattberg, R., Kim, B. and Neslin, S., 2010. Database Marketing.

    Google Scholar 

  • Genuer, R., Poggi, J., & Tuleau-Malot, C. (2010). Variable selection using random forests. Pattern Recognition Letters, 31(14), 2225–2236.

    Article  Google Scholar 

  • Gregorutti, B., Michel, B., & Saint-Pierre, P. (2015). Grouped variable importance with random forests and application to multiple functional data analysis. Computational Statistics & Data Analysis, 90, 15–35.

    Article  Google Scholar 

  • Gregorutti, B., Michel, B., Saint-Pierre, P. (2017), ‘Correlation and variable importance in random forests’. Stat. Comput. 27(3), 659–678.

    Google Scholar 

  • Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Machine Learning, 46(1/3), 389–422.

    Article  Google Scholar 

  • Nason, M., Emerson, S., & LeBlanc, M. (2004). CARTscans: A tool for visualizing complex models. Journal of Computational and Graphical Statistics, 13(4), 807–825.

    Article  Google Scholar 

  • Nicodemus, K., & Shugart, Y. (2007). Impact of linkage disequilibrium and effect size on the ability of machine learning methods to detect epistasis in case-control studies. Proceedings of the Sixteenth Annual Meeting of the International Genetic Epidemiology Society, 31, 611.

    Google Scholar 

  • Parr, T., Turgutlu, K., Csiszar, C. & Howard, J., (2018). Beware Default Random Forest Importances. Explained.ai. Accessed Mar 13, 2019, from https://explained.ai/rf-importance/

  • Strobl, C., Boulesteix, A., Kneib, T., Augustin, T., & Zeileis, A. (2008). Conditional variable importance for random forests. BMC Bioinformatics, 9(1).

    Google Scholar 

  • Toloşi, L., & Lengauer, T. (2011). Classification with correlated features: Unreliability of feature ranking and solutions. Bioinformatics, 27(14), 1986–1994.

    Article  Google Scholar 

  • X. Yaya, L. Xiu, E. W. T. Ngai, & W. Ying. (2009). Customer churn prediction using improved balanced random forests. Expert Systems with Applications, 36(3), 5445–5449.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dumitrache, A., Stancu, S., Stefanet, M. (2021). Measuring the Importance of Churn Predictors in Romanian Telecommunication Industry. In: Bilgin, M.H., Danis, H., Demir, E., Vale, S. (eds) Eurasian Economic Perspectives. Eurasian Studies in Business and Economics, vol 16/1. Springer, Cham. https://doi.org/10.1007/978-3-030-63149-9_8

Download citation

Publish with us

Policies and ethics