Abstract
High employee turnover is a common problem that can affect organizational performance and growth. The ability to predict employee turnover would be an invaluable tool for any organization seeking to retain employees and predict their future behavior. This study employed machine learning (ML) algorithms to predict whether employees would leave a company. It presented a comparative performance combination of five ML algorithms and three Feature Selection techniques. In this experiment, the best predictors were identified using the SelectKBest, Recursive Feature Elimination (RFE) and Random Forest (RF) model. Different ML algorithms were trained, which included logistic regression, decision tree (DT), naïve Bayes, support vector machine (SVM) and AdaBoost with optimal hyperparameters. In the last phase of the experiment, the predictive models’ performance was evaluated using several critical metrics. The empirical results have demonstrated that two predictive models performed better: DT with SelectKBest and the SVM-polynomial kernel using RF.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sikaroudi, E., et al.: A data mining approach to employee turnover prediction (case study: Arak automotive parts manufacturing). J. Ind. Syst. Eng. 8(4), 106–121 (2015)
Keramati, A., et al.: Improved churn prediction in telecommunication industry using data mining techniques. Appl. Soft Comput. 24, 994–1012 (2014)
Fan, C.-Y., et al.: Using hybrid data mining and machine learning clustering analysis to predict the turnover rate for technology professionals. Expert Syst. Appl. 39(10), 8844–8851 (2012)
Chien, C.-F., Chen, L.-F.: Data mining to improve personnel selection and enhance human capital: a case study in high-technology industry. Expert Syst. Appl. 34(1), 280–290 (2008)
Saradhi, V.V., Palshikar, G.K.: Employee churn prediction. Expert Syst. Appl. 38(3), 1999–2006 (2011)
Hung, S.-Y., Yen, D.C., Wang, H.-Y.: Applying data mining to telecom churn management. Expert Syst. Appl. 31(3), 515–524 (2006)
Valle, M.A., Ruz, G.A.: Turnover prediction in a call center: behavioral evidence of loss aversion using random forest and Naïve Bayes algorithms. Appl. Artif. Intell. 29(9), 923–942 (2015)
García, D.L., Nebot, À., Vellido, A.: Intelligent data analysis approaches to churn as a business problem: a survey. Knowl. Inf. Syst. 51(3), 719–774 (2017)
Rombaut, E., Guerry, M.-A.: Predicting voluntary turnover through human resources database analysis. Manag. Res. Rev. 41(1), 96–112 (2018)
Lima, E., Mues, C., Baesens, B.: Domain knowledge integration in data mining using decision tables: case studies in churn prediction. J. Oper. Res. Soc. 60(8), 1096–1106 (2017)
De Caigny, A., Coussement, K., De Bock, K.W.: A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. Eur. J. Oper. Res. 269(2), 760–772 (2018)
Vafeiadis, T., et al.: A comparison of machine learning techniques for customer churn prediction. Simul. Model. Pract. Theory 55, 1–9 (2015)
Valle, M.A., Varas, S., Ruz, G.A.: Job performance prediction in a call center using a Naive Bayes classifier. Expert Syst. Appl. 39(11), 9939–9945 (2012)
Shearer, C.: The CRISP-DM model: the new blueprint for data mining. J. Data Warehous. 5(4), 13–22 (2000)
Kaggle. HR Analytics (2017). https://www.kaggle.com/colara/hr-analytics
Sainani, K.L.: Introduction to Survival Analysis. PM R 8(6), 580–585 (2016)
Kartsonaki, C.: Survival analysis. Diagn. Histopathol. 22(7), 263–270 (2016)
Amin, A., et al.: Comparing oversampling techniques to handle the class imbalance problem: a customer churn prediction case study. IEEE Access 4, 7940–7957 (2016)
Jain, D., Singh, V.: Feature selection and classification systems for chronic disease prediction: a review. Egypt. Inform. J. 19(3), 179–189 (2018)
Gao, X., Hou, J.: An improved SVM integrated GS-PCA fault diagnosis approach of Tennessee Eastman process. Neurocomputing 174, 906–911 (2016)
Moosavi, M., Soltani, N.: Prediction of the specific volume of polymeric systems using the artificial neural network-group contribution method. Fluid Phase Equilib. 356, 176–184 (2013)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Alaskar, L., Crane, M., Alduailij, M. (2019). Employee Turnover Prediction Using Machine Learning. In: Alfaries, A., Mengash, H., Yasar, A., Shakshuki, E. (eds) Advances in Data Science, Cyber Security and IT Applications. ICC 2019. Communications in Computer and Information Science, vol 1097. Springer, Cham. https://doi.org/10.1007/978-3-030-36365-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-36365-9_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36364-2
Online ISBN: 978-3-030-36365-9
eBook Packages: Computer ScienceComputer Science (R0)