Abstract
This paper is based on the theme of employee attrition where the reasoning behind employee turnover has predicted with the help of machine learning approach. As employee turnover has become a vital issue these days due to heavy work pressure, less salary, less work satisfaction, poor working environment; it’s high time to uphold a better solution on this term. Therefore, we have come up with a prediction model based on machine learning approach where we have used each feature’s respective Random Forest importance weights while threshold based correlated feature merging into each of the single combined variable. Again, we scale specific features to get the correlated matrix of features matrix by defining threshold. Certainly, this newly developed technique has achieved good result for some algorithms compared to Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) for the same dataset.
Keywords
- Random forest
- PCA
- LDA
- Dimensionality reduction
- Classifier
This is a preview of subscription content, access via your institution.
Buying options






References
Sikaroudi, E., Mohammad, A., Ghousi, R., Sikaroudi, A.: A data mining approach to employee turnover prediction (case study: Arak automotive parts manufacturing). J. Ind. Syst. Eng. 8(4), 106–121 (2015)
Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(Oct), 2825–2830 (2011)
Gao, Y.: Using decision tree to analyze the turnover of employees (2017)
Ajit, P.: Prediction of employee turnover in organizations using machine learning algorithms. Algorithms 4(5), C5 (2016)
Howley, T., Madden, M.G., O’Connell, M.L., Ryder, A.G.: The effect of principal component analysis on machine learning accuracy with high-dimensional spectral data. Knowl. Based Syst. 19(5), 363–370 (2006)
Maisuradze, M.: Predictive analysis on the example of employee turnover
Alam, M., Mohiuddin, K., Hassan, M.M., Islam, M., Allayear, S.: A machine learning approach to analyze and reduce features to a significant number for employee’s turn over prediction model. In: IEEE Computing Conference 2018, London (2018)
Fan, C.Y., Fan, P.S., Chan, T.Y., Chang, S.H.: Using hybrid data mining and machine learning clustering analysis to predict the turnover rate for technology professionals. Expert Syst. Appl. 39(10), 8844–8851 (2012)
L. (n.d.). HR Analytics. https://www.kaggle.com/ludobenistant/hr-analytics-1/notebook. Accessed 09 Dec 2017
Sklearn.preprocessing.StandardScaler (n.d.). http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html. Accessed 01 Oct 2017
Sklearn.preprocessing.RobustScaler (n.d.). http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.RobustScaler.html. Accessed 01 Oct 2017
Sklearn.preprocessing.MinMaxScaler (n.d.). http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.MinMaxScaler.html#sklearn.preprocessing.MinMaxScaler. Accessed 01 Oct 2017
Pandas.DataFrame.corr (n.d.). https://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.corr.html. Accessed 09 Dec 2017
Pandas.get_dummies (n.d.). https://pandas.pydata.org/pandas-docs/stable/generated/pandas.get_dummies.html. Accessed
Raschka, S.: Python machine learning. Packt Publishing Ltd., Birmingham (2015)
Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2(3), 18–22 (2002)
Sklearn.model_selection.train_test_split (n.d.). http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html. Accessed 10 Dec 2017
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemometr. Intell. Lab. Syst. 2(1–3), 37–52 (1987)
Izenman, A.J.: Linear discriminant analysis. In: Izenman, A.J. (ed.) Modern Multivariate Statistical Techniques. STS, pp. 237–280. Springer, New York (2013). https://doi.org/10.1007/978-0-387-78189-1_8
Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Islam, M.K., Alam, M.M., Islam, M.B., Mohiuddin, K., Das, A.K., Kaonain, M.S. (2018). An Adaptive Feature Dimensionality Reduction Technique Based on Random Forest on Employee Turnover Prediction Model. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T. (eds) Advances in Computing and Data Sciences. ICACDS 2018. Communications in Computer and Information Science, vol 906. Springer, Singapore. https://doi.org/10.1007/978-981-13-1813-9_27
Download citation
DOI: https://doi.org/10.1007/978-981-13-1813-9_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1812-2
Online ISBN: 978-981-13-1813-9
eBook Packages: Computer ScienceComputer Science (R0)