Abstract
This chapter is solely dedicated to the first half of the Assemble phase: Data Wrangling. Data wrangling is the overwhelmingly dominant part of a data scientists’ job, and this chapter (1) introduces wrangling to those unfamiliar with its place in the data lifecycle, and (2) puts an HR lens on data wrangling in terms of the employee lifecycle and HR data systems. This chapter concludes by introducing some common data formats, wrangling tools, as well as a few moderately advanced data transformation techniques, like binning, lagging, and z-scores.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note: Creating an index is a complex process which requires analysis of collinearity, latent variables, and other considerations. If you want to create an index, consult with your analytics team.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Rosett, C.M., Hagerty, A. (2021). Data Wrangling. In: Introducing HR Analytics with Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-030-67626-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-67626-1_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67625-4
Online ISBN: 978-3-030-67626-1
eBook Packages: Behavioral Science and PsychologyBehavioral Science and Psychology (R0)