Abstract
Data science is as much about manipulating data as it is about fitting models to it. Data rarely arrives in a form that we can feed directly into the statistical models or machine learning algorithms we want to analyze it with. The first stages of a data analysis are almost always figuring out how to load the data into R and then how to transform it into a shape you can readily analyze. The code in this chapter, and in all the following chapters, assumes that the packages magrittr and ggplot2 have been loaded (to avoid loading them explicitly in each example).
Copyright information
© 2017 Thomas Mailund
About this chapter
Cite this chapter
Mailund, T. (2017). Data Manipulation. In: Beginning Data Science in R. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-2671-1_3
DOI: https://doi.org/10.1007/978-1-4842-2671-1_3
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-2670-4
Online ISBN: 978-1-4842-2671-1
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)