Abstract
R has different types of data storage such as lists, arrays, and data frames. This can be confusing for some analysts with a pure background in handling rectangular datasets like data (with rows for records and variables for columns). The first and often the toughest or most time-consuming task in an analytical environment for a new project is getting the data loaded into the analytical software. This chapter discusses the techniques for reading in data from various formats. The two main methods of inputting data are through the command line and a GUI, and different packages for bigger datasets (¿1 GB) are discussed. In addition, obtaining data from various types of databases is specifically mentioned. Analyzing data can have many challenges associated with it. In the case of business analytics data, these challenges or constraints can have a marked effect on the quality and timeliness of the analysis as well as the expected versus actual payoff from the analytical results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media New York
About this chapter
Cite this chapter
Ohri, A. (2012). Manipulating Data. In: R for Business Analytics. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4343-8_4
Download citation
DOI: https://doi.org/10.1007/978-1-4614-4343-8_4
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-4342-1
Online ISBN: 978-1-4614-4343-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)