Abstract
In data science, most approaches require big data from a real-world system. Because of access limitations, the nonexistence of the system, or temporal and economic restrictions, such data might not be accessible or available. To overcome a lack of real-world data, this chapter introduces simulation-based data acquisition as a method for generating artificial data that serves as a substitute when applying data science techniques. Instead of gathering data from the real-world system, computer simulation is used to model and execute artificial systems that can provide a more accessible, economic, and robust source of big data. To this end, the chapter outlines how data science can benefit from simulation and vice versa. It introduces specific approaches for the design and execution of experiments and presents a selection of simulation frameworks that facilitate simulation studies for novice and professional users.
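As a rough illustration of the idea, the sketch below generates artificial data from a simulated system instead of a real one. It is not taken from the chapter: the single-server queue, the function names `simulate_queue` and `run_experiment`, and all parameter values are assumptions chosen only to show how a small experiment with several design points and replications might yield a data set for later analysis.

```python
import random
import statistics


def simulate_queue(arrival_rate, service_rate, n_customers, seed):
    """Simulate a single-server FIFO queue; return each customer's waiting time.

    Hypothetical toy model: exponential interarrival and service times.
    """
    rng = random.Random(seed)  # seeded generator keeps each run reproducible
    arrival = 0.0              # arrival time of the current customer
    server_free_at = 0.0       # time at which the server next becomes idle
    waits = []
    for _ in range(n_customers):
        arrival += rng.expovariate(arrival_rate)
        start = max(arrival, server_free_at)  # wait if the server is busy
        waits.append(start - arrival)
        server_free_at = start + rng.expovariate(service_rate)
    return waits


def run_experiment(arrival_rates, service_rate=1.0, n_customers=2000, replications=5):
    """Run every design point several times and collect one record per run."""
    rows = []
    for rate in arrival_rates:
        for rep in range(replications):
            # The replication index doubles as the seed, so all design points
            # share the same random streams (an assumed design choice).
            waits = simulate_queue(rate, service_rate, n_customers, seed=rep)
            rows.append({
                "arrival_rate": rate,
                "replication": rep,
                "mean_wait": statistics.mean(waits),
            })
    return rows


if __name__ == "__main__":
    for row in run_experiment([0.3, 0.6, 0.9]):
        print(row)
```

Each record pairs the input parameters with an observed output, so the resulting list can be handed to a data analysis step in the same way as measurements from a real system. More replications or more design points produce more data at the cost of additional computing time.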
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Lorig, F., Timm, I.J. (2020). Simulation-Based Data Acquisition. In: Arabnia, H.R., Daimi, K., Stahlbock, R., Soviany, C., Heilig, L., Brüssau, K. (eds) Principles of Data Science. Transactions on Computational Science and Computational Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-030-43981-1_1
DOI: https://doi.org/10.1007/978-3-030-43981-1_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43980-4
Online ISBN: 978-3-030-43981-1
eBook Packages: Engineering, Engineering (R0)