Abstract
This chapter provides an overview of the current development of big data in the computational social sciences and humanities. It is composed of two parts. In the first part, we review works incorporating the three most frequently seen types of big data, namely geographic data, text corpus data, and social media data, that are used to conduct research on the social sciences in a wide range of fields, including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the chapter provides a panoramic view of the development of big data in the computational social sciences and humanities, including recent trends and the evoked challenges. As for the former, we review four representative cases of its timely development. They are big data finance, big data in psychology, the spatial humanities, and cloud computing. As for the latter, we present an overview of four challenges associated with big data, namely the complexity of big data or the ontology and epistemology of big data, big data search, big data simulation, and big data risk.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Computational social sciences, as the title of this book series demonstrates, require little explanation. The term, computational humanities, however, is less popular. Gerhard Heyer distinguishes digital humanities from computational humanities as follows. The former is the creation, dissemination, and use of digital repositories, and the latter is the computer-based analysis of digital repositories using advanced computational and algorithmic methods (Biemann et al. 2014). Alternatively, “[c]omputational humanities is an emerging field that bridges the sciences and humanities with the goal of creating accurate computer simulations of historical, social, cultural, and religious events (Cruz-Neira 2003, p. 10).” See Gavin (2014) for a demonstration of the above two descriptions of computational humanities.
- 2.
- 3.
- 4.
Richard Thaler is the 2017 Nobel Laureate in Economics.
- 5.
There is a philosophical issue as to whether machines will evolve to have their own interpretations of the text and hence develop their own emotions which are different from those of general human beings under the governance of their own culture. More positively, would machines surpass humans by demonstrating the features of positive psychology, as advocated by Martin Seligman (Seligman 2004), more successfully than humans?
- 6.
- 7.
While there are only two chapters collected in this volume, the interested reader may find more useful references in Peterson (2016) and the excellent collections edited by Mitra and Xiang (2016). However, sentiment analysis may go further, beyond what the current literature delineates, and can be further incorporated into agent-based computational finance and give new impetus to behavioral finance (Chen and Venkatachalam 2017).
- 8.
- 9.
- 10.
The PTT Bulletin Board System is the largest terminal-based bulletin board system (BBS) based in Taiwan. For more information, see https://en.wikipedia.org/wiki/PTT_Bulletin_Board_System.
- 11.
The conundrum has been well illustrated by the so-called adaptive market hypothesis, which endowed the efficient markets hypothesis with a dynamic and evolutionary interpretation (Lo 2004). In the vein of the agent-based fashion, the adaptive market hypothesis has been further studied in the form of the market fraction hypothesis (Chen et al. 2010).
- 12.
This project is carried out within a collaboration between the Kavli Foundation, the Institute for the Interdisciplinary Study of Decision Making at New York University (NYU), and the NYU Center for Urban Science and Progress. For more details, the interested reader is referred to Azmak et al. (2015).
- 13.
The current use of big data in psychology is not just exhausted by the survey presented in this chapter. The journal Psychological Methods has published a special issue on this frontier (Harlow and Oswald 2016). For other developments, the interested reader is also referred to Cheung and Jak (2016) and Jones (2016).
- 14.
The representativeness heuristic is one of the heuristics that has been carefully studied by psychologists and behavioral economists, regarding how human decisions or judgments are made under uncertainty (Kahneman and Tversky 1972).
- 15.
The interested reader is welcome to visit its home page: http://apsti.nccu.edu.tw/.
- 16.
For a general background of this fast-growing field, the interested reader is referred to Bodenhamer et al. (2010).
- 17.
This feature can be coined as the big data paradox, namely too big to be “small.”
- 18.
In the development of the computational social sciences and humanities, the role of cyborgs is often ignored. For example, in social simulation or agent-based simulation, there is a clear distinction between human agents and software agents, but their possible hybridizations are left out. See Chen et al. (2018).
References
Azmak, O., Bayer, H., Caplin, A., Chun, M., Glimcher, P., Koonin, S., & Patrinos, A. (2015). Using Big data to understand the human condition: The Kavli HUMAN project. Big Data, 3(3), 173–188.
Bauerlein, M. (2008). The dumbest generation: How the digital age stupefies young Americans and jeopardizes our future (or, don’t trust anyone under 30). London: Penguin.
Biemann, C., Crane, G. R., Fellbaum, C. D., & Mehler, A. (2014). Computational humanities-bridging the gap between computer science and digital humanities (Dagstuhl Seminar 14301). In Dagstuhl reports (Vol. 4, No. 7). Dagstuhl: Schloss Dagstuhl-Leibniz-Zentrum für Informatik.
Bodenhamer, D. J., Corrigan, J., & Harris, T. M. (Eds.). (2010). The spatial humanities: GIS and the future of humanities scholarship. Bloomington: Indiana University Press.
Cambria, E., Das, D., Bandyopadhyay, S., & Feraco, A. (Eds.). (2017). A practical guide to sentiment analysis (Vol. 5). Heidelberg: Springer.
Ceron, A., Curini, L., & Iacus, S. M. (2016). Politics and Big data: Nowcasting and forecasting elections with social media. Didcot: Taylor & Francis.
Chen, S.-H. (2008). Financial applications: Stock markets. In B. Wang (Ed.), Wiley encyclopedia of computer science and engineering (pp. 1227–1244). Hoboken: Wiley.
Chen, S.-H. (2013). Reasoning-based artificial agents in agent-based computational economics. In K. Nakamatsu & L. Jain (Eds.), The handbook on reasoning-based intelligent systems (pp. 575–602). Singapore: World Scientific.
Chen, S.-H., & Venkatachalam, R. (2017). Agent-based modelling as a foundation for big data. Journal of Economic Methodology, 24(4), 362–383.
Chen, S. H., Kaboudan, M., & Du, Y. R. (2018). Computational economics in the era of natural computationalism. In S. H. Chen, M. Kaboudan, & Y. R. Du (Eds.), The Oxford handbook of computational economics and finance. New York: Oxford.
Chen, S.-H., Kampouridis, M., & Tsang, E. (2010). Microstructure dynamics and agent-based financial markets. In International workshop on multi-agent systems and agent-based simulation (pp. 121–135). Berlin: Springer.
Cheung, M. W. L., & Jak, S. (2016). Analyzing big data in psychology: A split/analyze/meta-analyze approach. Frontiers in Psychology, 7, 738 https://doi.org/10.3389/fpsyg.2016.00738.
Clark, A. E., Flèche, S., Layard, R., Powdthavee, N., & Ward, G. (2018). The origins of happiness: The science of Well-being over the life course. Princeton: Princeton University Press.
Conover, M. D., Ferrara, E., Menczer, F., & Flammini, A. (2013). The digital evolution of occupy Wall Street. PLoS One, 8(5), e64679.
Conroy, N. J., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for finding fake news. Proceedings of the Association for Information Science and Technology, 52(1), 1–4.
Cooper, C. (2017). Citizen science: How ordinary people are changing the face of discovery. London: Gerald Duckworth & Co.
Cruz-Neira, C. (2003). Computational humanities: The new challenge for VR. IEEE Computer Graphics and Applications, 23(3), 10–13.
Franzoni, C., & Sauermann, H. (2014). Crowd science: The organization of scientific research in open collaborative projects. Research Policy, 43(1), 1–20.
Gavin, M. (2014). Agent-based modeling and historical simulation. DHQ: Digital Humanities Quarterly, 8(4). Retrieved January 12, 2015, from http://www.digitalhumanities.org/dhq/vol/8/4/000195/000195.html
Hansen, S., McMahon, M., & Prat, A. (2018). Transparency and deliberation within the FOMC: A computational linguistics approach. The Quarterly Journal of Economics, 1, 70. https://doi.org/10.1093/qje/qjx045.
Harlow, L. L., & Oswald, F. L. (2016). Big data in psychology: Introduction to the special issue. Psychological Methods, 21(4), 447.
Helbing, D., Frey, B. S., Gigerenzer, G., Hafen, E., Hagner, M., Hofstetter, Y., et al. (2017). Will democracy survive big data and artificial intelligence? Scientific American, 25. Retrieved February 27, 2017, from https://www.scientificamerican.com/article/will-democracy-survive-big-data-and-artificial-intelligence/ (accessed 27 Feb, 2017)
Jones, M. N. (Ed.). (2016). Big data in cognitive science. Hove: Psychology Press.
Joshi, A., Mishra, A., Senthamilselvan, N., & Bhattacharyya, P. (2014). Measuring sentiment annotation complexity of text. In Proceedings of the 52nd annual meeting of the Association for Computational Linguistics (volume 2: Short papers) (Vol. 2, pp. 36–41).
Kahneman, D., & Tversky, A. (1972). Subjective probability: A judgment of representativeness. Cognitive Psychology, 3(3), 430–454.
Kleiner, B., Stam, A., & Pekari, A. (2015). Big data for the social sciences (FORS Working Papers, 2015-2).
Lane, J., Stodden, V., Bender, S., & Nissenbaum, H. (Eds.). (2014). Privacy, big data, and the public good: Frameworks for engagement. Cambridge: Cambridge University Press.
Liu, B. (2015). Sentiment analysis: Mining opinions, sentiments, and emotions. Cambridge: Cambridge University Press.
Loader, B. D., Vromen, A., Xenos, M. A., Steel, H., & Burgum, S. (2015). Campus politics, student societies and social media. The Sociological Review, 63(4), 820–839.
Lo, A. W. (2004). The adaptive markets hypothesis: Market efficiency from an evolutionary perspective. Journal of Portfolio Management, 30, 15–29.
McCloskey, D. N. (1983). The rhetoric of economics. Journal of Economic Literature, 21(2), 481–517.
McCloskey, D. N. (1998). The rhetoric of economics. Madison: University of Wisconsin Press.
Mitra, G., & Xiang, Y. (2016). Handbook of sentiment analysis in finance. New York: Albury Books.
Morson, G. S., & Schapiro, M. (2017). Cents and sensibility: What economics can learn from the humanities. Princeton: Princeton University Press.
Morzy, M., Kajdanowicz, T., & Kazienko, P. (2017). On measuring the complexity of networks: Kolmogorov complexity versus entropy. Complexity, 2017, 3250301.
O’Neil, C. (2017). Weapons of math destruction: How big data increases inequality and threatens democracy. New York: Broadway Books.
Peters, B. (2012). The big data gold rush. New York: Forbes Magazine.
Peterson, R. L. (2016). Trading on sentiment: The power of minds over markets. Hoboken: Wiley.
Pinheiro, F. L., Santos, M. D., Santos, F. C., & Pacheco, J. M. (2014). Origin of peer influence in social networks. Physical Review Letters, 112(9), 098702.
Pozzi, F. A., Fersini, E., Messina, E., & Liu, B. (2016). Sentiment analysis in social networks. Burlington: Morgan Kaufmann.
Rossbach, S. (1983). Feng Shui, the Chinese art of placement. New York: EP Dutton. Inc.
Roy, D., & Zeckhauser, R. (2016). Literary light on decision’s dark corner. In R. Frantz, S. H. Chen, K. Dopfer, F. Heukelom, & S. Mousavi (Eds.), Routledge handbook of behavioral economics (pp. 230–249). Abingdon: Routledge.
Savage, M., & Burrows, R. (2007). The coming crisis of empirical sociology. Sociology, 41(5), 885–899.
Seligman, M. E. (2004). Authentic happiness: Using the new positive psychology to realize your potential for lasting fulfillment. New York: Simon and Schuster.
Shiller, R. J. (2017). Narrative economics. American Economic Review, 107(4), 967–1004.
Soja, E. (2001). In different spaces: Interpreting the spatial organization of societies. In Proceedings, 3rd international space syntax symposium (p. 1-s1).
Soros, G. (2013). Fallibility, reflexivity, and the human uncertainty principle. Journal of Economic Methodology, 20(4), 309–329.
Stephens-Davidowitz, S., & Pabon, A. (2017). Everybody lies: Big data, new data, and what the internet can tell us about who we really are. New York: HarperLuxe.
Sunstein, C. R. (2008). Neither Hayek nor Habermas. Public Choice, 134(1–2), 87–95.
Thompson, A. (2016). Journalists and Trump voters live in separate online bubbles, MIT analysis shows. New York: Vice News.
Vaillant, G. E. (2008). Aging well: Surprising guideposts to a happier life from the landmark study of adult development. Boston: Little, Brown.
Webster, R. (2012). Feng Shui for beginners: Successful living by design. Woodbury: Llewellyn Worldwide.
WHO-CBD. (2015). Connecting global priorities: biodiversity and human health: a state of knowledge review, p. 344.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Chen, SH., Yu, T. (2018). Big Data in Computational Social Sciences and Humanities: An Introduction. In: Chen, SH. (eds) Big Data in Computational Social Science and Humanities. Computational Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-95465-3_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-95465-3_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95464-6
Online ISBN: 978-3-319-95465-3
eBook Packages: Computer ScienceComputer Science (R0)