Abstract
Internet research, and especially social media research, has benefited from concurrent factors, technological and analytical, that have enabled access to vast amounts of user data and content online. These trends have accompanied a prevalence of Big Data studies of online activity, as researchers gather datasets featuring millions of tweets, for instance – here, Big Data is a reference not solely to the size of datasets but to the wider practices and research cultures around large-scale and exhaustive (and often ongoing) capture of data from large groups, often (but not always) studied quantitatively (see Kitchin and Lauriaut 2014a; Crawford et al. 2014). However, the accessibility of “big social data” (Manovich 2012) for Internet studies research is not without its limitations and challenges, and while extensive datasets enable valuable research, combining them with small data can provide more rounded perspectives and encourage us to think more about what we are studying. Similarly, privileging the online-only or the quantitative analysis of social media activity may overlook or mask key practices and relevant participants not present within the datasets. We argue for a blended data model as a critique and complement for different social media datasets, drawing in part on our research into social movements and activists’ use (and non-use) of online technologies. Together, these approaches may overcome and negotiate the respective limits and challenges of social media data, both big and small.
Similar content being viewed by others
References
Ananny M, Crawford K (2016) Seeing without knowing: limitations of the transparency ideal and its application to algorithmic accountability. New Media Soc. https://doi.org/10.1177/1461444816676645
Baym NK (2013) Data not seen: the uses and shortcomings of social media metrics. First Monday 18(10). Retrieved from http://firstmonday.org/ojs/index.php/fm/article/view/4873/3752
boyd d, Crawford K (2012) Critical questions for Big Data. Info Comm Soc 15(5):662–679. https://doi.org/10.1080/1369118X.2012.678878
Brock A (2012) From the blackhand side: Twitter as a cultural conversation. J Broadcast Electron Media 56(4):529–549. https://doi.org/10.1080/08838151.2012.732147
Brock A (2015) Deeper data: a response to Boyd and Crawford. Media. Culture Society 37(7):1084–1088
Brown LA, Strega S (2005) Research as resistance critical, indigenous and anti-oppressive approaches. Canadian Scholars’ Press, Toronto. Retrieved from http://site.ebrary.com/id/10191692
Bucher T (2012) Want to be on the top? Algorithmic power and the threat of invisibility on Facebook. New Media Soc 14(7):1164–1180
Burgess J, Galloway A, Sauter T (2015) Hashtag as hybrid forum: the case of #agchatoz. In: Rambukkana N (ed) Hashtag publics. Peter Lang, New York, NY, pp 61–76
Burgess J, Matamoros Fernández A (2016) Mapping sociocultural controversies across digital media platforms: one week of #gamergate on Twitter, YouTube, and Tumblr. Comm Res Pract 2(1):79–96
Chan A (2015) Big data interfaces and the problem of inclusion. Media Cult Soc 0163443715594106. https://doi.org/10.1177/0163443715594106
Chesters G (2012) Social movements and the ethics of knowledge production. Soc Mov Stud 11(2):145–160. https://doi.org/10.1080/14742837.2012.664894
Chief Elk L (2014) Teach-In Summer Fundraiser. YouCaring. Retrieved 3 June 2015, from http://www.youcaring.com/nonprofits/teach-in-summer-fundraiser/212206
Cho S, Crenshaw KW, McCall L (2013) Toward a field of intersectionality studies: theory, applications, and praxis. Signs 38(4):785–810. https://doi.org/10.1086/669608
Collins PH (1989) The social construction of black feminist thought. Signs 14(4):745–773
Consalvo M (2012) Confronting toxic gamer culture: a challenge for feminist game studies scholars. J Gender New Media Technol 1. Retrieved from http://adanewmedia.org/2012/11/issue1-consalvo/
Crawford K, Miltner K, Gray ML (2014) Critiquing Big Data: politics, ethics, epistemology. Int J Comm 8:1663–1672
Croeser S (2015) Global justice and the politics of information: the struggle over knowledge. Routledge, Hoboken, NJ
Croeser S, Highfield T (2014) Occupy Oakland and #oo: uses of Twitter within the occupy movement. First Monday 19(3). https://doi.org/10.5210/fm.v19i3.4827
Croeser S, Highfield T (2015a) Harbouring dissent: Greek independent and social media and the antifascist movement. Fibreculture 26:136–157
Croeser S, Highfield T (2015b) Mapping movements - social movement research and Big Data: critiques and alternatives. In: Langlois G, Redden J, Elmer G (eds) Compromised data: from social media to Big Data. Bloomsbury, pp 173–201
Driscoll K, Thorson K (2015) Searching and clustering methodologies: connecting political communication content across platforms. Ann Am Acad Pol Soc Sci 659(1):134–148. https://doi.org/10.1177/0002716215570570
Duguay S (forthcoming) Social media’s breaking news: the logic of automation in Facebook trending topics and twitter moments
Freelon D, McIlwain CD, Clark MD (2016) Beyond the hashtags: #Ferguson, #Blacklivesmatter, and the online struggle for offline justice. Centre for Social Media & Social Impact, Washington, DC. http://cmsimpact.org/wp-content/uploads/2016/03/beyond_the_hashtags_2016.pdf
Gillespie T (2010) The politics of “platforms”. New Media Soc 12(3):347–364
Gillespie T (2014) The relevance of algorithms. In: Gillespie T, Boczkowski PJ, Foot KA (eds) Media technologies: essays on communication, materiality, and society. The MIT Press, Cambridge, MA, pp 167–194
González-Bailón S, Wang N, Rivero A, Borge-Holthoefer J, Moreno Y (2014) Assessing the bias in samples of large online networks. Soc Networks 38:16–27. https://doi.org/10.1016/j.socnet.2014.01.004
Harry S (2014, October 6) Everyone Watches, Nobody Sees: How Black Women Disrupt Surveillance Theory. Model View Culture. Retrieved from https://modelviewculture.com/pieces/everyone-watches-nobody-sees-how-black-women-disrupt-surveillance-theory
Hesse-Biber S, Johnson RB (2013) Coming at things differently: future directions of possible engagement with mixed methods research. J Mixed Methods Res 7(2):103–109. https://doi.org/10.1177/1558689813483987
Highfield T, Leaver T (2015) A methodology for mapping Instagram hashtags. First Monday (1):20
Highfield T, Leaver T (2016) Instagrammatics and digital methods: studying visual social media, from selfies and GIFs to memes and emoji. Comm Res Pract 2(1):47–62
Hoffman AL (2014, June 30) Reckoning with a decade of breaking things. Model View Culture. Retrieved from https://modelviewculture.com/pieces/reckoning-with-a-decade-of-breaking-things
hooks b (2000) Feminist theory: from margin to Center. Pluto Press, London
Humphreys L, Gill P, Krishnamurthy B, Newbury E (2013) Historicizing new media: a content analysis of twitter. J Commun 63(3):413–431
Johnson RB, Onwuegbuzie AJ, Turner LA (2007) Toward a definition of mixed methods research. Journal of Mixed Methods Research 1(2):112–133. https://doi.org/10.1177/1558689806298224
Kim D (2014, October 7) Social media and academic surveillance: the ethics of digital bodies. Model View Culture. Retrieved from http://modelviewculture.com/pieces/social-media-and-academic-surveillance-the-ethics-of-digital-bodies
Kitchin R (2014) Big Data, new epistemologies and paradigm shifts. Big Data Soc 1(1):2053951714528481. https://doi.org/10.1177/2053951714528481
Kitchin R, Lauriault TP (2014a) Small Data, Data Infrastructures and Big Data (SSRN Scholarly Paper No. ID 2376148). Rochester, NY: Social Science Research Network. Retrieved from http://papers.ssrn.com/abstract=2376148
Kitchin R, Lauriault TP (2014b) Towards critical data studies: charting and unpacking data assemblages and their work (SSRN Scholarly Paper No. ID 2474112). Rochester, NY: Social Science Research Network. Retrieved from http://papers.ssrn.com/abstract=2474112
Kim D, Kim E (2014, April 7) The #TwitterEthics manifesto. Model View Culture. Retrieved from https://modelviewculture.com/pieces/the-twitterethics-manifesto
Krikorian R (2014) Introducing Twitter Data Grants. Retrieved 4 Aug 2015, from https://blog.twitter.com/2014/introducing-twitter-data-grants
Langlois G, Elmer G (2013) The research politics of social media platforms. Culture machine (14)
Mahrt M, Scharkow M (2013) The value of Big Data in digital media research. J Broadcast Electron Media 57(1):20–33. https://doi.org/10.1080/08838151.2012.761700
Manovich L (2012) Trending: the promises and the challenges of Big Social Data. In: Gold MK (ed) Debates in the digital humanities. University of Minnesota Press, Minneapolis, pp 460–475
Marwick A, Caplan R (2017) Media manipulation and disinformation online. Data Soc, New York City. https://datasociety.net/pubs/oh/DataAndSociety_MediaManipulationAndDisinformationOnline.pdf
Massanari A (2017) #Gamergate and the Fappening: how Reddit’s algorithm, governance, and culture support toxic technocultures. New Media Soc 19(3):329–346
Matamoros Fernández A (2017) Platformed racism: the mediation and circulation of an Australian race-based controversy on Twitter, Facebook and YouTube. Info Comm Soc 20(6):930–946
McElroy K (2015) Gold medals, black twitter, and not-so-good hair: framing the gabby Douglas controversy. ISOJ 1(1). Retrieved from https://isojjournal.wordpress.com/2015/04/15/gold-medals-black-twitter-and-not-so-good-hair-framing-the-gabby-douglas-controversy/
Moe H (2010) Everyone a pamphleteer? Reconsidering comparisons of mediated public participation in the print age and the digital era. Media Cult Soc 32(4):691–700
Morstatter F, Pfeffer J, Liu H, Carley KM (2013) Is the sample good enough? Comparing data from Twitter’s Streaming API with Twitter’s Firehose. In: Proceedings of the 7th International AAAI Conference on Weblogs and Social Media, pp 400–408
Noble SU (forthcoming) Algorithms of oppression: how search engines reinforce racism. NYU Press, New York City
Papacharissi Z (2015) The unbearable lightness of information and the impossible gravitas of knowledge: Big Data and the makings of a digital orality. Media Cult Soc 0163443715594103. https://doi.org/10.1177/0163443715594103
Puschmann C, Burgess J (2014) Metaphors of Big Data. Int J Comm 8:1690–1709
Qiu JL (2015) Reflections on Big Data: “just because it is accessible does not make it ethical”. Media Cult Soc 0163443715594104. https://doi.org/10.1177/0163443715594104
Ramsey DX (2015, April 10) The truth about Black Twitter. The Atlantic. http://www.theatlantic.com/technology/archive/2015/04/the-truth-about-black-twitter/390120/
Raynes-Goldie K (2012) Privacy in the age of facebook : discourse, architecture, consequences. Curtin Universitye. Retrieved from http://espace.library.curtin.edu.au/R?func=dbin-jump-full&local_base=gen01-era02&object_id=187731
Reger J (2001) Emotions, objectivity and voice: an analysis of a “failed” participant observation. Women’s Stud Int Forum 24(5):605–616. https://doi.org/10.1016/S0277-5395(01)00190-X
Rentschler CA (2014) Rape culture and the feminist politics of social media. Girlhood Studies 7(1):65–82. https://doi.org/10.3167/ghs.2014.070106
Sawyer S (2008) Data wealth, data poverty, science and cyberinfrastructure. Prometheus 26(4):355–371. https://doi.org/10.1080/08109020802459348
Steinhauer J (2014, July 28) Native activist charges art students with plagiarism. Hyperallergic. Retrieved 3 June 2015, from http://hyperallergic.com/139769/native-activist-charges-art-students-with-plagiarism/
Tufekci Z (2014) Big Questions for social media Big Data: Representativeness, validity and other methodological pitfalls. In: ICWSM ‘14: Proceedings of the 8th International AAAI Conference on Weblogs and Social Media. Ann Arbor
Tufekci Z (2017) Twitter and tear gas: the power and fragility of networked protest. Yale University Press, New Haven, US
van Dijck J, Poell T (2013) Understanding social media logic. Media Comm 1(1):2–14
Vis F (2013) A critical reflection on Big Data: considering APIs, researchers and tools as data makers. First Monday 18(10). Retrieved from http://firstmonday.org/ojs/index.php/fm/article/view/4878
Weller K, Kinder-Kurlanda KE (2015) Uncovering the challenges in collection, sharing and documentation: The hidden data of social media research? Ninth International AAAI Conference on Web and Social Media
Young S (2012) I identify as a disabled person. Retrieved 27 Aug 2015, from http://www.mamamia.com.au/news/stella-young-i-identify-as-a-disabled-person/
Zimmer M, Proferes N (2014) A topology of Twitter research: disciplines, methods, and ethics. Aslib J Manag 66(3):250–261
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media B.V., part of Springer Nature
About this entry
Cite this entry
croeser, s., Highfield, T. (2018). Blended Data: Critiquing and Complementing Social Media Datasets, Big and Small. In: Hunsinger, J., Klastrup, L., Allen, M. (eds) Second International Handbook of Internet Research. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-1202-4_15-1
Download citation
DOI: https://doi.org/10.1007/978-94-024-1202-4_15-1
Received:
Accepted:
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-1202-4
Online ISBN: 978-94-024-1202-4
eBook Packages: Springer Reference Biomedicine and Life SciencesReference Module Biomedical and Life Sciences