Social Media Data in Research: Provenance Challenges
In this paper we argue that understanding the provenance of social media datasets and their analysis is critical to addressing challenges faced by the social science research community in terms of the reliability and reproducibility of research utilising such data. Based on analysis of existing projects that use social media data, we present a number of research questions for the provenance community, which if addressed would help increase the transparency of the research process, aid reproducibility, and facilitate data reuse in the social sciences.
KeywordsProvenance Social media Research process
- [CFLV12]Cheney, J., Finkelstein, A., Ludascher, B., Vansummeren, S.: Principles of provenance. Dagstuhl Rep. 2(2), 84–113 (2012)Google Scholar
- [CYG+15]Cottrill, C., Yeboah, G., Gault, P., Nelson, J.D., Anable, J., Budd, T.: Tweeting transport: examining the use of twitter in transport events. In: Proceedings of the 47th Annual UTSG Conference (2015)Google Scholar
- [fECoD13]Organisation for Economic Co-operation and Development: New data for understanding the human condition. Technical report, February 2013Google Scholar
- [Tuf13]Tufekci, Z.: Big data: Pitfalls, methods and concepts for an emergent field. Technical report, March 2013Google Scholar