Mining Open Government Data Used in Scientific Research
In this paper, we describe results from mining citations, mentions, and links to open government data (OGD) in peer-reviewed literature. We inductively develop a method for categorizing how OGD are used by different research communities, and provide descriptive statistics about the publication years, publication outlets, and OGD sources. Our results demonstrate that (1) the use of OGD in research increased steadily from 2009 to 2016; (2) researchers use OGD from 96 different open government data portals, with data.gov.uk and data.gov being the most frequent sources; and (3) contrary to previous findings, there is evidence suggesting that OGD from developing nations, notably India and Kenya, are frequently used to fuel scientific discoveries. The findings of this paper contribute to ongoing research agendas aimed at tracking the impact of open government data initiatives, and provide an initial description of how open government data are valuable to diverse scientific research communities.
Keywords: Open data, Literature mining, Research policy, E-government
This research was supported in part by IMLS grant # RE-40-16-0015-16. Supporting data and in-depth explanation of the methods used in this study can be found at https://github.com/OpenDataLiteracy/iConference_2018.