Semantic Attributes for Citation Relationships: Creation and Visualization
This paper presents a method to process a content of research papers in binary PDF format at a server side that gives research information systems new features of citation content analysis. This method efficiently generates JSON versions of PDF documents that allows an easier recognition of papers’ references, in-text references, citation context, etc. As a result, one can parse an extended set of citation data, including a location of citations in a research paper’s structure, frequency of mentioning for the same references, style of reference mentioning and so on. Based on these data we upgrade traditional citation relationships by adding some semantic attributes. Formatting these semantic data according W3C Web Annotation Data Model and integrating the data with some annotation tools, we visualize citation relationships, its semantic attributes and related statistics as annotations for readers of PDF documents from a research information system.
KeywordsResearch information system PDF.js PDF to JSON conversion Citation relationships Semantic attributes Citation content analysis Visualization
A part of this research (related with the annotation tool development) is funded by Russian Foundation for Basic Research, grant 12-07-00518-a. Another part – the approach development for extracting citation content data with focus on the supercomputer simulation of interactions among the agents and research community environment is funded by RSF grant (project No. 14-18-01968).
- 1.Smith, L.C.: Citation analysis. Libr. Trends 30(1), 83–106 (1981)Google Scholar
- 2.Garfield, E.: The relationship between citing and cited publications: a question of relatedness (1994). Originally published in the Current ContentsGoogle Scholar
- 3.Barrueco, J.M., Krichel, T.: Building an autonomous citation index for grey literature: the economics working papers case. In: Proceedings GL6: Sixth International Conference on Grey Literature (2004). http://core.ac.uk/download/pdf/11878095.pdf
- 5.Alschner, W., Umov, A.: Towards An Integrated Database of International Economic Law (IDIEL) Disputes (2016)Google Scholar
- 6.Bertin, M., Atanassova, I.: A study of lexical distribution in citation contexts through the IMRaD standard. PloS Negl. Trop. Dis. 1(200,920), 83–402 (2014)Google Scholar
- 10.Oevermann, J.: Reconstructing semantic structures in technical documentation with vector space classification. In: SEMANTiCS (Posters, Demos, SuCCESS) (2016)Google Scholar
- 11.Dix, A., Levialdi, S., Malizia, A.: Semantic halo for collaboration tagging systems. In: The Social Navigation and Community-Based Adaptation Technologies Workshop, 20 June 2006Google Scholar