Semantic Attributes for Citation Relationships: Creation and Visualization

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 755)


This paper presents a method to process a content of research papers in binary PDF format at a server side that gives research information systems new features of citation content analysis. This method efficiently generates JSON versions of PDF documents that allows an easier recognition of papers’ references, in-text references, citation context, etc. As a result, one can parse an extended set of citation data, including a location of citations in a research paper’s structure, frequency of mentioning for the same references, style of reference mentioning and so on. Based on these data we upgrade traditional citation relationships by adding some semantic attributes. Formatting these semantic data according W3C Web Annotation Data Model and integrating the data with some annotation tools, we visualize citation relationships, its semantic attributes and related statistics as annotations for readers of PDF documents from a research information system.


Research information system PDF.js PDF to JSON conversion Citation relationships Semantic attributes Citation content analysis Visualization 



A part of this research (related with the annotation tool development) is funded by Russian Foundation for Basic Research, grant 12-07-00518-a. Another part – the approach development for extracting citation content data with focus on the supercomputer simulation of interactions among the agents and research community environment is funded by RSF grant (project No. 14-18-01968).


  1. 1.
    Smith, L.C.: Citation analysis. Libr. Trends 30(1), 83–106 (1981)Google Scholar
  2. 2.
    Garfield, E.: The relationship between citing and cited publications: a question of relatedness (1994). Originally published in the Current ContentsGoogle Scholar
  3. 3.
    Barrueco, J.M., Krichel, T.: Building an autonomous citation index for grey literature: the economics working papers case. In: Proceedings GL6: Sixth International Conference on Grey Literature (2004).
  4. 4.
    Waltman, L.: A review of the literature on citation impact indicators. J. Inf. 10(2), 365–391 (2016)CrossRefMathSciNetGoogle Scholar
  5. 5.
    Alschner, W., Umov, A.: Towards An Integrated Database of International Economic Law (IDIEL) Disputes (2016)Google Scholar
  6. 6.
    Bertin, M., Atanassova, I.: A study of lexical distribution in citation contexts through the IMRaD standard. PloS Negl. Trop. Dis. 1(200,920), 83–402 (2014)Google Scholar
  7. 7.
    Bertin, M., Atanassova, I., Gingras, Y., Larivière, V.: The invariant distribution of references in scientific articles. J. Assoc. Inf. Sci. Technol. 67(1), 164–177 (2016)CrossRefGoogle Scholar
  8. 8.
    Zhang, G., Ding, Y., Milojević, S.: Citation content analysis (CCA): a framework for syntactic and semantic analysis of citation content. J. Am. Soc. Inform. Sci. Technol. 64(7), 1490–1503 (2013)CrossRefGoogle Scholar
  9. 9.
    Ding, Y., Zhang, G., Chambers, T., Song, M., Wang, X., Zhai, C.: Content-based citation analysis: the next generation of citation analysis. J. Assoc. Inf. Sci. Technol. 65(9), 1820–1833 (2014)CrossRefGoogle Scholar
  10. 10.
    Oevermann, J.: Reconstructing semantic structures in technical documentation with vector space classification. In: SEMANTiCS (Posters, Demos, SuCCESS) (2016)Google Scholar
  11. 11.
    Dix, A., Levialdi, S., Malizia, A.: Semantic halo for collaboration tagging systems. In: The Social Navigation and Community-Based Adaptation Technologies Workshop, 20 June 2006Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Central Economics and Mathematics Institute of RASMoscowRussia
  2. 2.Russian Presidential Academy of National Economy and Public AdministrationMoscowRussia

Personalised recommendations