Determining How Citations Are Used in Citation Contexts

  • Conference paper
Digital Libraries for Open Knowledge (TPDL 2019)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11799)

Citations have been classified based on their textual contexts with respect to their worthiness, function, polarity, and importance. To the best of our knowledge, citations have not yet been classified automatically by their grammatical role, that is, whether the citation (1) is grammatically integrated in the sentence, (2) is annotated directly after the occurrence of author names, (3) backs up a concept, (4) backs up a claim, or (5) is not appropriate because the context is incomplete or noisy. We argue that determining such classes for citation contexts is useful for a variety of tasks, such as improved citation recommendation and scientific impact quantification. In this paper, we propose this classification scheme, as well as a machine-learning-based approach to determine the classes automatically. Our evaluation reveals that the classification performance varies significantly between the citation types.
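Since a citation context can belong to one or several of the five citation types, the scheme can be pictured as a multi-label problem. The following is a minimal sketch of how such labels might be encoded and scored per type; the label names, the helper functions, and the choice of F1 as the metric are assumptions made for illustration and are not taken from the paper.

```python
# Hypothetical short names for the five citation types of the scheme.
CITATION_TYPES = ["integrated", "author_names", "concept", "claim", "inappropriate"]


def to_multi_hot(labels):
    """Encode a set of citation types as a 0/1 vector (one slot per type)."""
    unknown = set(labels) - set(CITATION_TYPES)
    if unknown:
        raise ValueError(f"unknown citation types: {sorted(unknown)}")
    return [1 if name in labels else 0 for name in CITATION_TYPES]


def per_label_f1(gold, pred):
    """Compute F1 separately for each citation type.

    gold and pred are lists of multi-hot vectors of equal length.
    A type with no gold and no predicted instances gets a score of 0.0.
    """
    scores = {}
    for i, name in enumerate(CITATION_TYPES):
        tp = sum(1 for g, p in zip(gold, pred) if g[i] == 1 and p[i] == 1)
        fp = sum(1 for g, p in zip(gold, pred) if g[i] == 0 and p[i] == 1)
        fn = sum(1 for g, p in zip(gold, pred) if g[i] == 1 and p[i] == 0)
        denom = 2 * tp + fp + fn
        scores[name] = 2 * tp / denom if denom else 0.0
    return scores


# One context backing up both a concept and a claim, one integrated citation.
gold = [to_multi_hot({"concept", "claim"}), to_multi_hot({"integrated"})]
# Made-up predictions that miss the "claim" label on the first context.
pred = [to_multi_hot({"concept"}), to_multi_hot({"integrated"})]
scores = per_label_f1(gold, pred)
```

Scoring each type separately, rather than reporting a single aggregate, is what makes differences in performance between citation types visible.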

  1. See for our source code. Note that each citation context can belong to one or several citation types. This makes our classification task a multi-label classification task.

  2. See The pretrained vectors were trained on Common Crawl and Wikipedia using the CBOW model of fastText. fastText operates at the character level and can therefore generate vectors for words not seen in the training corpus.
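    The character-level behavior described in this note can be illustrated with a toy sketch: a word is wrapped in boundary markers, split into character n-grams, and its vector is composed from the n-gram vectors, so an unseen word still receives a vector. The n-gram lengths (3 to 6), the bucket count, the CRC32 hashing, and the pseudo-embedding values below are all assumptions for illustration; a trained fastText model uses its own hashing and learned weights.

```python
import zlib


def char_ngrams(word, min_n=3, max_n=6):
    """Return the character n-grams of a word wrapped in '<' and '>' markers."""
    wrapped = f"<{word}>"
    return [
        wrapped[i:i + n]
        for n in range(min_n, max_n + 1)
        for i in range(len(wrapped) - n + 1)
    ]


def toy_vector(word, dim=4, buckets=1000):
    """Average toy per-bucket vectors over the word's n-grams (illustrative only)."""
    grams = char_ngrams(word)
    vec = [0.0] * dim
    for gram in grams:
        # Hash the n-gram into a fixed number of buckets (CRC32 is a stand-in hash).
        bucket = zlib.crc32(gram.encode("utf-8")) % buckets
        for d in range(dim):
            # Deterministic placeholder value; a trained model would look up
            # learned weights for this bucket instead.
            vec[d] += ((bucket * (d + 1)) % 7 - 3) / 3.0
    return [v / len(grams) for v in vec] if grams else vec


# Any string with at least one n-gram gets a vector, seen in training or not.
vector = toy_vector("citeworthiness")
```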


Author information

Corresponding author

Correspondence to Michael Färber.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Färber, M., Sampath, A. (2019). Determining How Citations Are Used in Citation Contexts. In: Doucet, A., Isaac, A., Golub, K., Aalberg, T., Jatowt, A. (eds) Digital Libraries for Open Knowledge. TPDL 2019. Lecture Notes in Computer Science, vol 11799. Springer, Cham.

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30759-2

  • Online ISBN: 978-3-030-30760-8

  • eBook Packages: Computer Science, Computer Science (R0)
