Predicting Citation Counts for Academic Literature Using Graph Pattern Mining

  • Nataliia Pobiedina
  • Ryutaro Ichise
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8482)


The citation count is an important factor to estimate the relevance and significance of academic publications. However, it is not possible to use this measure for papers which are too new. A solution to this problem is to estimate the future citation counts. There are existing works, which point out that graph mining techniques lead to the best results. We aim at improving the prediction of future citation counts by introducing a new feature. This feature is based on frequent graph pattern mining in the so-called citation network constructed on the basis of a dataset of scientific publications. Our new feature improves the accuracy of citation count prediction, and outperforms the state-of-the-art features in many cases which we show with experiments on two real datasets.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Garfield, E.: Impact factors, and why they won’t go away. Science 411(6837), 522 (2001)Google Scholar
  2. 2.
    Hirsch, J.: An index to quantify an individual’s scientific research output. Proc. the National Academy of Sciences of the United States America 102(46), 16569 (2005)CrossRefGoogle Scholar
  3. 3.
    Beel, J., Gipp, B.: Google scholar’s ranking algorithm: The impact of citation counts (an empirical study). In: Proc. RCIS, pp. 439–446 (2009)Google Scholar
  4. 4.
    Callaham, M., Wears, R., Weber, E.: Journal prestige, publication bias, and other characteristics associated with citation of published studies in peer-reviewed journals. Journal of the American Medical Association 287(21), 50–2847 (2002)Google Scholar
  5. 5.
    Kulkarni, A.V., Busse, J.W., Shams, I.: Characteristics associated with citation rate of the medical literature. PLOS one 2(5) (2007)Google Scholar
  6. 6.
    Livne, A., Adar, E., Teevan, J., Dumais, S.: Predicting citation counts using text and graph mining. In: Proc. the iConference 2013 Workshop on Computational Scientometrics: Theory and Applications (2013)Google Scholar
  7. 7.
    Bringmann, B., Berlingerio, M., Bonchi, F., Gionis, A.: Learning and predicting the evolution of social networks. IEEE Intelligent Systems 25, 26–35 (2010)CrossRefGoogle Scholar
  8. 8.
    Yan, R., Tang, J., Liu, X., Shan, D., Li, X.: Citation count prediction: learning to estimate future citations for literature. In: Proc. CIKM, pp. 1247–1252 (2011)Google Scholar
  9. 9.
    Shi, X., Leskovec, J., McFarland, D.A.: Citing for high impact. In: Proc. JCDL, pp. 49–58 (2010)Google Scholar
  10. 10.
    Barabasi, A.-L., Albert, R.: Emergence of scaling in random networks. Science Magazine 286(5439), 509–512 (1999)MathSciNetGoogle Scholar
  11. 11.
    Munasinghe, L., Ichise, R.: Time score: A new feature for link prediction in social networks. IEICE Trans. 95-D(3), 821–828 (2012)Google Scholar
  12. 12.
    Mcgovern, A., Friedl, L., Hay, M., Gallagher, B., Fast, A., Neville, J., Jensen, D.: Exploiting relational structure to understand publication patterns in high-energy physics. SIGKDD Explorations 5 (2003)CrossRefGoogle Scholar
  13. 13.
    Yan, R., Huang, C., Tang, J., Zhang, Y., Li, X.: To better stand on the shoulder of giants. In: Proc. JCDL, pp. 51–60 (2012)Google Scholar
  14. 14.
    Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Information Processing & Management 45(4), 427–437 (2009)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Nataliia Pobiedina
    • 1
  • Ryutaro Ichise
    • 2
  1. 1.Institute of Software Technology and Interactive SystemsVienna University of TechnologyAustria
  2. 2.Principles of Informatics Research DivisionNational Institute of InformaticsJapan

Personalised recommendations