Abstract
The citation count is an important factor to estimate the relevance and significance of academic publications. However, it is not possible to use this measure for papers which are too new. A solution to this problem is to estimate the future citation counts. There are existing works, which point out that graph mining techniques lead to the best results. We aim at improving the prediction of future citation counts by introducing a new feature. This feature is based on frequent graph pattern mining in the so-called citation network constructed on the basis of a dataset of scientific publications. Our new feature improves the accuracy of citation count prediction, and outperforms the state-of-the-art features in many cases which we show with experiments on two real datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Garfield, E.: Impact factors, and why they won’t go away. Science 411(6837), 522 (2001)
Hirsch, J.: An index to quantify an individual’s scientific research output. Proc. the National Academy of Sciences of the United States America 102(46), 16569 (2005)
Beel, J., Gipp, B.: Google scholar’s ranking algorithm: The impact of citation counts (an empirical study). In: Proc. RCIS, pp. 439–446 (2009)
Callaham, M., Wears, R., Weber, E.: Journal prestige, publication bias, and other characteristics associated with citation of published studies in peer-reviewed journals. Journal of the American Medical Association 287(21), 50–2847 (2002)
Kulkarni, A.V., Busse, J.W., Shams, I.: Characteristics associated with citation rate of the medical literature. PLOS one 2(5) (2007)
Livne, A., Adar, E., Teevan, J., Dumais, S.: Predicting citation counts using text and graph mining. In: Proc. the iConference 2013 Workshop on Computational Scientometrics: Theory and Applications (2013)
Bringmann, B., Berlingerio, M., Bonchi, F., Gionis, A.: Learning and predicting the evolution of social networks. IEEE Intelligent Systems 25, 26–35 (2010)
Yan, R., Tang, J., Liu, X., Shan, D., Li, X.: Citation count prediction: learning to estimate future citations for literature. In: Proc. CIKM, pp. 1247–1252 (2011)
Shi, X., Leskovec, J., McFarland, D.A.: Citing for high impact. In: Proc. JCDL, pp. 49–58 (2010)
Barabasi, A.-L., Albert, R.: Emergence of scaling in random networks. Science Magazine 286(5439), 509–512 (1999)
Munasinghe, L., Ichise, R.: Time score: A new feature for link prediction in social networks. IEICE Trans. 95-D(3), 821–828 (2012)
Mcgovern, A., Friedl, L., Hay, M., Gallagher, B., Fast, A., Neville, J., Jensen, D.: Exploiting relational structure to understand publication patterns in high-energy physics. SIGKDD Explorations 5 (2003)
Yan, R., Huang, C., Tang, J., Zhang, Y., Li, X.: To better stand on the shoulder of giants. In: Proc. JCDL, pp. 51–60 (2012)
Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Information Processing & Management 45(4), 427–437 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Pobiedina, N., Ichise, R. (2014). Predicting Citation Counts for Academic Literature Using Graph Pattern Mining. In: Ali, M., Pan, JS., Chen, SM., Horng, MF. (eds) Modern Advances in Applied Intelligence. IEA/AIE 2014. Lecture Notes in Computer Science(), vol 8482. Springer, Cham. https://doi.org/10.1007/978-3-319-07467-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-07467-2_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07466-5
Online ISBN: 978-3-319-07467-2
eBook Packages: Computer ScienceComputer Science (R0)