Skip to main content

Predicting the Future Impact of Academic Publications

  • Conference paper
Progress in Artificial Intelligence (EPIA 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8154))

Included in the following conference series:

  • 2874 Accesses

Abstract

Predicting the future impact of academic publications has many important applications. In this paper, we propose methods for predicting future article impact, leveraging digital libraries of academic publications containing citation information. Using a set of successive past impact scores, obtained through graph-ranking algorithms such as PageRank, we study the evolution of the publications in terms of their yearly impact scores, learning regression models to predict the future PageRank scores, or to predict the future number of downloads. Results obtained over a DBLP citation dataset, covering papers published up to the year of 2011, show that the impact predictions are highly accurate for all experimental setups. A model based on regression trees, using features relative to PageRank scores, PageRank change rates, author PageRank scores, and term occurrence frequencies in the abstracts and titles of the publications, computed over citation graphs from the three previous years, obtained the best results.

This work was partially supported by Fundação para a Ciência e a Tecnologia (FCT), through project grants with references UTA-EST/MAI/0006/2009(REACTION) and PTDC/EIA-EIA/109840/2009 (SInteliGIS), as well as through PEst-OE/EEI/LA0021/2013 (INESC-ID plurianual funding).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Breiman, L.: Random Forests. Machine Learning 45(1) (2001)

    Google Scholar 

  2. Chen, P., Xie, H., Maslov, S., Redner, S.: Finding scientific gems with Google’s PageRank algorithm. Journal of Informetrics 1(1) (2007)

    Google Scholar 

  3. Chien, S., Dwork, C., Kumar, R., Simon, D.R., Sivakumar, D.: Link evolution: Analysis and algorithms. Internet Mathematics 1(3) (2003)

    Google Scholar 

  4. Davis, J.V., Dhillon, I.S.: Estimating the global PageRank of web communities. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)

    Google Scholar 

  5. Friedman, J.H.: Greedy function approximation: A gradient boosting machine. Annals of Statistics 29(5) (2000)

    Google Scholar 

  6. Kan, M.-Y., Thi, H.O.N.: Fast webpage classiffication using URL features. In: Proceedings of the ACM International Conference on Information and Knowledge Management (2005)

    Google Scholar 

  7. Kaul, R., Yun, Y., Kim, S.-G.: Ranking billions of web pages using diodes. Communications of ACM 52(8) (2009)

    Google Scholar 

  8. Kendall, M.G.: A new measure of rank correlation. Biometrika 30(1/2) (1938)

    Google Scholar 

  9. Langville, A., Meyer, C.D.: Survey: Deeper inside PageRank. Internet Mathematics 1(3) (2003)

    Google Scholar 

  10. Lerman, K., Ghosh, R., Kang, J.H.: Centrality metric for dynamic networks. In: Proceedings of the Workshop on Mining and Learning with Graphs (2010)

    Google Scholar 

  11. Mohan, A., Chen, Z., Weinberger, K.Q.: Web-search ranking with initialized gradient boosted regression trees. Journal of Machine Learning Research 14 (2011)

    Google Scholar 

  12. Radicchi, F., Fortunato, S., Markines, B., Vespignani, A.: Diffusion of scientific credits and the ranking of scientists. Physical Review (2009)

    Google Scholar 

  13. Sayyadi, H., Getoor, L.: Future rank: Ranking scientific articles by predicting their future PageRank. In: Proceedings of the SIAM International Conference on Data Mining (2009)

    Google Scholar 

  14. Spearman, C.: The proof and measurement of association between two things. American Journal of Psychology 15 (1904)

    Google Scholar 

  15. Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., Su, Z.: ArnetMiner: Extraction and mining of academic social networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008)

    Google Scholar 

  16. Vazirgiannis, M., Drosos, D., Senellart, P., Vlachou, A.: Web page rank prediction with Markov models. In: Proceedings of the International Conference on World Wide Web (2008)

    Google Scholar 

  17. Voudigari, E., Pavlopoulos, J., Vazirgiannis, M.: A framework for web Page Rank prediction. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds.) EANN/AIAI 2011, Part II. IFIP AICT, vol. 364, pp. 240–249. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  18. Walker, D., Xie, H., Yan, K.-K., Maslov, S.: Ranking scientific publications using a simple model of network traffic. Technical Report CoRR, abs/physics/0612122 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bento, C., Martins, B., Calado, P. (2013). Predicting the Future Impact of Academic Publications. In: Correia, L., Reis, L.P., Cascalho, J. (eds) Progress in Artificial Intelligence. EPIA 2013. Lecture Notes in Computer Science(), vol 8154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40669-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40669-0_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40668-3

  • Online ISBN: 978-3-642-40669-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics