Abstract:
It is shown that the generalized Pareto distribution gives a good fit to citable documents, citations above a threshold and also for the h-index of countries. The h-index has a finite second moment, while the citable documents and citations are extremely heavy tailed with the estimated index of citations less than one. The relationship derived between the h-index, citation and number of publications is also investigated and the model proposed by Glänzel confirmed empirically.
Similar content being viewed by others
References
Barcza, K., & Telcs, A. (2009). Paretian publication patterns imply Paretian Hirsch index. Scientometrics, 81(2), 513–519. doi:10.1007/s11192-008-2175-8.
Beirlant, J., & Einmahl, J. H. J. (2010). Asymptotics for the Hirsch index. Scandinavian Journal of Statistics, 37, 355–364.
Beirlant, J., Glänzel, W., Carbonez, A., & Leemans, H. (2007). Scoring research outputs using statistical quantile plotting. Journal of Informetrics, 1, 185–192.
Egghe, L., & Rousseau, R. (2006). An informetric model for the Hirsch-index. Scientometrics, 69(1), 121–129.
Embrechts, P., Klüppelberg, C. & Mikosch, T. (1997). Modelling Extremal Events for Insurance and Finance. Berlin: Springer-Verlag.
Glänzel, W. (2006). On the H-index a mathematical approach to a new measure of publication activity and citation impact. Scientometrics, 67(2), 315–321.
Hirsch, J. A. (2005). An index to quantify an individual’s scientific research output. Proceedings of National Academy of Sciences USA, 102(46), 16569–16572. doi:10.1073/pnas.0507655102.
Johnson, N. L., Kotz, S., & Balakrishnan, N. (1994). Continuous univariate distributions (Vol. 1). New York: Wiley.
Moed, H. F. (2002). The impact-factors debate: The ISI’s uses and limits. Nature, 415(6873), 731–732.
Pratelli, L., Baccini, A., Barabesi, L., Marcheselli, M. (2011). Statistical analysis of the Hirsch index. Scandinavian Journal of Statistics. arXiv:1102.2701v1 [math.ST]. Forthcoming
Redner, S. (1998). How popular is your paper? An empirical study of the citation distribution. European Physical Journal B, 4, 131–134.
SCImago. (2007). SJR—SCImago Journal & Country Rank. http://www.scimagojr.com. Retrieved 8 Feb 2012.
van Zyl, J. M. (2011). A median regression model to estimate the parameters of the three-parameter generalized Pareto distribution. Communications in Statistics, Simulation and Computation, 41(4), 544–553.
Ye, F. Y. (2011). A unification of three models for the h-index. Journal of the American Society for Information Science and Technology, 62(1), 205–207.
Zhang, J., & Stephens, M. A. (2009). A new and efficient estimation method for the generalized Pareto distribution. Technometrics, 51(3), 316–325.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
van Zyl, J.M. The generalized Pareto distribution fitted to research outputs of countries. Scientometrics 94, 1099–1109 (2013). https://doi.org/10.1007/s11192-012-0798-2
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-012-0798-2