Abstract
The number of citations that a patent receives is considered an important indicator of the quality and impact of the patent. However, a variety of methods and data sources can be used to calculate this measure. This paper evaluates similarities between citation indicators that differ in terms of (a) the patent office where the focal patent application is filed; (b) whether citations from offices other than that of the application office are considered; and (c) whether the presence of patent families is taken into account. We analyze the correlations between these different indicators and the overlap between patents identified as highly cited by the various measures. Our findings reveal that the citation indicators obtained differ substantially. Favoring one way of calculating a citation indicator over another has non-trivial consequences and, hence, should be given explicit consideration. Correcting for patent families, especially when using a broader definition (INPADOC), provides the most uniform results.
Similar content being viewed by others
Notes
Often referred to as patent citations, forward citations or patent citation count. We will use these terms throughout this paper.
Up to 88 % of applications score a non-zero citation count on at least one of the citation indicators we computed.
This paper needs to be cited when the NBER database is used.
We show this later in Table 4.
The ‘duty of candor’ rule requires that applicant and inventors involved in a patent application must disclose all known information which may adversely affect the probability of obtaining a granted patent.
INPADOC is an abbreviation for INternational PAtent DOCumentation, the patent data collected but not generated by the EPO (2014). It is also used to denote the extended patent family in the EPO PATSTAT databases.
DOCDB is the EPO master documentation database (Martínez 2011). It is also used to denote the examiner’s technology-based patent family in the EPO PATSTAT databases.
Albrecht et al. (2010) define the DOCDB patent family as patent applications that have an equal ‘priority picture’: this can, under certain circumstances, include the priority application itself. Additionally, this family is corrected to include applications that have the same technical content but have been excluded due to a ‘discrepancy in the priority picture’ Albrecht et al. (2010: 283).
This statement holds for the vast majority of patent applications in the EPO PATSTAT database; there is a small minority of patents (0.09 % of DOCDB patent families) that do not fulfill this criterion due to discrepancies in their priority picture. However, these families do not affect the analyses presented later in this paper.
These imply changes in publication types; patent duplicates that occur before and after 2001; and applications that are not available before 2001 but partly available thereafter.
These are added to the database to maintain logical links and do not actually represent any patent applications.
In the case of applications filed through the PCT, other applications that followed this route were taken.
The exact figures are: 21 % for EPO applications, 12 % for USPTO applications and 37 % for PCT applications.
The size of the groups of highly cited patents identified by the 5 SD outlier criterion varies between 765 and 35,145 depending on the source office and indicator specification.
References
Ahuja, G., & Lampert, C. Morris. (2001). Entrepreneurship in the large corporation: A longitudinal study of how established firms create breakthrough inventions. Strategic Management Journal, 22(6–7), 521–543.
Albert, M. B., Avery, D., Narin, F., & McAllister, P. (1991). Direct validation of citation counts as indicators of industrially important patents. Research Policy, 20(3), 251–259.
Albrecht, M. A., Bosma, R., van Dinter, T., Ernst, J. L., van Ginkel, K., & Versloot-Spoelstra, F. (2010). Quality assurance in the EPO patent information resource. World Patent Information, 32(4), 279–286.
Alcácer, J., & Gittelman, M. (2006). Patent citations as a measure of knowledge flows: The influence of examiner citations. The Review of Economics and Statistics, 88(4), 774–779.
Arts, S., Appio, F., & Van Looy, B. (2012). Inventions shaping technological trajectories: Do existing patent indicators provide a comprehensive picture? Scientometrics, 97(2), 397–419.
Azagra-Caro, J. M., Mattsson, P., & Perruchas, F. (2011). Smoothing the lies: The distinctive effects of patent characteristics on examiner and applicant citations. Journal of the American Society for Information Science and Technology, 62(9), 1727–1740.
Baumol, W. J. (2004). Education for innovation: entrepreneurial breakthroughs vs. corporate incremental improvements (No. w10578). National Bureau of Economic Research.
Carpenter, M. P., Narin, F., & Woolf, P. (1981). Citation rates to technologically important patents. World Patent Information, 3(4), 160–163.
Chakrabarti, A. K. (1991). Competition in high technology: Analysis of patents of US, Japan, UK, France, West Germany, and Canada. Engineering Management, IEEE Transactions on, 38(1), 78–84.
Criscuolo, P. (2006). The ‘home advantage’ effect and patent families. A comparison of OECD triadic patents, the USPTO and the EPO. Scientometrics, 66(1), 23–41.
Criscuolo, P., & Verspagen, B. (2008). Does it matter where patent citations come from? Inventor vs. examiner citations in European patents. Research Policy, 37(10), 1892–1908.
Dosi, G. (1982). Technological paradigms and technological trajectories: A suggested interpretation of the determinants and directions of technical change. Research policy, 11(3), 147–162.
EPO. (2014). What’s in a name? Patent Information News, 4, 4.
Frietsch, R., Neuhäusler, P., Jung, T., & Van Looy, B. (2014). Patent indicators for macroeconomic growth—The value of patents estimated by export volume. Technovation, 34(9), 546–558.
Gambardella, A., Harhoff, D., & Verspagen, B. (2008). The value of European patents. European Management Review, 5(2), 69–84.
Graham, S., & Harhoff, D. (2006). Can post-grant reviews improve patent system design? A twin study of European and US patents. CEPR Discussion Paper No. 5680, CEPR London.
Hagedoorn, J., & Cloodt, M. (2003). Measuring innovative performance: Is there an advantage in using multiple indicators? Research Policy, 32(8), 1365–1379.
Hall, B. H., Jaffe, A. B., & Trajtenberg, M. (2001). The NBER patent citation data file: Lessons, insights and methodological tools (No. w8498). National Bureau of Economic Research.
Hall, B. H., Jaffe, A. B, & Trajtenberg, M. (2005). Market value and patent citations. RAND Journal of Economics,36(1), 16–38.
Harhoff, D., & Reitzig, M. (2004). Determinants of opposition against EPO patent grants—The case of biotechnology and pharmaceuticals. International Journal of Industrial Organization, 22(4), 443–480.
Harhoff, D., Scherer, F. M., & Vopel, K. (2003). Citations, family size, opposition and the value of patent rights. Research Policy, 32(8), 1343-1363.
Harris, C. W., & Kaiser, H. F. (1964). Oblique factor analytic solutions by orthogonal transformations. Psychometrika, 29(4), 347–362.
Jaffe, A. B., Trajtenberg, M., & Fogarty, M. S. (2000). Knowledge spillovers and patent citations: Evidence from a survey of inventors. The American Economic Review, 90(2), 215–218.
Jaffe, A. B., Trajtenberg, M., & Henderson, R. (1993). Geographic localization of knowledge spillovers as evidenced by patent citations. The Quarterly Journal of Economics, 108(3), 577–598.
Karki, M. M. S. (1997). Patent citation analysis: A policy analysis tool. World Patent Information, 19(4), 269–272.
Lanjouw, J. O., & Schankerman, M. (2004). Patent quality and research productivity: Measuring innovation with multiple indicators. The Economic Journal, 114(495), 441–465.
Lingua, D. G. (2005). INPADOC: 30 years of endeavours yet unmapped territories remain! World Patent Information, 27(2), 105–111.
MacGarvie, M. (2006). Do firms learn from international trade? Review of Economics and Statistics, 88(1), 46–60.
Magerman, T., Van Looy, B., & Debackere, K. (2011). In search of anticommons: Patent-paper pairs in biotechnology. An analysis of citation flows. MSI FEB Working paper, KU Leuven.
Martínez, C. (2011). Patent families: When do different definitions really matter? Scientometrics, 86(1), 39–63.
Michel, J., & Bettels, B. (2001). Patent citation analysis. A closer look at the basic input data from patent search reports. Scientometrics, 51(1), 185–201.
Nakamura, H., Suzuki, S., Kajikawa, Y., & Osawa, M. (2015). The effect of patent family information in patent citation network analysis: A comparative case study in the drivetrain domain. Scientometrics, 104(2), 437–452.
Narin, F., Noma, E., & Perry, R. (1987). Patents as indicators of corporate technological strength. Research Policy, 16(2), 143–155.
Neuhäusler, P., & Frietsch, R. (2012). Patent families as macro level patent value indicators: Applying weights to account for market differences. Scientometrics, 96(1), 1–23.
Neuhäusler, P., Frietsch, R., Schubert, T., & Blind, K. (2011). Patents and the financial performance of firms—An analysis based on stock market data (No. 28). Fraunhofer ISI discussion papers innovation systems and policy analysis.
Paci, R., & Usai, S. (2009). Knowledge flows across European regions. The Annals of Regional Science, 43(3), 669–690.
Sampat, B. N. (2004). Examining patent examination: an analysis of examiner and applicant generated prior art (Doctoral dissertation, University of Michigan).
SAS Institute Inc. (2008). SAS/STAT® 9.2 User’s Guide. Cary, NC: SAS Institute Inc.
Schoenmakers, W., & Duysters, G. (2010). The technological origins of radical inventions. Research Policy, 39(8), 1051–1059.
Trajtenberg, M. (1990). A penny for your quotes: Patent citations and the value of innovations. The RAND Journal of Economics,12(1), 172–187.
van Pottelsberghe de la Potterie, B., & François, D. (2009). The cost factor in patent systems. Journal of Industry, Competition and Trade, 9(4), 329–355.
Webb, C., Dernis, H., Harhoff, D., & Hoisl, K. (2005). Analysing European and international patent citations: A set of EPO patent database building blocks, OCDE Science. Technology and Industry Working Paper, 9.
Acknowledgments
Lin Zhang acknowledges the support of the National Natural Science Foundation of China Grant No. 71103064.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Correlation between indicators from the same office
Appendix 2: Correlation between indicators from different offices
Appendix 3: Variable cluster method
This appendix explains the cluster algorithm that was used to cluster indicators. This method is an implementation of the VARCLUS procedure in the SAS® software package (SAS Institute 2008). What follows are excerpts from the SAS manual (SAS Institute 2008: 7461–7463) explaining the logic of the underlying procedure. Our specific settings are detailed in italics. Options not related to our analysis have been omitted.
‘The VARCLUS procedure divides a set of numeric variables into disjoint or hierarchical clusters. Associated with each cluster is a linear combination of the variables in the cluster. The linear combination used here consists of the first principal component. (…) The first principal component is a weighted average of the variables that explains as much variance as possible.
(…)
The VARCLUS procedure tries to maximize the variance that is explained by the cluster components, summed over all the clusters. The cluster components are oblique, not orthogonal, even when the cluster components are first principal components. In an ordinary principal component analysis, all components are computed from the same variables, and the first principal component is orthogonal to the second principal component and to every other principal component. In the VARCLUS procedure, each cluster component is computed from a different set of variables than all the other cluster components. The first principal component of one cluster might be correlated with the first principal component of another cluster. Hence, the VARCLUS algorithm is a type of oblique component analysis.
We use the correlation matrices as input for the principal component analysis used in the VARCLUS procedure (…)
The VARCLUS algorithm is both divisive and iterative. By default, the VARCLUS procedure begins with all variables in a single cluster. It then repeats the following steps:
-
1.
A cluster is chosen for splitting. Depending on (…) the largest eigenvalue associated with the second principal component (…)
-
2.
The chosen cluster is split into two clusters by finding the first two principal components, performing an orthoblique rotation (raw quartimax rotation on the eigenvectors; Harris and Kaiser 1964), and assigning each variable to the rotated component with which it has the higher squared correlation.
-
3.
Variables are iteratively reassigned to clusters to try to maximize the variance accounted for by the cluster components.
(…)VARCLUS stops splitting when every cluster has only one eigenvalue greater than one, thus satisfying the most popular criterion for determining the sufficiency of a single underlying dimension.’
Rights and permissions
About this article
Cite this article
Bakker, J., Verhoeven, D., Zhang, L. et al. Patent citation indicators: One size fits all?. Scientometrics 106, 187–211 (2016). https://doi.org/10.1007/s11192-015-1786-0
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-015-1786-0