Skip to main content
Log in

Patent citation indicators: One size fits all?

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

The number of citations that a patent receives is considered an important indicator of the quality and impact of the patent. However, a variety of methods and data sources can be used to calculate this measure. This paper evaluates similarities between citation indicators that differ in terms of (a) the patent office where the focal patent application is filed; (b) whether citations from offices other than that of the application office are considered; and (c) whether the presence of patent families is taken into account. We analyze the correlations between these different indicators and the overlap between patents identified as highly cited by the various measures. Our findings reveal that the citation indicators obtained differ substantially. Favoring one way of calculating a citation indicator over another has non-trivial consequences and, hence, should be given explicit consideration. Correcting for patent families, especially when using a broader definition (INPADOC), provides the most uniform results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. Often referred to as patent citations, forward citations or patent citation count. We will use these terms throughout this paper.

  2. Up to 88 % of applications score a non-zero citation count on at least one of the citation indicators we computed.

  3. This paper needs to be cited when the NBER database is used.

  4. We show this later in Table 4.

  5. The ‘duty of candor’ rule requires that applicant and inventors involved in a patent application must disclose all known information which may adversely affect the probability of obtaining a granted patent.

  6. INPADOC is an abbreviation for INternational PAtent DOCumentation, the patent data collected but not generated by the EPO (2014). It is also used to denote the extended patent family in the EPO PATSTAT databases.

  7. DOCDB is the EPO master documentation database (Martínez 2011). It is also used to denote the examiner’s technology-based patent family in the EPO PATSTAT databases.

  8. Albrecht et al. (2010) define the DOCDB patent family as patent applications that have an equal ‘priority picture’: this can, under certain circumstances, include the priority application itself. Additionally, this family is corrected to include applications that have the same technical content but have been excluded due to a ‘discrepancy in the priority picture’ Albrecht et al. (2010: 283).

  9. This statement holds for the vast majority of patent applications in the EPO PATSTAT database; there is a small minority of patents (0.09 % of DOCDB patent families) that do not fulfill this criterion due to discrepancies in their priority picture. However, these families do not affect the analyses presented later in this paper.

  10. These imply changes in publication types; patent duplicates that occur before and after 2001; and applications that are not available before 2001 but partly available thereafter.

  11. These are added to the database to maintain logical links and do not actually represent any patent applications.

  12. An example of this pertains to the well-known issue that EPO references other patents by referring to the references of their PCT equivalents via a non-patent reference in PATSTAT. This has been noted in Harhoff et al. (2003) and Neuhäusler et al. (2011).

  13. In the case of applications filed through the PCT, other applications that followed this route were taken.

  14. The exact figures are: 21 % for EPO applications, 12 % for USPTO applications and 37 % for PCT applications.

  15. The size of the groups of highly cited patents identified by the 5 SD outlier criterion varies between 765 and 35,145 depending on the source office and indicator specification.

References

  • Ahuja, G., & Lampert, C. Morris. (2001). Entrepreneurship in the large corporation: A longitudinal study of how established firms create breakthrough inventions. Strategic Management Journal, 22(6–7), 521–543.

    Article  Google Scholar 

  • Albert, M. B., Avery, D., Narin, F., & McAllister, P. (1991). Direct validation of citation counts as indicators of industrially important patents. Research Policy, 20(3), 251–259.

    Article  Google Scholar 

  • Albrecht, M. A., Bosma, R., van Dinter, T., Ernst, J. L., van Ginkel, K., & Versloot-Spoelstra, F. (2010). Quality assurance in the EPO patent information resource. World Patent Information, 32(4), 279–286.

    Article  Google Scholar 

  • Alcácer, J., & Gittelman, M. (2006). Patent citations as a measure of knowledge flows: The influence of examiner citations. The Review of Economics and Statistics, 88(4), 774–779.

    Article  Google Scholar 

  • Arts, S., Appio, F., & Van Looy, B. (2012). Inventions shaping technological trajectories: Do existing patent indicators provide a comprehensive picture? Scientometrics, 97(2), 397–419.

    Article  Google Scholar 

  • Azagra-Caro, J. M., Mattsson, P., & Perruchas, F. (2011). Smoothing the lies: The distinctive effects of patent characteristics on examiner and applicant citations. Journal of the American Society for Information Science and Technology, 62(9), 1727–1740.

    Article  Google Scholar 

  • Baumol, W. J. (2004). Education for innovation: entrepreneurial breakthroughs vs. corporate incremental improvements (No. w10578). National Bureau of Economic Research.

  • Carpenter, M. P., Narin, F., & Woolf, P. (1981). Citation rates to technologically important patents. World Patent Information, 3(4), 160–163.

    Article  Google Scholar 

  • Chakrabarti, A. K. (1991). Competition in high technology: Analysis of patents of US, Japan, UK, France, West Germany, and Canada. Engineering Management, IEEE Transactions on, 38(1), 78–84.

    Article  Google Scholar 

  • Criscuolo, P. (2006). The ‘home advantage’ effect and patent families. A comparison of OECD triadic patents, the USPTO and the EPO. Scientometrics, 66(1), 23–41.

    Article  Google Scholar 

  • Criscuolo, P., & Verspagen, B. (2008). Does it matter where patent citations come from? Inventor vs. examiner citations in European patents. Research Policy, 37(10), 1892–1908.

    Article  Google Scholar 

  • Dosi, G. (1982). Technological paradigms and technological trajectories: A suggested interpretation of the determinants and directions of technical change. Research policy, 11(3), 147–162.

  • EPO. (2014). What’s in a name? Patent Information News, 4, 4.

    Google Scholar 

  • Frietsch, R., Neuhäusler, P., Jung, T., & Van Looy, B. (2014). Patent indicators for macroeconomic growth—The value of patents estimated by export volume. Technovation, 34(9), 546–558.

    Article  Google Scholar 

  • Gambardella, A., Harhoff, D., & Verspagen, B. (2008). The value of European patents. European Management Review, 5(2), 69–84.

    Article  Google Scholar 

  • Graham, S., & Harhoff, D. (2006). Can post-grant reviews improve patent system design? A twin study of European and US patents. CEPR Discussion Paper No. 5680, CEPR London.

  • Hagedoorn, J., & Cloodt, M. (2003). Measuring innovative performance: Is there an advantage in using multiple indicators? Research Policy, 32(8), 1365–1379.

    Article  Google Scholar 

  • Hall, B. H., Jaffe, A. B., & Trajtenberg, M. (2001). The NBER patent citation data file: Lessons, insights and methodological tools (No. w8498). National Bureau of Economic Research.

  • Hall, B. H., Jaffe, A. B, & Trajtenberg, M. (2005). Market value and patent citations. RAND Journal of Economics,36(1), 16–38.

  • Harhoff, D., & Reitzig, M. (2004). Determinants of opposition against EPO patent grants—The case of biotechnology and pharmaceuticals. International Journal of Industrial Organization, 22(4), 443–480.

    Article  Google Scholar 

  • Harhoff, D., Scherer, F. M., & Vopel, K. (2003). Citations, family size, opposition and the value of patent rights. Research Policy, 32(8), 1343-1363.

    Google Scholar 

  • Harris, C. W., & Kaiser, H. F. (1964). Oblique factor analytic solutions by orthogonal transformations. Psychometrika, 29(4), 347–362.

    Article  MathSciNet  Google Scholar 

  • Jaffe, A. B., Trajtenberg, M., & Fogarty, M. S. (2000). Knowledge spillovers and patent citations: Evidence from a survey of inventors. The American Economic Review, 90(2), 215–218.

    Article  Google Scholar 

  • Jaffe, A. B., Trajtenberg, M., & Henderson, R. (1993). Geographic localization of knowledge spillovers as evidenced by patent citations. The Quarterly Journal of Economics, 108(3), 577–598.

    Article  Google Scholar 

  • Karki, M. M. S. (1997). Patent citation analysis: A policy analysis tool. World Patent Information, 19(4), 269–272.

    Article  MathSciNet  Google Scholar 

  • Lanjouw, J. O., & Schankerman, M. (2004). Patent quality and research productivity: Measuring innovation with multiple indicators. The Economic Journal, 114(495), 441–465.

    Article  Google Scholar 

  • Lingua, D. G. (2005). INPADOC: 30 years of endeavours yet unmapped territories remain! World Patent Information, 27(2), 105–111.

    Article  Google Scholar 

  • MacGarvie, M. (2006). Do firms learn from international trade? Review of Economics and Statistics, 88(1), 46–60.

    Article  Google Scholar 

  • Magerman, T., Van Looy, B., & Debackere, K. (2011). In search of anticommons: Patent-paper pairs in biotechnology. An analysis of citation flows. MSI FEB Working paper, KU Leuven.

  • Martínez, C. (2011). Patent families: When do different definitions really matter? Scientometrics, 86(1), 39–63.

    Article  Google Scholar 

  • Michel, J., & Bettels, B. (2001). Patent citation analysis. A closer look at the basic input data from patent search reports. Scientometrics, 51(1), 185–201.

    Article  Google Scholar 

  • Nakamura, H., Suzuki, S., Kajikawa, Y., & Osawa, M. (2015). The effect of patent family information in patent citation network analysis: A comparative case study in the drivetrain domain. Scientometrics, 104(2), 437–452.

    Article  Google Scholar 

  • Narin, F., Noma, E., & Perry, R. (1987). Patents as indicators of corporate technological strength. Research Policy, 16(2), 143–155.

    Article  Google Scholar 

  • Neuhäusler, P., & Frietsch, R. (2012). Patent families as macro level patent value indicators: Applying weights to account for market differences. Scientometrics, 96(1), 1–23.

  • Neuhäusler, P., Frietsch, R., Schubert, T., & Blind, K. (2011). Patents and the financial performance of firmsAn analysis based on stock market data (No. 28). Fraunhofer ISI discussion papers innovation systems and policy analysis.

  • Paci, R., & Usai, S. (2009). Knowledge flows across European regions. The Annals of Regional Science, 43(3), 669–690.

    Article  Google Scholar 

  • Sampat, B. N. (2004). Examining patent examination: an analysis of examiner and applicant generated prior art (Doctoral dissertation, University of Michigan).

  • SAS Institute Inc. (2008). SAS/STAT® 9.2 User’s Guide. Cary, NC: SAS Institute Inc.

  • Schoenmakers, W., & Duysters, G. (2010). The technological origins of radical inventions. Research Policy, 39(8), 1051–1059.

    Article  Google Scholar 

  • Trajtenberg, M. (1990). A penny for your quotes: Patent citations and the value of innovations. The RAND Journal of Economics,12(1), 172–187.

  • van Pottelsberghe de la Potterie, B., & François, D. (2009). The cost factor in patent systems. Journal of Industry, Competition and Trade, 9(4), 329–355.

    Article  Google Scholar 

  • Webb, C., Dernis, H., Harhoff, D., & Hoisl, K. (2005). Analysing European and international patent citations: A set of EPO patent database building blocks, OCDE Science. Technology and Industry Working Paper, 9.

Download references

Acknowledgments

Lin Zhang acknowledges the support of the National Natural Science Foundation of China Grant No. 71103064.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jurriën Bakker.

Appendices

Appendix 1: Correlation between indicators from the same office

See Tables 11, 12 and 13.

Table 11 Correlation of indicators of patents filed at the EPO
Table 12 Correlation of indicators of patents filed at the USPTO
Table 13 Correlation of indicators of patents filed at the PCT

Appendix 2: Correlation between indicators from different offices

See Tables 14, 15 and 16.

Table 14 Correlation coefficients of indicators pertaining to patents filed both at the EPO (columns) and the USPTO (rows)
Table 15 Correlation coefficients of indicators pertaining to patents filed both at the EPO (columns) and the PCT (rows)
Table 16 Correlation coefficients of indicators pertaining to patents filed both at the USPTO (columns) and the PCT (rows)

Appendix 3: Variable cluster method

This appendix explains the cluster algorithm that was used to cluster indicators. This method is an implementation of the VARCLUS procedure in the SAS® software package (SAS Institute 2008). What follows are excerpts from the SAS manual (SAS Institute 2008: 7461–7463) explaining the logic of the underlying procedure. Our specific settings are detailed in italics. Options not related to our analysis have been omitted.

‘The VARCLUS procedure divides a set of numeric variables into disjoint or hierarchical clusters. Associated with each cluster is a linear combination of the variables in the cluster. The linear combination used here consists of the first principal component. (…) The first principal component is a weighted average of the variables that explains as much variance as possible.

(…)

The VARCLUS procedure tries to maximize the variance that is explained by the cluster components, summed over all the clusters. The cluster components are oblique, not orthogonal, even when the cluster components are first principal components. In an ordinary principal component analysis, all components are computed from the same variables, and the first principal component is orthogonal to the second principal component and to every other principal component. In the VARCLUS procedure, each cluster component is computed from a different set of variables than all the other cluster components. The first principal component of one cluster might be correlated with the first principal component of another cluster. Hence, the VARCLUS algorithm is a type of oblique component analysis.

We use the correlation matrices as input for the principal component analysis used in the VARCLUS procedure (…)

The VARCLUS algorithm is both divisive and iterative. By default, the VARCLUS procedure begins with all variables in a single cluster. It then repeats the following steps:

  1. 1.

    A cluster is chosen for splitting. Depending on (…) the largest eigenvalue associated with the second principal component (…)

  2. 2.

    The chosen cluster is split into two clusters by finding the first two principal components, performing an orthoblique rotation (raw quartimax rotation on the eigenvectors; Harris and Kaiser 1964), and assigning each variable to the rotated component with which it has the higher squared correlation.

  3. 3.

    Variables are iteratively reassigned to clusters to try to maximize the variance accounted for by the cluster components.

(…)VARCLUS stops splitting when every cluster has only one eigenvalue greater than one, thus satisfying the most popular criterion for determining the sufficiency of a single underlying dimension.’

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bakker, J., Verhoeven, D., Zhang, L. et al. Patent citation indicators: One size fits all?. Scientometrics 106, 187–211 (2016). https://doi.org/10.1007/s11192-015-1786-0

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-015-1786-0

Keywords

JEL Classification

Mathematics Subject Classification

Navigation