Skip to main content
Log in

Clustering patents using non-exhaustive overlaps

  • Published:
Journal of Systems Science and Systems Engineering Aims and scope Submit manuscript

Abstract

Patent documents are unique external sources of information that reveal the core technology underlying new inventions. Patents also serve as a strategic data source that can be mined to discover state-of-the-art technical development and subsequently help guide R&D investments. This research incorporates an ontology schema to extract and represent patent concepts. A clustering algorithm with non-exhaustive overlaps is proposed to overcome deficiencies with exhaustive clustering methods used in patent mining and technology discovery. The non-exhaustive clustering approach allows for the clustering of patent documents with overlapping technical findings and claims, a feature that enables the grouping of patents that define related key innovations. Legal advisors can use this approach to study potential cases of patent infringement or devise strategies to avoid litigation. The case study demonstrates the use of non-exhaustive overlaps algorithm by clustering US and Japan radio frequency identification (RFID) patents and by analyzing the legal implications of automated discovery of patent infringement.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Anderberg, M. (1973). Cluster Analysis for Applications. Academic Press, New York

    MATH  Google Scholar 

  2. Berkhin, P. (2002). Survey of clustering data mining techniques. Technical Report, Accrue Software, Inc.

  3. Berry, M.J.A. & Linoff, G. (1997). Data Mining Techniques: For Marketing, Sale, and Customer Support. John Wiley & Sons Inc.

  4. Chen, B., Tai, P.C., Harrison, R. & Pan, Y. (2005). Novel hybrid hierarchical k-means clustering method (H-K-means) for microarray analysis. In: Proceedings of Computational Systems Bioinformatics Conference, Sandford CA, USA, August 8–11, 2005

  5. Chen, E. & Wu, G. (2005). An ontology learning method enhanced by frame semantics. In: Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM), 374–382

  6. Chen, T.S., Tsai, T.H., Chen, Y.T., Lin, C.C., Chen, R.C., Li, S.Y. & Chen, H.Y. (2005). A combined k-means and hierarchical clustering method for improving the clustering efficiency of microarray. In: Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, 405–408, December 13–16, 2005

  7. Grandstrand, O. (1999). The Economics and Management of Intellectual Property: Toward Intellectual Capitalism. Edward Elgar Publishing

  8. Grilliches, Z. (1990). Patent statistics as economic indicators: a survey. Journal of Economic Literature, 28(4): 1661–1707

    Google Scholar 

  9. Gupta, V.K. (1999). Technological trends in the area of fullerenes using bibliometric analysis of patents. Scientometrics, 44(1): 17–31

    Article  Google Scholar 

  10. Han, J. & Kamber, M. (2000). Data Mining: Concepts and Techniques. Morgan Kaufman

  11. Hong S. (2009). The magic of patent information. World Intellectual Property Organization (WIPO). Available via DIALOG. http://www.wipo.int/sme/en/documents/patent_information.htm. Cited December 1, 2009

  12. Hu, H.L. (2006). Optimization in data mining an overlapping cluster algorithm to provide non-exhaustive clustering. Ph.D. Thesis, Department of Information Management, National Central University, Chung-Li, Taiwan, China

    Google Scholar 

  13. Japan Patent Office (JPO). (2009). Available via DIALOG. http://www.jpo.go.jp/. Cited December 1, 2009

  14. Jones, K.S. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1): 11–20

    Article  Google Scholar 

  15. Kim, Y.G., Suh, J.H. & Park, S.C. (2008). Visualization of patent analysis for emerging technology. Expert Systems with Applications, 34: 1804–1812

    Article  Google Scholar 

  16. Lai, K.K. & Wu, S.J. (2005). Using the patent co-citation approach to establish a new patent classification system. Information Processing and Management, 41: 313–330

    Article  Google Scholar 

  17. Luhn, H.P. (1957). A statistical approach to mechanized encoding and searching of literary information. IBM Journal of Research and Development, 1(4): 309–317

    Article  MathSciNet  Google Scholar 

  18. MacQueen, J.B. (1967). Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1: 281–297

    MathSciNet  Google Scholar 

  19. Mogee, M.E. (2000). Foreign patenting behavior of small and large firms. International Journal of Technology Management, 19: 149–164

    Article  Google Scholar 

  20. Paci, R., Sassu, A. & Usai, S. (1997). International patenting and national technological specialization. Technovation, 17(1): 25–38

    Article  Google Scholar 

  21. Salton, G. & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5): 513–523

    Article  Google Scholar 

  22. Sedding, J. & Kazakov, D. (2004). WordNet-based text document clustering. In: Proceedings of the Third Workshop on Robust Methods in Analysis of Natural Language Data, 104–113, Geneva

  23. Trappey, A.J.C., Huang, C.-J. & Wu, C.-Y. (2008). Building a formal ontology engineering methodology for knowledge definition and representation in design knowledge management. In: Proceedings of Management International Conference (MIC 2008), Barcelona, Spain, November 26–29, 2008

  24. Trappey, A.J.C., Trappey, C.V., Hsu, F.-C. & Hsiao, D.W. (2009). A fuzzy ontological knowledge document clustering methodology. IEEE Transactions on Systems, Man, Cybernetics: Part B, 39(3): 806–814

    Article  Google Scholar 

  25. Trappey, A.J.C., Trappey, C.V. & Wu, C.Y. (2009). Automatic patent document summarization for collaborative knowledge systems and services. Journal of Systems Science and Systems Engineering, 18(1): 71–94

    Article  Google Scholar 

  26. Trappey, C.V., Taghaboni-Dutta, F., Wu, H.Y. & Trappey, A.J.C. (2009). China RFID patent analysis. In: Proceedings of the ASME International Manufacturing Science and Engineering Conference, West Lafayette, Indiana, U.S.A., October 4–7, 2009

  27. United States Patent and Trademark Office (USPTO). (2009). Available via DIALOG. http://www.uspto.gov/. Cited December 1, 2009

  28. World Intellectual Property Organization (WIPO). (2009a). What is a patent? Available via DIALOG. http://www.wipo.int/patentscope/en/patents_faq.html#patent. Cited December 1, 2009

  29. WIPO. (2009b). International classifications. Available via DIALOG. http://www.wipo.int/classifications/fulltext/new_ipc/ipcen.html. Cited December 1, 2009

  30. WIPO. (2009c). IP and business: managing patent costs. Available via DIALOG. http://www.wipo.int/wipo_magazine/en/2006/05/article_0010.html. Cited December 1, 2009

  31. WIPO. (2009d). What does a patent do? Available via DIALOG. http://www.wipo.int/patentscope/en/patents_faq.html#patent_role. Cited December 1, 2009

  32. WIPO. (2009e). Available via DIALOG. http://www.wipo.int/portal/index.html.en. Cited December 1, 2009

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amy J.C. Trappey.

Additional information

Charles Trappey is a professor of marketing in the Department of Management Science at the National Chiao Tung University.

Amy J.C. Trappey is chair professor in the Department of Industrial Engineering and Management and Dean, College of Management at the National Taipei University of Technology. She is also a faculty member of the Department of Industrial Engineering and Engineering Management, the National Tsing Hua University. Dr. Trappey is an ASME Fellow.

Chun-Yi Wu is a doctoral student in the Department of Industrial Engineering and Engineering Management at National Tsing Hua University and a system analyst and engineer at Avectec, Inc. His research interests include the development of computerized intelligent systems and the knowledge management of patents and intellectual properties.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Trappey, C.V., Trappey, A.J. & Wu, CY. Clustering patents using non-exhaustive overlaps. J. Syst. Sci. Syst. Eng. 19, 162–181 (2010). https://doi.org/10.1007/s11518-010-5134-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11518-010-5134-x

Keywords

Navigation