Abstract
Patent documents are unique external sources of information that reveal the core technology underlying new inventions. Patents also serve as a strategic data source that can be mined to discover state-of-the-art technical development and subsequently help guide R&D investments. This research incorporates an ontology schema to extract and represent patent concepts. A clustering algorithm with non-exhaustive overlaps is proposed to overcome deficiencies with exhaustive clustering methods used in patent mining and technology discovery. The non-exhaustive clustering approach allows for the clustering of patent documents with overlapping technical findings and claims, a feature that enables the grouping of patents that define related key innovations. Legal advisors can use this approach to study potential cases of patent infringement or devise strategies to avoid litigation. The case study demonstrates the use of non-exhaustive overlaps algorithm by clustering US and Japan radio frequency identification (RFID) patents and by analyzing the legal implications of automated discovery of patent infringement.
Similar content being viewed by others
References
Anderberg, M. (1973). Cluster Analysis for Applications. Academic Press, New York
Berkhin, P. (2002). Survey of clustering data mining techniques. Technical Report, Accrue Software, Inc.
Berry, M.J.A. & Linoff, G. (1997). Data Mining Techniques: For Marketing, Sale, and Customer Support. John Wiley & Sons Inc.
Chen, B., Tai, P.C., Harrison, R. & Pan, Y. (2005). Novel hybrid hierarchical k-means clustering method (H-K-means) for microarray analysis. In: Proceedings of Computational Systems Bioinformatics Conference, Sandford CA, USA, August 8–11, 2005
Chen, E. & Wu, G. (2005). An ontology learning method enhanced by frame semantics. In: Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM), 374–382
Chen, T.S., Tsai, T.H., Chen, Y.T., Lin, C.C., Chen, R.C., Li, S.Y. & Chen, H.Y. (2005). A combined k-means and hierarchical clustering method for improving the clustering efficiency of microarray. In: Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, 405–408, December 13–16, 2005
Grandstrand, O. (1999). The Economics and Management of Intellectual Property: Toward Intellectual Capitalism. Edward Elgar Publishing
Grilliches, Z. (1990). Patent statistics as economic indicators: a survey. Journal of Economic Literature, 28(4): 1661–1707
Gupta, V.K. (1999). Technological trends in the area of fullerenes using bibliometric analysis of patents. Scientometrics, 44(1): 17–31
Han, J. & Kamber, M. (2000). Data Mining: Concepts and Techniques. Morgan Kaufman
Hong S. (2009). The magic of patent information. World Intellectual Property Organization (WIPO). Available via DIALOG. http://www.wipo.int/sme/en/documents/patent_information.htm. Cited December 1, 2009
Hu, H.L. (2006). Optimization in data mining an overlapping cluster algorithm to provide non-exhaustive clustering. Ph.D. Thesis, Department of Information Management, National Central University, Chung-Li, Taiwan, China
Japan Patent Office (JPO). (2009). Available via DIALOG. http://www.jpo.go.jp/. Cited December 1, 2009
Jones, K.S. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1): 11–20
Kim, Y.G., Suh, J.H. & Park, S.C. (2008). Visualization of patent analysis for emerging technology. Expert Systems with Applications, 34: 1804–1812
Lai, K.K. & Wu, S.J. (2005). Using the patent co-citation approach to establish a new patent classification system. Information Processing and Management, 41: 313–330
Luhn, H.P. (1957). A statistical approach to mechanized encoding and searching of literary information. IBM Journal of Research and Development, 1(4): 309–317
MacQueen, J.B. (1967). Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1: 281–297
Mogee, M.E. (2000). Foreign patenting behavior of small and large firms. International Journal of Technology Management, 19: 149–164
Paci, R., Sassu, A. & Usai, S. (1997). International patenting and national technological specialization. Technovation, 17(1): 25–38
Salton, G. & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5): 513–523
Sedding, J. & Kazakov, D. (2004). WordNet-based text document clustering. In: Proceedings of the Third Workshop on Robust Methods in Analysis of Natural Language Data, 104–113, Geneva
Trappey, A.J.C., Huang, C.-J. & Wu, C.-Y. (2008). Building a formal ontology engineering methodology for knowledge definition and representation in design knowledge management. In: Proceedings of Management International Conference (MIC 2008), Barcelona, Spain, November 26–29, 2008
Trappey, A.J.C., Trappey, C.V., Hsu, F.-C. & Hsiao, D.W. (2009). A fuzzy ontological knowledge document clustering methodology. IEEE Transactions on Systems, Man, Cybernetics: Part B, 39(3): 806–814
Trappey, A.J.C., Trappey, C.V. & Wu, C.Y. (2009). Automatic patent document summarization for collaborative knowledge systems and services. Journal of Systems Science and Systems Engineering, 18(1): 71–94
Trappey, C.V., Taghaboni-Dutta, F., Wu, H.Y. & Trappey, A.J.C. (2009). China RFID patent analysis. In: Proceedings of the ASME International Manufacturing Science and Engineering Conference, West Lafayette, Indiana, U.S.A., October 4–7, 2009
United States Patent and Trademark Office (USPTO). (2009). Available via DIALOG. http://www.uspto.gov/. Cited December 1, 2009
World Intellectual Property Organization (WIPO). (2009a). What is a patent? Available via DIALOG. http://www.wipo.int/patentscope/en/patents_faq.html#patent. Cited December 1, 2009
WIPO. (2009b). International classifications. Available via DIALOG. http://www.wipo.int/classifications/fulltext/new_ipc/ipcen.html. Cited December 1, 2009
WIPO. (2009c). IP and business: managing patent costs. Available via DIALOG. http://www.wipo.int/wipo_magazine/en/2006/05/article_0010.html. Cited December 1, 2009
WIPO. (2009d). What does a patent do? Available via DIALOG. http://www.wipo.int/patentscope/en/patents_faq.html#patent_role. Cited December 1, 2009
WIPO. (2009e). Available via DIALOG. http://www.wipo.int/portal/index.html.en. Cited December 1, 2009
Author information
Authors and Affiliations
Corresponding author
Additional information
Charles Trappey is a professor of marketing in the Department of Management Science at the National Chiao Tung University.
Amy J.C. Trappey is chair professor in the Department of Industrial Engineering and Management and Dean, College of Management at the National Taipei University of Technology. She is also a faculty member of the Department of Industrial Engineering and Engineering Management, the National Tsing Hua University. Dr. Trappey is an ASME Fellow.
Chun-Yi Wu is a doctoral student in the Department of Industrial Engineering and Engineering Management at National Tsing Hua University and a system analyst and engineer at Avectec, Inc. His research interests include the development of computerized intelligent systems and the knowledge management of patents and intellectual properties.
Rights and permissions
About this article
Cite this article
Trappey, C.V., Trappey, A.J. & Wu, CY. Clustering patents using non-exhaustive overlaps. J. Syst. Sci. Syst. Eng. 19, 162–181 (2010). https://doi.org/10.1007/s11518-010-5134-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11518-010-5134-x