Skip to main content

A Comparison of Patent Classifications with Clustering Analysis

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2015 (WISE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9419))

Included in the following conference series:

Abstract

There is an abundance of data and knowledge within any given patent. Through the use of textual mining and machine learning clustering techniques it is possible to discover meaningful associations throughout a corpus of patents. This research demonstrates that such relationships between USPTO patents exist. Through the use of k-means and k-medians clustering, the accuracy of the USPTO classes will be assessed. It will also be demonstrated that a more refined classification process would be beneficial to other areas of analysis and forecasting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Blake, C.: Text mining. Ann. Rev. Inf. Sci. Technol. 45(1), 121–155 (2011)

    Article  Google Scholar 

  2. Chen, Y.-L., Chang, Y.-C.: A three-phase method for patent classification. Inf. Process. Manage. 48, 1017–1030 (2012)

    Article  Google Scholar 

  3. Chernoff, H., Gillick, L.S., Hartigan, J.A.: k-Means algorithms. Encycl. Stat. Sci. 6, 3858–3859 (2006)

    Google Scholar 

  4. Chou, L.-Y.: Knowledge discovery through bibiometrics and data mining: an example on marketing ethics. Int. J. Organ. Innov. 3, 106–139 (2011)

    Google Scholar 

  5. Goswami, S., Shishodia, M.S.: A fuzzy based approach to text mining and document clustering. Int. J. Data Min. Knowl. Manage. Proc. 3(3), 43–52 (2013)

    Article  Google Scholar 

  6. Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Elsevier, Waltham (2012)

    Book  Google Scholar 

  7. Hsu, C.C., Huang, Y.-P., Chang, K.-W.: Extended Naïve Bayes classifier for mixed data. Expert Syst. Appl. 35, 1080–1083 (2008)

    Article  Google Scholar 

  8. Jun, S., Park, S.S., Jang, D.S.: Technology forecasting using matrix mapping and patent clustering. Ind. Manage. Data Syst. 112, 786–807 (2011)

    Article  Google Scholar 

  9. Kang, I.-S., Na, S.-H., Kim, J., Lee, J.-H.: Cluster based patent retrieval. Inf. Process. Manage. 43, 1173–1182 (2007)

    Article  Google Scholar 

  10. Kasravi, K., Risov, M.: Patent mining - discovery of business value from patent repositories. In: Proceedings of the Fortieth Annual Hawaii International Conference on System Sciences, Waikoloa, Hawaii, USA (2007)

    Google Scholar 

  11. Kim, J.-H., Choi, K.-S.: Patent document categorization based on semantic structural information. Inf. Process. Manage. 43, 1200–1215 (2007)

    Article  Google Scholar 

  12. Karmakar, S., Zhu, Y.: Mining collaboration through textual semantic interpretation. In: 2011 11th International Conference on Hybrid Intelligent Systems (HIS), pp. 728–733 (2011)

    Google Scholar 

  13. Li, Y., Chung, S.M., Holt, J.D.: Text document clustering based on frequent word meaning sequences. Data Knowl. Eng. 64, 381–404 (2008)

    Article  Google Scholar 

  14. Maechler, M.: “Finding Groups in Data”: Cluster Analysis Extended Rousseeuw et al, Package “Cluster” (R Documentation). https://cran.r-project.org/web/packages/cluster/cluster.pdf. Accessed 21 July 2015

  15. Ruffaldi, E., Sani, E., Bergamasco, M.: Visualizing perspectives and trends in robotics based on patent mining. In: 2010 IEEE International Conference on Robotics and Automation, Anchorage, Alaska, USA 3–8 May 2010

    Google Scholar 

  16. Trappery, A.J.C., Hsu, F.-C., Trappery, C.V., Lin, C.-I.: Development of a patent document classification and search platform using a back-propagation network. Expert Syst. Appl. 31, 755–765 (2006)

    Article  Google Scholar 

  17. Trappey, C.V., Wu, H.-Y., Taghaboni-Dutta, F., Trappey, A.J.C.: Using patent data for technology forecasting: China RFID patent analysis. Adv. Eng. Inform. 25, 53–64 (2011)

    Article  Google Scholar 

  18. Tseng, Y.H., Lin, C.J., Lin, Y.I.: Text mining techniques for patent analysis. Inf. Process. Manage. 43, 1216–1247 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mick Smith .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Smith, M., Agrawal, R. (2015). A Comparison of Patent Classifications with Clustering Analysis. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9419. Springer, Cham. https://doi.org/10.1007/978-3-319-26187-4_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26187-4_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26186-7

  • Online ISBN: 978-3-319-26187-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics