Abstract
Any given patent contains an abundance of data and knowledge. Using text mining and machine learning clustering techniques, it is possible to discover meaningful associations across a corpus of patents. This research demonstrates that such relationships exist between USPTO patents. The accuracy of the USPTO classes is assessed using k-means and k-medians clustering. The research also demonstrates that a more refined classification process would benefit other areas of analysis and forecasting.
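The idea of checking assigned classes against clusters can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy documents, the class labels, the term-frequency weighting, the deterministic seeding, and the purity score are all assumptions chosen to keep the example small and reproducible.

```python
import math
from collections import Counter

def tf_vectors(docs):
    """Turn each document into a length-normalised term-frequency vector."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    vectors = []
    for d in docs:
        v = [0.0] * len(vocab)
        for w, c in Counter(d.lower().split()).items():
            v[index[w]] = float(c)
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vectors

def kmeans(vectors, k, iterations=20):
    """Plain Lloyd's k-means; the first k vectors seed the centroids (an assumption)."""
    centroids = [list(v) for v in vectors[:k]]
    assignment = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        assignment = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(v, centroids[j])))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return assignment

def purity(assignment, labels):
    """Fraction of documents that fall in their cluster's majority class."""
    total = 0
    for j in set(assignment):
        counts = Counter(l for a, l in zip(assignment, labels) if a == j)
        total += counts.most_common(1)[0][1]
    return total / len(labels)

# Hypothetical stand-ins for patent abstracts and their assigned classes.
docs = [
    "optical lens focusing light beam",
    "engine piston fuel combustion chamber",
    "lens light beam optical sensor",
    "fuel engine piston valve combustion",
]
labels = ["optics", "engines", "optics", "engines"]

clusters = kmeans(tf_vectors(docs), k=2)
print(clusters, purity(clusters, labels))
```

A purity near 1 means the clusters agree with the assigned classes, while lower values suggest documents whose text places them closer to another group. A k-medians variant would replace the mean in the update step with a coordinate-wise median and would typically use Manhattan distance in the assignment step.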
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Smith, M., Agrawal, R. (2015). A Comparison of Patent Classifications with Clustering Analysis. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science, vol 9419. Springer, Cham. https://doi.org/10.1007/978-3-319-26187-4_38
DOI: https://doi.org/10.1007/978-3-319-26187-4_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26186-7
Online ISBN: 978-3-319-26187-4
eBook Packages: Computer Science (R0)