Abstract
Self-Organizing Maps (SOM) is a powerful tool in visualizing mutual connection among various objects. In a previous work, SOM-based visualization was applied for revealing potential technical solutions varied in Japanese patent documents, in which meaningful pairs of technical words are implied in SOMs. Before application, text documents were quantified into numerical vectors considering co-occurrence frequency among technical words in sentences, and then, SOMs were constructed summarizing word features of co-occurrence probability vectors or correlation coefficient vectors. Recently, a fuzzy bag-of-words model was proposed for handling sparse characteristics of word feature values and shown to be useful in document classification. In this paper, a comparative study on utilizing fuzzy bag-of-words in conjunction with previous feature values is performed with the goal of revealing potential technical solutions varied in patent documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2000). https://doi.org/10.1007/978-3-642-56927-2
Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. In: Joint 10th International Conference on Soft Computing and Intelligent Systems and 19th International Symposium on Advanced Intelligent Systems, pp. 820–825 (2018)
Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Zhao, L., Mao, K.: Fuzzy bag-of-words model for document representation. IEEE Trans. Fuzzy Syst. 26(2), 794–804 (2018)
Sato, T.: Neologism dictionary based on the language resources on the web for MeCab (2015). https://github.com/neologd/mecab-ipadic-neologd
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference Learning Representations (2013). https://arxiv.org/pdf/1301.3781.pdf
Abadi M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems. https://www.tensorflow.org/
Acknowledgment
This work was achieved through the use of large-scale computer systems at the Cybermedia Center, Osaka University.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Nishida, Y., Honda, K. (2019). A Comparative Study on SOM-Based Visualization of Potential Technical Solutions Using Fuzzy Bag-of-Words and Co-occurrence Probability of Technical Words. In: Seki, H., Nguyen, C., Huynh, VN., Inuiguchi, M. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2019. Lecture Notes in Computer Science(), vol 11471. Springer, Cham. https://doi.org/10.1007/978-3-030-14815-7_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-14815-7_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14814-0
Online ISBN: 978-3-030-14815-7
eBook Packages: Computer ScienceComputer Science (R0)