
A Comparative Study on SOM-Based Visualization of Potential Technical Solutions Using Fuzzy Bag-of-Words and Co-occurrence Probability of Technical Words

  • Conference paper
Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2019)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11471)

Abstract

The Self-Organizing Map (SOM) is a powerful tool for visualizing mutual connections among objects. In previous work, SOM-based visualization was applied to reveal potential technical solutions in Japanese patent documents, where meaningful pairs of technical words are implied in the SOMs. Before the SOMs were constructed, the text documents were converted into numerical vectors based on the co-occurrence frequency of technical words within sentences; the SOMs then summarized word features given as co-occurrence probability vectors or correlation coefficient vectors. Recently, a fuzzy bag-of-words model was proposed to handle the sparsity of word feature values and was shown to be useful in document classification. This paper presents a comparative study on using fuzzy bag-of-words in conjunction with the previous feature values, with the goal of revealing potential technical solutions in patent documents.
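The two kinds of word features named in the abstract can be illustrated with a small sketch. The abstract gives no formulas, so everything below is an assumption: co-occurrence probability is taken as the share of sentences containing one word that also contain another, and the fuzzy bag-of-words step spreads each word count over all basis terms using cosine similarity of word embeddings clipped at zero, which is one common reading of the model in reference 5. The function names, the toy sentences, and the 2-D embeddings are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def cooccurrence_probability(sentences, vocab):
    """P[wi][wj]: share of sentences containing wi that also contain wj (assumed definition)."""
    occur = Counter()  # number of sentences containing each word
    pair = Counter()   # number of sentences containing both words of an ordered pair
    for sentence in sentences:
        present = sorted(set(sentence) & set(vocab))
        occur.update(present)
        for a, b in combinations(present, 2):
            pair[(a, b)] += 1
            pair[(b, a)] += 1
    return {
        wi: {wj: pair[(wi, wj)] / occur[wi] if occur[wi] else 0.0
             for wj in vocab if wj != wi}
        for wi in vocab
    }


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def fuzzy_bow(counts, embeddings, vocab):
    """Spread each word count over every basis term, weighted by clipped cosine similarity."""
    return [
        sum(counts.get(w, 0) * max(0.0, cosine(embeddings[w], embeddings[t]))
            for w in vocab)
        for t in vocab
    ]


# Hypothetical toy data: tokenized sentences and 2-D word embeddings.
VOCAB = ["motor", "vibration", "damper", "noise"]
SENTENCES = [
    ["motor", "vibration", "damper"],
    ["motor", "noise"],
    ["vibration", "damper"],
]
EMBEDDINGS = {
    "motor": [1.0, 0.0],
    "vibration": [0.0, 1.0],
    "damper": [1.0, 1.0],
    "noise": [-1.0, 0.0],
}

probabilities = cooccurrence_probability(SENTENCES, VOCAB)
print(probabilities["motor"])

# A document that mentions "vibration" twice also gets partial weight on "damper".
print(fuzzy_bow({"vibration": 2}, EMBEDDINGS, VOCAB))
```

In this sketch a plain bag-of-words vector for the last document would be nonzero only at "vibration"; the fuzzy version also assigns weight to "damper", which is how such a model can reduce sparsity. Each row of `probabilities`, or each fuzzy vector, could then serve as an input vector to a SOM.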


References

  1. Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2000). https://doi.org/10.1007/978-3-642-56927-2


  2. Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. In: Joint 10th International Conference on Soft Computing and Intelligent Systems and 19th International Symposium on Advanced Intelligent Systems, pp. 820–825 (2018)


  3. Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)


  4. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)


  5. Zhao, L., Mao, K.: Fuzzy bag-of-words model for document representation. IEEE Trans. Fuzzy Syst. 26(2), 794–804 (2018)


  6. Sato, T.: Neologism dictionary based on the language resources on the web for MeCab (2015). https://github.com/neologd/mecab-ipadic-neologd

  7. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference on Learning Representations (2013). https://arxiv.org/pdf/1301.3781.pdf

  8. Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems. https://www.tensorflow.org/


Acknowledgment

This work was achieved through the use of large-scale computer systems at the Cybermedia Center, Osaka University.

Author information


Corresponding author

Correspondence to Yasushi Nishida.


Copyright information

© 2019 Springer Nature Switzerland AG

About this paper


Cite this paper

Nishida, Y., Honda, K. (2019). A Comparative Study on SOM-Based Visualization of Potential Technical Solutions Using Fuzzy Bag-of-Words and Co-occurrence Probability of Technical Words. In: Seki, H., Nguyen, C., Huynh, VN., Inuiguchi, M. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2019. Lecture Notes in Computer Science, vol 11471. Springer, Cham. https://doi.org/10.1007/978-3-030-14815-7_30


  • DOI: https://doi.org/10.1007/978-3-030-14815-7_30


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-14814-0

  • Online ISBN: 978-3-030-14815-7

  • eBook Packages: Computer Science, Computer Science (R0)
