Contextually Self-Organized Maps of Chinese Words

  • Teuvo Kohonen
  • Hongbing Xing
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6731)

Abstract

Contextual SOMs of Chinese words have been constructed in this work. Differing from previous approaches, in which individual words were mapped onto the SOM, in this work histograms of various word classes or otherwise defined subsets of words were formed on the SOM array. It was found that the words are not only clustered according to the word classes, but joint or overlapping clusters of words from different classes can also be formed according to the role of the words as sentence constituents. A further new effect was found. When the histograms were formed using test words restricted to certain intervals of word frequencies, the histograms were found to depend on the frequency, and the corresponding partial clusters were often very compact.

Keywords

Target Word Word Frequency Test Word Chinese Word Text Corpus 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biol. Cyb. 61, 241–254 (1989)CrossRefGoogle Scholar
  2. 2.
    Honkela, T., Pulkki, V., Kohonen, T.: Contextual relations of words in Grimm tales, analyzed by self-organizing maps. In: Fogelman-Soulié, F., Gallinari, P. (eds.) Proc. ICANN 1995, Int. Conf. on Artificial Neural Networks, vol. II, pp. 3–7. EC2, Nanterre, France (1995)Google Scholar
  3. 3.
    Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2001)CrossRefMATHGoogle Scholar
  4. 4.
    Kohonen, T.: Contextually Self-Organized Maps of Chinese Words. TKK Reports in Information and Computer Science, TKK-ICS-R30. Aalto University School of Science and Technology, Espoo, Finland (2010) (This report is downloadable from ics.tkk.fi/en/research/publications)
  5. 5.
    Kohonen, T.: Contextually Self-Organized Maps of Chinese Words. Part II, TKK Reports in Information and Computer Science, TKK-ICS-R35. Aalto University School of Science and Technology, Espoo, Finland (2010) (This report is downloadable from ics.tkk.fi/en/research/publications)
  6. 6.
    Sun, H.L., Sun, D.J., Huang, J.P., Li, D.J., Xing, H.B.: Corpus for modern Chinese research. In: Luo, Z.S., Yuan, Y.L. (eds.) Studies in the Chinese language and characters in the era of computers, pp. 283–294. Tsinghua University Press, Beijing, China (1996)Google Scholar
  7. 7.
    Vesanto, J., Alhoniemi, E., Himberg, J., Kiviluoto, K., Parviainen, J.: Self-organizing map for data mining in Matlab: the SOM Toolbox. Simulation News Europe 25, 54 (1999) (The SOM Toolbox software package is downloadable from ics.tkk.fi/en/research/software)
  8. 8.
    Kohonen, T., Nieminen, I.T., Honkela, T.: On the quantization error in SOM vs. VQ: A critical and systematic study. In: Príncipe, J.C., Miikkulainen, R. (eds.) WSOM 2009. LNCS, vol. 5629, pp. 133–144. Springer, Heidelberg (2009)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Teuvo Kohonen
    • 1
  • Hongbing Xing
    • 2
  1. 1.Aalto UniversityEspooFinland
  2. 2.Beijing Language and Culture UniversityBeijingChina

Personalised recommendations