Impact of Feature Selection for Corpus-Based WSD in Turkish

  • Zeynep Orhan
  • Zeynep Altan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4293)


Word sense disambiguation (WSD) is an important intermediate stage for many natural language processing applications. The senses of an ambiguous word are the classification of usages for that word. WSD is basically a mapping function from a context to a set of applicable senses depending on various parameters. Resource selection, determination of senses for ambiguous words, decision of effective features, algorithms, and evaluation criteria are the major issues in a WSD system. This paper deals with the feature selection strategies for word sense disambiguation task in Turkish language. There are many different features that can contribute to the meaning of a word. These features can vary according to the metaphorical usages, POS of the word, pragmatics, etc. The observations indicated that detecting the critical features can contribute much than the learning methodologies.


Feature Selection Natural Language Processing Ambiguous Word Word Sense Disambiguation Case Marker 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Barwise, J., Perry, J.: Situations and Attitudes. MIT Press, Cambridge, MA (1983)Google Scholar
  2. 2.
    Bloomfield, L.: Language. Henry Holt, New York (1933)Google Scholar
  3. 3.
    Daeleman, W.: Machine Learning Of Language: A Model And A Problem. In: ESSLLI 2002 Workshop On ML Approaches In CL, Trento, Italy, August 5-9, 2002, pp. 134–145 (2002)Google Scholar
  4. 4.
    Edmonds, P.: SENSEVAL: The evaluation of word sense disambiguation systems. ELRA Newsletter 7(3), 5–14 (2002)Google Scholar
  5. 5.
    Ide, N., Veronis, J.: Introduction To The Special Issue On Word Sense Disambiguation: The State Of The Art. Computational Linguistics 24(1), 1–40 (1998)Google Scholar
  6. 6.
    Miller, G.: Ambiguous Words. iMP Magazine (March 22, 2001),
  7. 7.
    Montoyo, A., Suárez, A., Rigau, G., Palomar, M.: Combining Knowledge and Corpus-based Word Sense Disambiguation Methods. JAIR 23, 299–330 (2005)MATHGoogle Scholar
  8. 8.
    Mooney, R.J.: Comparative Experiments On Disambiguating Word Senses: An Illustration Of The Role Of Bias In Machine Learning. In: Brill, E., Church, K. (eds.) Proceedings Of The Conference EMNLP, ACL, Somerset, NJ, pp. 82–91 (1996)Google Scholar
  9. 9.
    Ng, H.T., Lee, H.B.: Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach. In: Proceedings Of The 34th Annual Meeting Of ACL, Santa Cruz, pp. 40–47. Morgan Kaufmann Publishers, San Francisco (1996)Google Scholar
  10. 10.
    Oflazer, K., Say, B., Tur, D.Z.H., Tur, G.: Building A Turkish Treebank. In: Abeille, A. (ed.) Invited Chapter In Building And Exploiting Syntactically-Annotated Corpora, Kluwer Academic Publishers, Dordrecht (2003)Google Scholar
  11. 11.
    Orhan, Z., Altan, Z.: Determining Effective Features for Word Sense Disambiguation in Turkish. IU - JEEE 5(2), 1341–1352 (2005)Google Scholar
  12. 12.
    Preiss, J., Yarowsky, D.: The Proceedings of SENSEVAL-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems (2001)Google Scholar
  13. 13.
    Tuğlacı, P.: Okyanus Ansiklopedik Türkçe Sözlük. ABC Kitabevi, İstanbul (1995)Google Scholar
  14. 14.
    Türkçe Sözlük(Turkish Dictionary): TDK (2005),
  15. 15.
    Weaver, W.: Translation. Mimeographed, p. 12, reprinted in Locke, W.N., Booth, A.D. (eds.) (1955), Machine translation of languages, pp. 15–23. John Wiley & Sons, New York (1949) Google Scholar
  16. 16.
    Wittgenstein, L.: Philosophical Investigations  translated by Anscombe, G.E.M. Basil Blackwell, Oxford (1953)Google Scholar
  17. 17.
    Witten, I.H., Frank, E.: DataMining: Practical Machine Learning Toolsand Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)Google Scholar
  18. 18.
    Yarowsky, D.: Hierarchical Decision Lists For Word Sense Disambiguation. Computers And The Humanities 34(2), 179–186 (2000)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Zeynep Orhan
    • 1
  • Zeynep Altan
    • 2
  1. 1.Department of Computer EngineeringFatih UniversityBüyükçekmece, IstanbulTurkey
  2. 2.Department of Computer EngineeringMaltepe UniversityMaltepe, IstanbulTurkey

Personalised recommendations