Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better

  • Harri M. T. Saarikoski
  • Steve Legrand
  • Alexander Gelbukh
Conference paper

DOI: 10.1007/978-3-540-70939-8_23

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4394)
Cite this paper as:
Saarikoski H.M.T., Legrand S., Gelbukh A. (2007) Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better. In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg


We present a novel method for improving disambiguation accuracy by building an optimal ensemble (OE) of systems, in which we predict the best available system for each target word using a priori case factors (e.g. the amount of training data per sense). We report promising results from a series of best-system prediction tests (the best prediction accuracy is 0.92) and show that complex systems disambiguate tough words better, while simple systems disambiguate easy words better. The method provides the following benefits: (1) higher disambiguation accuracy for virtually any base systems (the current best OE yields close to a 2% accuracy gain over the Senseval-3 state of the art) and (2) an economical way of building more effective ensembles of all types (e.g. optimal, weighted voting and cross-validation based). The method is also highly scalable in that it estimates word difficulty from factors readily available for any ambiguous word in any language and defines classifier complexity using known properties only.
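The idea of best-system prediction can be illustrated with a minimal sketch. It is not the authors' implementation: the difficulty score (number of senses divided by training examples per sense), the single-threshold rule, the names `WordCase`, `difficulty`, `fit_threshold` and `predict_system`, and all of the toy data are assumptions made here for illustration only. The sketch learns a difficulty threshold from words whose best system is already known, then routes each new word to the "complex" or the "simple" system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordCase:
    """A priori case factors for one target word (hypothetical selection)."""
    word: str
    n_senses: int           # number of senses of the target word
    train_per_sense: float  # average number of training examples per sense


def difficulty(case: WordCase) -> float:
    # Assumed score: more senses and less training per sense means a tougher word.
    return case.n_senses / case.train_per_sense


def fit_threshold(cases, best_system):
    """Return the difficulty threshold that best reproduces the known best systems."""
    best_t, best_correct = None, -1
    for t in sorted(difficulty(c) for c in cases):
        correct = sum(
            ("complex" if difficulty(c) >= t else "simple") == best_system[c.word]
            for c in cases
        )
        if correct > best_correct:  # keep the first threshold with the top score
            best_t, best_correct = t, correct
    return best_t


def predict_system(case: WordCase, threshold: float) -> str:
    """Route tough words to the complex system and easy words to the simple one."""
    return "complex" if difficulty(case) >= threshold else "simple"


# Invented toy data: which system scored best on each training word.
train_cases = [
    WordCase("bank", 10, 20.0),
    WordCase("art", 4, 100.0),
    WordCase("play", 12, 15.0),
    WordCase("hot", 3, 60.0),
]
known_best = {"bank": "complex", "art": "simple", "play": "complex", "hot": "simple"}

threshold = fit_threshold(train_cases, known_best)
tough_choice = predict_system(WordCase("line", 8, 10.0), threshold)
easy_choice = predict_system(WordCase("bill", 2, 200.0), threshold)
print(threshold, tough_choice, easy_choice)
```

In an ensemble of this kind, the predicted system's answer would be used for every instance of the word in question. A real predictor would presumably draw on more factors and a stronger learner than a single threshold.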



Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Harri M. T. Saarikoski (1)
  • Steve Legrand (2)
  • Alexander Gelbukh (3)
  1. KIT Language Technology Doctorate School, Helsinki University, Finland
  2. Department of Computer Science, University of Jyväskylä, Finland
  3. Instituto Politecnico Nacional, Mexico City, Mexico
