Abstract
TLC is a supervised training (S) system that uses a Bayesianstatistical model and features of a word's context to identifyword sense. We describe the classifier's operation and how itcan be configured to use only topical context cues, only localcues, or a combination of both. Our results on Senseval'sfinal run are presented along with a comparison to theperformance of the best S system and the average for S systems.We discuss ways to improve TLC by enriching its featureset and by substituting other decision procedures for the Bayesianmodel. Future development of supervised training classifiers willdepend on the availability of tagged training data. TLC canassist in the hand-tagging effort by helping human taggers locateinfrequent senses of polysemous words.
Similar content being viewed by others
References
Brill, E. “Some advances in rule-based part of speech tagging” Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle: AAAI. 1994.
Chiang T-H., Y-C. Lin and K-Y Su. “Robust learning, smoothing, and parameter tying on syntactic ambiguity resolution”, Computational Linguistics, Vol. no. 21–3, 1995, pp. 321–349.
Fellbaum, C. (ed). WordNet: An Electronic Lexical Database, Cambridge: MIT Press. 1998.
Good, I. F. “The population frequencies of species and the estimation of population parameters”, Biometrica, Vol. no. 40, 1953, pp. 237–264.
Joshi, A.K., B. Srinivas. “Disambiguation of Super Parts of Speech (or Supertags): Almost parsing”, Proceedings of COLING 1994, 1994, pp. 154–160.
Leacock, C., M. Chodorow and G. A. Miller. “Using corpus statistics and WordNet relations for sense identification”, Computational Linguistics, Vol. no. 24–1, 1998, pp. 147–165.
Miller, G. A., R. Tengi and S. Landes (submitted for publication). “Matching the Tagging to the Task”.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Chodorow, M., Leacock, C. & Miller, G.A. A Topical/Local Classifier for Word Sense Identification. Computers and the Humanities 34, 115–120 (2000). https://doi.org/10.1023/A:1002463121011
Issue Date:
DOI: https://doi.org/10.1023/A:1002463121011