Abstract
We propose a new clustering algorithm for the induction of morphological paradigms. Our method is unsupervised and exploits the syntactic categories of words, which are acquired by an unsupervised syntactic category induction algorithm [1]. Previous research [2,3] on joint learning of morphology and syntax has shown that the two types of knowledge affect each other, making it possible to use one to help learn the other.
References
Clark, A.S.: Inducing syntactic categories by context distribution clustering. In: Proceedings of CoNLL 2000 and LLL 2000, Morristown, NJ, USA, pp. 91–94. ACL (2000)
Clark, A.S.: Combining distributional and morphological information for part of speech induction. In: EACL 2003: Proceedings of the 10th EACL, Morristown, NJ, USA, pp. 59–66. ACL (2003)
Hu, Y., Matveeva, I., Goldsmith, J., Sprague, C.: Using morphology and syntax together in unsupervised learning. In: Proceedings of the Workshop on Psychocomputational Models of Human Language Acquisition, Ann Arbor, Michigan, June 2005, pp. 20–27. ACL (2005)
Karlsson, F.: Finnish grammar. WSOY, Juva (1983)
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Brent, M.R., Murthy, S.K., Lundberg, A.: Discovering morphemic suffixes: a case study in MDL induction. In: Fifth International Workshop on AI and Statistics, pp. 264–271 (1995)
Creutz, M., Lagus, K.: Unsupervised discovery of morphemes. In: Proceedings of the ACL-02 Workshop on Morphological and Phonological Learning, Morristown, NJ, USA, pp. 21–30. ACL (2002)
Goldsmith, J.: Unsupervised learning of the morphology of a natural language. Computational Linguistics 27(2), 153–198 (2001)
Bordag, S.: Unsupervised and knowledge-free morpheme segmentation and analysis. In: Peters, C., Jijkoun, V., Mandl, T., Müller, H., Oard, D.W., Peñas, A., Petras, V., Santos, D. (eds.) CLEF 2007. LNCS, vol. 5152, pp. 881–891. Springer, Heidelberg (2008)
Schone, P., Jurafsky, D.: Knowledge-free induction of morphology using latent semantic analysis. In: Proceedings of CoNLL 2000 and LLL 2000, Morristown, NJ, USA, pp. 67–72. ACL (2000)
Snover, M.G., Jarosz, G.E., Brent, M.R.: Unsupervised learning of morphology using a novel directed search algorithm: Taking the first step. In: Proceedings of the ACL 2002 Workshop on Morphological and Phonological Learning, Morristown, NJ, USA, pp. 11–20. ACL (2002)
Creutz, M.: Unsupervised segmentation of words using prior distributions of morph length and frequency. In: ACL 2003: Proceedings of the 41st ACL, Morristown, NJ, USA, pp. 280–287. ACL (2003)
Monson, C.: ParaMor: From Paradigm Structure to Natural Language Morphology Induction. PhD thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University (2008)
Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of the International Conference on New Methods in Language Processing, Manchester, UK (1994)
Kurimo, M., Virpioja, S., Turunen, V.T., Blackwood, G.W., Byrne, W.: Overview and results of Morpho Challenge 2009. In: Multilingual Information Access Evaluation Vol. I, 10th Workshop of the CLEF 2009, Corfu, Greece, September 30 - October 2, Revised Selected Papers. Springer, Heidelberg (2009)
Monson, C., Hollingshead, K., Roark, B.: Probabilistic ParaMor. In: Working Notes for the CLEF Workshop, Corfu, Greece (2009)
Lignos, C., Chan, E., Marcus, M.P., Yang, C.: A rule-based unsupervised morphology learning framework. In: Working Notes for the CLEF Workshop, Corfu, Greece (2009)
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Can, B., Manandhar, S. (2010). Clustering Morphological Paradigms Using Syntactic Categories. In: Peters, C., et al. Multilingual Information Access Evaluation I. Text Retrieval Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6241. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15754-7_77
DOI: https://doi.org/10.1007/978-3-642-15754-7_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15753-0
Online ISBN: 978-3-642-15754-7
eBook Packages: Computer Science, Computer Science (R0)