Abstract
In many languages, abbreviations are widely used either in writing or talking. However, abbreviations are likely to be ambiguous. Therefore, there is a need for disambiguation. That is, abbreviations should be expanded correctly. Disambiguation of abbreviations is critical for correct understanding not only for the abbreviations themselves but also for the whole text. Little research has been done concerning disambiguation of abbreviations for documents in English and Latin. Nothing has been done for the Hebrew language. In this ongoing work, we investigate a basic model, which expands abbreviations contained in Jewish Law Documents written in Hebrew. This model has been implemented in a prototype system. Currently, experimental results show that abbreviations are expanded correctly in a rate of almost 60%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adar, E.: S-RAD: A Simple and Robust Abbreviation Dictionary. Technical Report. HP Laboratories (2002)
Ashkenazi, S., Jarden, D.: Ozar Rashe Tevot: Thesaurus of Hebrew Abbreviations (in Hebrew). Kiryat Sefere LTD., Jerusalem (1994)
Cabezas, C., Resnik, P., Stevens, J.: Supervised Sense Tagging using Support Vector Machines. In: Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems (SENSEVAL-2), Toulouse, France (2001)
Cortes, C., Vapnik, V.: Support-Vector Networks. Machine Learning 20, 273–297 (1995)
Gale, W.A., Church, K.W., Yarowsky, D.: One Sense Per Discourse. In: Proceedings of the ARPA Workshop on Speech and Natural Language Processing, pp. 233–237 (1992)
Glinert, L., Gilinert, L.: Hebrew – An Essential Grammar. Routledge, London (1994)
HaCohen, R.Y.M.: Mishnah Berurah, vol. 3. Pisgah Foundation, Jerusalem (1993)
Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2001)
Liu, H., Aronson, A.R., Friedman, C.: A Study of Abbreviations in MEDLINE Abstracts. In: Proc AMIA Symp., pp. 464–468 (2002)
Liu, H., Lussier, Y.A., Friedman, C.: A Study of Abbreviations in the UMLS. In: Proc. AMIA Symp., pp. 393–397 (2001)
Rydberg-Cox, J.A.: Automatic Disambiguation of Latin Abbreviations in Early Modern Texts for Humanities Digital Libraries. In: International Conference on Digital Libraries, Proceedings of the third ACM/IEEE-CS joint conference on Digital libraries, pp. 372–373. IEEE Computer Society, Los Alamitos (2003)
Wartski, I.: Hebrew Grammar and Explanatory Notes. The Linguaphone Institute, London (1900)
Wintner, S.: Hebrew Computational Linguistics: Past and Future. Artificial Intelligence Review 21(2), 113–138 (2003)
WordiQ (2003), http://www.wordiq.com/
Yelin, D.: Dikduk HaLason HaIvrit (Hebrew Grammar, in Hebrew), Jerusalem (1970)
Yu, Z., Tsuruoka, Y., Tsujii, J.: Automatic Resolution of Ambiguous Abbreviations in Biomedical Texts using Support Vector Machines and One Sense Per Discourse Hypothesis. In: SIGIR 2003 Workshop on Text Analysis and Search for Bioinformatics, pp. 57–62 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
HaCohen-Kerner, Y., Kass, A., Peretz, A. (2004). Baseline Methods for Automatic Disambiguation of Abbreviations in Jewish Law Documents. In: Vicedo, J.L., MartÃnez-Barco, P., MuÅ„oz, R., Saiz Noeda, M. (eds) Advances in Natural Language Processing. EsTAL 2004. Lecture Notes in Computer Science(), vol 3230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30228-5_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-30228-5_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23498-2
Online ISBN: 978-3-540-30228-5
eBook Packages: Springer Book Archive