Abstract
In this paper, we describe various language models (LMs) and combinations created to support word prediction and completion in Hebrew. We define and apply 5 general types of LMs: (1) Basic LMs (unigrams, bigrams, trigrams, and quadgrams), (2) Backoff LMs, (3) LMs Integrated with tagged LMs, (4) Interpolated LMs, and (5) Interpolated LMs Integrated with tagged LMs. 16 specific implementations of these LMs were compared using 3 types of Israeli web newspaper corpora. The foremost keystroke saving results were achieved with LMs of the most complex variety, the Interpolated LMs Integrated with tagged LMs. Therefore, we conclude that combining all strengths by creating a synthesis of all four basic LMs and the tagged LMs leads to the best results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Tam, C., Wells, D.: Evaluating the Benefits of Displaying Word Prediction Lists on a Personal Digital Assistant at the Keyboard Level. Assistive Technology 21, 105–114 (2009)
Anson, D., Moist, P., Przywara, M., Wells, H., Saylor, H., Maxime, H.: The Effects of Word Completion and Word Prediction on Typing Rates Using On-Screen Keyboards. Assistive Technology 18, 146–154 (2006)
Beukelman, D., Mirenda, P.: Augmentative and Alternative Communication: Supporting Children and Adults with Complex Communication Needs, 3rd edn., p. 77. Brookes Publishing, Baltimore, MD (2008)
Darragh, J.J., Witten, I.H., James, M.L.: The Reactive Keyboard: A Predictive Typing Aid. Computer 23(11), 41–49 (1990)
Calculator, S., et al.: Roles and Responsibilities of Speech-Language Pathologists With Respect to Augmentative and Alternative Communication: Position Statement. Technical report, American Speech-Language-Hearing Association (2004), http://www.asha.org/docs/html/PS2005-00113.html
Fossett, B., Mirenda, P.: Augmentative and Alternative Communication. In: Odom, S.L., Horner, R.H., Snell, M.E. (eds.) Handbook of Developmental Disabilities, pp. 330–366. Guilford Press (2009) ISBN 978-1-60623-248-4
Beukelman, D., Mirenda, P.: Augmentative & Alternative Communication: Supporting Children & Adults with Complex Communication Needs, 3rd edn. Paul H. Brookes Publishing Company (2005) ISBNÂ 978-1-55766-684-0
Trnka, K., McCaw, J., Yarrington, D., McCoy, K.F.: User Interaction with Word Prediction: The Effects of Prediction Quality. Special Issue of ACM Trans. on Accessible Computing (TACCESS) on Augmentative and Alternative Communication 1(3), 1–34 (2009)
Darragh, J.J., Witten, I.H.: Adaptive Predictive Text Generation and the Reactive Keyboard. Interacting with Computers 3(1), 27–50 (1991)
Wandmacher, T., Antoine, J.-Y.: Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component. In: Proc. ACL SIGDAT Joint Conference EMNLP-CoLLN 2007, Prague, Tchéquie, pp. 503–513 (2007)
Newell, A., Langer, S., Hickey, M.: The Rôle of Natural Language Processing in Alternative and Augmentative Communication. Natural Language Engineering 4(1), 1–16 (1998)
Carlberger, A., Carlberger, J., Magnuson, T., Hunnicutt, M.S., Palazuelos-Cagigas, S.E., Navarro, S.A.: Profet, a New Generation of Word Prediction: An Evaluation Study. In: Copestake, A., Langer, S., Palazuelos-Cagigas, S. (eds.) Natural Lang. Processing for Communication Aids, Madrid. Proc. of a Workshop Sponsored by acl, pp. 23–28 (1997)
Li, J., Hirst, G.: Semantic Knowledge in Word Completion. In: Proceedings of the 7th Int. ACM SIGACCESS Conf. on Computers and Accessibility (ASSETS), pp. 121–128 (2005)
Trnka, K., McCoy, K.F.: Evaluating Word Prediction: Framing Keystroke Savings. In: ACL (Short Papers) 2008, pp. 261–264 (2008)
Swiffin, A.L., Pickering, J.A., Arnott, J.L., Newell, A.F.: PAL: An Effort Efficient Portable Communication Aid and Keyboard Emulator. In: Proceedings of the 8th Annual Conference on Rehabilitation Technology, pp. 197–199 (1985)
Carlberger, J.: Word Prediction: Design and Implementation of a Probabilistic Word Prediction Program. Master dissertation. Royal Institute of Technology, Stockholm (1997)
Shein, F., Nantais, T., Nishiyama, R., Tam, C., Marshall, P.: Word Cueing for Persons with Writing Difficulties: WordQ. In: Technology and Persons with Disabilities Conference, Los Angeles, CA (2001)
Morris, C., Newell, A., Booth, L., Ricketts, I., Arnott, J.: Syntax Pal: A System to Improve the Written Syntax of Language-Impaired Users. Assistive Techn. 4(2), 51–59 (1992)
Netzer, Y., Adler, M., Elhadad, M.: Word Prediction in Hebrew: Preliminary and Surprising Results. In: ISAAC (2008)
HaCohen-Kerner, Y., Greenfield, I.: Basic Word Completion and Prediction for Hebrew. In: Calderón-Benavides, L., González-Caro, C., Chávez, E., Ziviani, N. (eds.) SPIRE 2012. LNCS, vol. 7608, pp. 237–244. Springer, Heidelberg (2012)
Kimelfeld, B., Kovacs, E., Sagiv, Y., Yahav, D.: Using Language Models and the HITS Algorithm for XML Retrieval. In: Fuhr, N., Lalmas, M., Trotman, A. (eds.) INEX 2006. LNCS, vol. 4518, pp. 253–260. Springer, Heidelberg (2007)
Kirchhoff, K., Vergyri, D., Bilmes, J., Duh, K., Stolcke, A.: Morphology-based Language Modeling for Conversational Arabic Speech Recognition. Computer Speech & Language 20(4), 589–608 (2006)
McMahon, J.G.G.: Statistical Language Processing Based on Self-organising Word Classification. Doctoral dissertation, Queen’s University of Belfast (1994)
Beyerlein, P.: Discriminative Model Combination. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal, vol. 1, pp. 481–484 (1998)
Badaskar, S., Agarwal, S., Arora, S.: Identifying Real or Fake Articles: Towards better Language Modeling. In: IJCNLP, pp. 817–822 (2008)
Adler, M.: Hebrew Morphological Disambiguation: An Unsupervised Stochastic Word-based Approach. Ph.D. Dissertation, Ben Gurion University, Israel (2007)
Adler, M., Netzer, Y., Gabay, D., Goldberg, Y., Elhadad, M.: Tagging a Hebrew Corpus: The Case of Participles. In: LREC 2008, European Language Resources Association, Marrakech, Morocco (2008)
Goldberg, Y., Adler, M., Elhadad, M.: EM Can Find Pretty Good HMM POS-Taggers (When Given a Good Start). In: Proceedings of the ACL 2008 Conference, pp. 746–754 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
HaCohen-Kerner, Y., Applebaum, A., Bitterman, J. (2014). Experiments with Language Models for Word Completion and Prediction in Hebrew. In: Przepiórkowski, A., Ogrodniczuk, M. (eds) Advances in Natural Language Processing. NLP 2014. Lecture Notes in Computer Science(), vol 8686. Springer, Cham. https://doi.org/10.1007/978-3-319-10888-9_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-10888-9_44
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10887-2
Online ISBN: 978-3-319-10888-9
eBook Packages: Computer ScienceComputer Science (R0)