Skip to main content

Experiments with Language Models for Word Completion and Prediction in Hebrew

  • Conference paper
Advances in Natural Language Processing (NLP 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8686))

Included in the following conference series:

Abstract

In this paper, we describe various language models (LMs) and combinations created to support word prediction and completion in Hebrew. We define and apply 5 general types of LMs: (1) Basic LMs (unigrams, bigrams, trigrams, and quadgrams), (2) Backoff LMs, (3) LMs Integrated with tagged LMs, (4) Interpolated LMs, and (5) Interpolated LMs Integrated with tagged LMs. 16 specific implementations of these LMs were compared using 3 types of Israeli web newspaper corpora. The foremost keystroke saving results were achieved with LMs of the most complex variety, the Interpolated LMs Integrated with tagged LMs. Therefore, we conclude that combining all strengths by creating a synthesis of all four basic LMs and the tagged LMs leads to the best results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Tam, C., Wells, D.: Evaluating the Benefits of Displaying Word Prediction Lists on a Personal Digital Assistant at the Keyboard Level. Assistive Technology 21, 105–114 (2009)

    Article  Google Scholar 

  2. Anson, D., Moist, P., Przywara, M., Wells, H., Saylor, H., Maxime, H.: The Effects of Word Completion and Word Prediction on Typing Rates Using On-Screen Keyboards. Assistive Technology 18, 146–154 (2006)

    Article  Google Scholar 

  3. Beukelman, D., Mirenda, P.: Augmentative and Alternative Communication: Supporting Children and Adults with Complex Communication Needs, 3rd edn., p. 77. Brookes Publishing, Baltimore, MD (2008)

    Google Scholar 

  4. Darragh, J.J., Witten, I.H., James, M.L.: The Reactive Keyboard: A Predictive Typing Aid. Computer 23(11), 41–49 (1990)

    Article  Google Scholar 

  5. Calculator, S., et al.: Roles and Responsibilities of Speech-Language Pathologists With Respect to Augmentative and Alternative Communication: Position Statement. Technical report, American Speech-Language-Hearing Association (2004), http://www.asha.org/docs/html/PS2005-00113.html

  6. Fossett, B., Mirenda, P.: Augmentative and Alternative Communication. In: Odom, S.L., Horner, R.H., Snell, M.E. (eds.) Handbook of Developmental Disabilities, pp. 330–366. Guilford Press (2009) ISBN 978-1-60623-248-4

    Google Scholar 

  7. Beukelman, D., Mirenda, P.: Augmentative & Alternative Communication: Supporting Children & Adults with Complex Communication Needs, 3rd edn. Paul H. Brookes Publishing Company (2005) ISBN 978-1-55766-684-0

    Google Scholar 

  8. Trnka, K., McCaw, J., Yarrington, D., McCoy, K.F.: User Interaction with Word Prediction: The Effects of Prediction Quality. Special Issue of ACM Trans. on Accessible Computing (TACCESS) on Augmentative and Alternative Communication 1(3), 1–34 (2009)

    Google Scholar 

  9. Darragh, J.J., Witten, I.H.: Adaptive Predictive Text Generation and the Reactive Keyboard. Interacting with Computers 3(1), 27–50 (1991)

    Article  Google Scholar 

  10. Wandmacher, T., Antoine, J.-Y.: Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component. In: Proc. ACL SIGDAT Joint Conference EMNLP-CoLLN 2007, Prague, Tchéquie, pp. 503–513 (2007)

    Google Scholar 

  11. Newell, A., Langer, S., Hickey, M.: The Rôle of Natural Language Processing in Alternative and Augmentative Communication. Natural Language Engineering 4(1), 1–16 (1998)

    Article  Google Scholar 

  12. Carlberger, A., Carlberger, J., Magnuson, T., Hunnicutt, M.S., Palazuelos-Cagigas, S.E., Navarro, S.A.: Profet, a New Generation of Word Prediction: An Evaluation Study. In: Copestake, A., Langer, S., Palazuelos-Cagigas, S. (eds.) Natural Lang. Processing for Communication Aids, Madrid. Proc. of a Workshop Sponsored by acl, pp. 23–28 (1997)

    Google Scholar 

  13. Li, J., Hirst, G.: Semantic Knowledge in Word Completion. In: Proceedings of the 7th Int. ACM SIGACCESS Conf. on Computers and Accessibility (ASSETS), pp. 121–128 (2005)

    Google Scholar 

  14. Trnka, K., McCoy, K.F.: Evaluating Word Prediction: Framing Keystroke Savings. In: ACL (Short Papers) 2008, pp. 261–264 (2008)

    Google Scholar 

  15. Swiffin, A.L., Pickering, J.A., Arnott, J.L., Newell, A.F.: PAL: An Effort Efficient Portable Communication Aid and Keyboard Emulator. In: Proceedings of the 8th Annual Conference on Rehabilitation Technology, pp. 197–199 (1985)

    Google Scholar 

  16. Carlberger, J.: Word Prediction: Design and Implementation of a Probabilistic Word Prediction Program. Master dissertation. Royal Institute of Technology, Stockholm (1997)

    Google Scholar 

  17. Shein, F., Nantais, T., Nishiyama, R., Tam, C., Marshall, P.: Word Cueing for Persons with Writing Difficulties: WordQ. In: Technology and Persons with Disabilities Conference, Los Angeles, CA (2001)

    Google Scholar 

  18. Morris, C., Newell, A., Booth, L., Ricketts, I., Arnott, J.: Syntax Pal: A System to Improve the Written Syntax of Language-Impaired Users. Assistive Techn. 4(2), 51–59 (1992)

    Article  Google Scholar 

  19. Netzer, Y., Adler, M., Elhadad, M.: Word Prediction in Hebrew: Preliminary and Surprising Results. In: ISAAC (2008)

    Google Scholar 

  20. HaCohen-Kerner, Y., Greenfield, I.: Basic Word Completion and Prediction for Hebrew. In: Calderón-Benavides, L., González-Caro, C., Chávez, E., Ziviani, N. (eds.) SPIRE 2012. LNCS, vol. 7608, pp. 237–244. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  21. Kimelfeld, B., Kovacs, E., Sagiv, Y., Yahav, D.: Using Language Models and the HITS Algorithm for XML Retrieval. In: Fuhr, N., Lalmas, M., Trotman, A. (eds.) INEX 2006. LNCS, vol. 4518, pp. 253–260. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  22. Kirchhoff, K., Vergyri, D., Bilmes, J., Duh, K., Stolcke, A.: Morphology-based Language Modeling for Conversational Arabic Speech Recognition. Computer Speech & Language 20(4), 589–608 (2006)

    Article  Google Scholar 

  23. McMahon, J.G.G.: Statistical Language Processing Based on Self-organising Word Classification. Doctoral dissertation, Queen’s University of Belfast (1994)

    Google Scholar 

  24. Beyerlein, P.: Discriminative Model Combination. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal, vol. 1, pp. 481–484 (1998)

    Google Scholar 

  25. Badaskar, S., Agarwal, S., Arora, S.: Identifying Real or Fake Articles: Towards better Language Modeling. In: IJCNLP, pp. 817–822 (2008)

    Google Scholar 

  26. Adler, M.: Hebrew Morphological Disambiguation: An Unsupervised Stochastic Word-based Approach. Ph.D. Dissertation, Ben Gurion University, Israel (2007)

    Google Scholar 

  27. Adler, M., Netzer, Y., Gabay, D., Goldberg, Y., Elhadad, M.: Tagging a Hebrew Corpus: The Case of Participles. In: LREC 2008, European Language Resources Association, Marrakech, Morocco (2008)

    Google Scholar 

  28. Goldberg, Y., Adler, M., Elhadad, M.: EM Can Find Pretty Good HMM POS-Taggers (When Given a Good Start). In: Proceedings of the ACL 2008 Conference, pp. 746–754 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

HaCohen-Kerner, Y., Applebaum, A., Bitterman, J. (2014). Experiments with Language Models for Word Completion and Prediction in Hebrew. In: Przepiórkowski, A., Ogrodniczuk, M. (eds) Advances in Natural Language Processing. NLP 2014. Lecture Notes in Computer Science(), vol 8686. Springer, Cham. https://doi.org/10.1007/978-3-319-10888-9_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10888-9_44

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10887-2

  • Online ISBN: 978-3-319-10888-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics