Language Resources and Evaluation, Volume 42, Issue 1, pp 75–98

Language resources for Hebrew


DOI: 10.1007/s10579-007-9050-8

Cite this article as:
Itai, A. & Wintner, S. Lang Resources & Evaluation (2008) 42: 75. doi:10.1007/s10579-007-9050-8


We describe a suite of standards, resources and tools for computational encoding and processing of Modern Hebrew texts. These include an array of XML schemas for representing linguistic resources; a variety of text corpora, raw, automatically processed and manually annotated; lexical databases, including a broad-coverage monolingual lexicon, a bilingual dictionary and a WordNet; and morphological processors which can analyze, generate and disambiguate Hebrew word forms. The resources are developed under centralized supervision, so that they are compatible with each other. They are freely available and many of them have already been used for several applications, both academic and industrial.


Language resources, Hebrew, Corpora, Lexicon, Morphological processing, WordNet

Copyright information

© Springer Science+Business Media B.V. 2007

Authors and Affiliations

  1. Department of Computer Science, Technion, Israel Institute of Technology, Haifa, Israel
  2. Department of Computer Science, University of Haifa, Haifa, Israel
