Language Resources and Evaluation, Volume 42, Issue 1, pp 75–98

Language resources for Hebrew

DOI: 10.1007/s10579-007-9050-8

Cite this article as:
Itai, A. & Wintner, S. Lang Resources & Evaluation (2008) 42: 75. doi:10.1007/s10579-007-9050-8

Abstract

We describe a suite of standards, resources and tools for computational encoding and processing of Modern Hebrew texts. These include an array of XML schemas for representing linguistic resources; a variety of text corpora, raw, automatically processed and manually annotated; lexical databases, including a broad-coverage monolingual lexicon, a bilingual dictionary and a WordNet; and morphological processors which can analyze, generate and disambiguate Hebrew word forms. The resources are developed under centralized supervision, so that they are compatible with each other. They are freely available and many of them have already been used for several applications, both academic and industrial.

Keywords

Language resources, Hebrew, Corpora, Lexicon, Morphological processing, WordNet

Copyright information

© Springer Science+Business Media B.V. 2007

Authors and Affiliations

  1. Department of Computer Science, Technion, Israel Institute of Technology, Haifa, Israel
  2. Department of Computer Science, University of Haifa, Haifa, Israel
