Building a Standard Amazigh Corpus

  • Siham Boulaknadel
  • Fadoua Ataa Allah
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 179)


Natural language processing research has shown growing interest in the Amazigh language in recent years. Suitable resources for Amazighe are becoming vital to the progress of this research. Corpora are an important resource, but Amazighe lacks sufficient resources in this area; we were therefore led to build an Amazighe corpus. In this paper, we present preliminary experimental results with a corpus for Standard Amazighe. We selected samples of published data from different Amazighe varieties. The selection was driven mainly by the amount of data available. Nevertheless, we demonstrate the completeness and representativeness of this corpus using metrics and show its suitability for language engineering experiments.


Machine Translation; Graphic System; Word Processing Program; Corpus Construction; International Phonetic Alphabet
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. Institut Royal de la Culture Amazighe, Rabat, Morocco
