Natural Language Processing of Semitic Languages

  • Imed Zitouni

Table of contents

  1. Front Matter
    Pages i-xxiv
  2. Natural Language Processing Core-Technologies

    1. Front Matter
      Pages 1-1
    2. Ray Fabri, Michael Gasser, Nizar Habash, George Kiraz, Shuly Wintner
      Pages 3-41
    3. Reut Tsarfaty
      Pages 67-128
    4. Mona Diab, Yuval Marton
      Pages 129-159
    5. Ilana Heintz
      Pages 161-196
  3. Natural Language Processing Applications

    1. Front Matter
      Pages 197-197
    2. Hany Hassan, Kareem Darwish
      Pages 199-219
    3. Behrang Mohit
      Pages 221-245
    4. Khadiga Mahmoud Seddik, Ali Farghaly
      Pages 247-277
    5. Vittorio Castelli, Imed Zitouni
      Pages 279-298
    6. Kareem Darwish
      Pages 299-334
    7. Yassine Benajiba, Paolo Rosso, Lahsen Abouenour, Omar Trigui, Karim Bouzoubaa, Lamia Belguith
      Pages 335-370
    8. Lamia Hadrich Belguith, Mariem Ellouze, Mohamed Hedi Maaloul, Maher Jaoua, Fatma Kallel Jaoua, Philippe Blache
      Pages 371-408
    9. Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen Chu et al.
      Pages 409-459

About this book


Research in Natural Language Processing (NLP) has rapidly advanced in recent years, resulting in exciting algorithms for sophisticated processing of text and speech in various languages. Much of this work focuses on English; in this book we address another group of interesting and challenging languages for NLP research: the Semitic languages. The Semitic group of languages includes Arabic (206 million native speakers), Amharic (27 million), Hebrew (7 million), Tigrinya (6.7 million), Syriac (1 million) and Maltese (419 thousand). Semitic languages exhibit unique morphological processes, challenging syntactic constructions, and various other phenomena that are less prevalent in other natural languages. These challenges call for unique solutions, many of which are described in this book.

The 13 chapters presented in this book bring together leading scientists from several universities and research institutes worldwide. While this book devotes some attention to cutting-edge algorithms and techniques, its primary purpose is a thorough explication of best practices in the field. Furthermore, every chapter describes how the techniques discussed apply to Semitic languages. The book covers both statistical approaches to NLP, which are dominant across various applications nowadays, and the more traditional rule-based approaches, which have proven useful in several other application domains. We hope that this book will provide a "one-stop shop" for all the requisite background and practical advice when building NLP applications for Semitic languages.


Arabic, Hebrew, Syriac, Amharic; Information Retrieval; Language Modeling; Machine Translation; Morphology; Natural Language Processing; Semantic parsing; Semitic Languages; Speech Recognition

Editors and affiliations

  • Imed Zitouni (1)
  1. Microsoft, Redmond, USA

Bibliographic information

  • DOI
  • Copyright Information: Springer-Verlag Berlin Heidelberg 2014
  • Publisher Name: Springer, Berlin, Heidelberg
  • eBook Packages: Computer Science, Computer Science (R0)
  • Print ISBN: 978-3-642-45357-1
  • Online ISBN: 978-3-642-45358-8
  • Series Print ISSN: 2192-032X
  • Series Online ISSN: 2192-0338