Advertisement

Automatic Morphological Annotation in a Text-to-Speech System for Hebrew

  • Branislav Popović
  • Milan Sečujski
  • Vlado Delić
  • Marko Janev
  • Igor Stanković
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8113)

Abstract

The paper presents the module for automatic morphological annotation within a text synthesizer for Hebrew, based on an efficient combination of two approaches. The first approach includes the selection of lexemes from appropriate lexica, while the other approach involves automatic morphological analysis of text input using a complex expert algorithm relying on a set of transformational rules and using 6 types of scoring procedures. The module operates on a set of 30 part-of-speech tags with more than 3000 corresponding morphological categories. The paper discusses the advantages of the proposed method in the context of an extremely morphologically complex language such as Hebrew, with particular emphasis given to the relative importance of individual scoring procedures. When all 6 scoring procedures are applied, the accuracy of 99.6% is achieved on a corpus of 3093 sentences (55046 words).

Keywords

part-of-speech tagging speech synthesis Hebrew 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Manning, C., Schütze, H.: Foundations of statistical natural language processing. MIT Press, Cambridge (2000)Google Scholar
  2. 2.
    Aronoff, M., Rees-Miller, J.: Morphophonemics of modern Hebrew. Wiley-Blackwell, San Francisco (2003)Google Scholar
  3. 3.
    Fellman, J.: Concerning the “revival” of the Hebrew language. Anthropol. Linguist. 15(5), 250–257 (1973)Google Scholar
  4. 4.
    Lembersky, G., Shacham, D., Wintner, S.: Morphological disambiguation of Hebrew: A case study in classifier combination. Nat. Lang. Eng. Available on CJO 2012 (2012)Google Scholar
  5. 5.
    Wintner, S.: Hebrew computational linguistics: Past and Future. Artif. Intell. Rev. 21(2), 113–138 (2004)MathSciNetzbMATHCrossRefGoogle Scholar
  6. 6.
    Bar-Haim, R., Sima’an, K., Winter, Y.: Part-of-speech tagging of modern Hebrew text. Nat. Lang. Eng. 14(2), 223–251 (2008)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  • Branislav Popović
    • 1
  • Milan Sečujski
    • 1
  • Vlado Delić
    • 1
  • Marko Janev
    • 2
  • Igor Stanković
    • 3
  1. 1.Faculty of Technical SciencesUniversity of Novi SadNovi SadSerbia
  2. 2.Mathematical InstituteSerbian Academy of Sciences and ArtsBelgradeSerbia
  3. 3.Brest National Engineering SchoolEuropean Center for Virtual RealityBrestFrance

Personalised recommendations