Advertisement

Normalization of Non-standard Words with Finite State Transducers for Russian Speech Synthesis

  • Artem Lukanin
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 542)

Abstract

This paper describes finite state transducers employed for expansion of numbers, acronyms and graphic abbreviations into full-word numerals and phrases in the task of Russian speech synthesis. The developed finite state transducers cover cardinal and ordinal numbers, convert phone numbers, dates, codes, etc. The developed project is the first Russian open-source normalization system known to the author.

Keywords

Preprocessing Text-to-speech Morphology Numeral Abbreviation Acronym 

References

  1. 1.
    Reichel, U.D., Pfitzinger, H.R.: Text preprocessing for speech synthesis (2006)Google Scholar
  2. 2.
    The Festival Speech Synthesis System. http://www.cstr.ed.ac.uk/projects/festival/
  3. 3.
  4. 4.
    Paumier, S.: Unitex 3.1.beta User Manual. Université Paris-Est Marne-la-Vallée. http://igm.univ-mlv.fr/~unitex/UnitexManual3.1.pdf (2015). Accessed 15 Jan 2015
  5. 5.
    Dutoit, T.: An Introduction to Text-to-Speech Synthesis, vol. 3. Springer Science & Business Media, Berlin (1997)Google Scholar
  6. 6.
    Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorfk, M., Richards, C.: Normalization of non-standard words. Comput. Speech Lang. 15, 287–333 (2001)CrossRefGoogle Scholar
  7. 7.
    Sproat, R.: Lightly supervised learning of text normalization: Russian number names. In: Spoken Language Technology Workshop (SLT), 2010 IEEE, pp. 436–441. IEEE, December 2010Google Scholar
  8. 8.
    Khomitsevich, O.G., Rybin, S.V., Anichkin, I.M.: Linguistic analysis for text normalization and homonymy resolution in a Russian TTS system [Иcпoльзoвaниe лингвиcтичecкoгo aнaлизa для нopмaлизaции тeкcтa и cнятия oмoнимии в cиcтeмe cинтeзa pyccкoй peчи]. Instrument making. Thematic issue “Speech information systems” [Пpибopocтpoeниe. Teмaтичecкий выпycк «Peчeвыe инфopмaциoнныe cиcтeмы»], vol. 2, pp. 42–46. Izvestija vuzov (2013)Google Scholar
  9. 9.
    Nagel, S.: Formenbildung im Russischen. Formale Beschreibung und Automatisierung für das CISLEX-Wörterbuchsystem (2002)Google Scholar
  10. 10.
    Russian Grammar [Pyccкaя гpaммaтикa], vol. 1. Nauka, Moscow (1980)Google Scholar
  11. 11.
    Rosental, D.E., Golub, I.B., Telenkova, M.A.: The Modern Russian Language [Coвpeмeнный pyccкий язык]. Airis-Press, Moscow (1997)Google Scholar
  12. 12.
    Rosental, D.E., Djandjakova, E.V., Kabanova, N.P.: Reference Book on Orthography, Pronunciation, Literary Editing [Cпpaвoчник пo пpaвoпиcaнию, пpoизнoшeнию, литepaтypнoмy peдaктиpoвaнию]. CheRo, Moscow (1998)Google Scholar
  13. 13.
    Linguistics. Big encyclopedic dictionary [Языкoзнaниe. Бoльшoй энциклoпeдичecкий cлoвapь]. Big Russian Encyclopedy, Moscow (1998)Google Scholar
  14. 14.
    Akhmanova, O.S.: The Dictionary of Linguistic Terms [Cлoвapь лингвиcтичecкиx тepминoв]. Editorial URSS, Moscow (2004)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.South Ural State UniversityChelyabinskRussia

Personalised recommendations