Abstract
This paper outlines the process of creating a new voice in the MARY Text-to-Speech (TtS) platform, evaluating the existing tools and methodology and proposing extensions to them. It focuses in particular on the development of the phoneme set, the Grapheme to Phone (GtP) conversion module, and the subsequent process of generating a corpus for building the new voice. The work was carried out as part of adding support for the Greek language to the MARY TtS system; however, the outlined methodology should be applicable to other languages as well.
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Stavropoulou, P., Tsonos, D., Kouroupetroglou, G. (2014). Language Resources and Evaluation for the Support of the Greek Language in the MARY Text-to-Speech. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science, vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_63
DOI: https://doi.org/10.1007/978-3-319-10816-2_63
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer Science (R0)