From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs
This paper describes the porting of our letter-to-sound module from European Portuguese to Mirandese, the second official language of Portugal. We describe the rule formalism and the composition of the various transducers involved in letter-to-sound conversion. We propose a set of extra SAMPA symbols for the phonetic transcription of Mirandese, and we briefly cover the rule sets and the results obtained for the two languages. We also describe our efforts, still at a very preliminary stage, to build a waveform generation module that is likewise based on finite state transducers. The use of finite state transducers provided a very flexible and modular framework for deriving and testing new rule sets. Our experience leads us to believe that letter-to-sound modules could be helpful tools for researchers involved in establishing orthographic conventions for lesser-spoken languages.
Keywords: Regular Expression, Speech Synthesis, Pitch Period, Phonetic Transcription, Line Spectral Frequency
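The idea of building letter-to-sound conversion from a composition of small rule transducers can be illustrated with a minimal sketch. It is not the rule formalism or the rule set used in the paper: each stage here is a regular-expression rewrite standing in for one transducer, function chaining stands in for transducer composition, and the names `RULES`, `compile_rule`, `compose` and `letter_to_sound` are hypothetical. The handful of European Portuguese rules covers consonants only, so vowels and other letters are left as graphemes.

```python
import re
from functools import reduce

VOWELS = "aeiou"

# Toy context-dependent rewrite rules (grapheme pattern -> SAMPA symbol).
# Assumption: illustrative subset only, not the rule set from the paper.
# Order matters: the intervocalic "s" rule must run before the rules that
# themselves produce "s", otherwise their output would be voiced as well.
RULES = [
    (r"ch", "S"),                                   # digraph "ch" -> /S/
    (r"nh", "J"),                                   # digraph "nh" -> /J/
    (r"lh", "L"),                                   # digraph "lh" -> /L/
    (rf"(?<=[{VOWELS}])s(?=[{VOWELS}])", "z"),      # "s" between vowels -> /z/
    (r"ss", "s"),                                   # "ss" -> /s/
    (r"ç", "s"),                                    # "ç" -> /s/
]


def compile_rule(pattern, replacement):
    """Turn one rewrite rule into a string-to-string function (one 'stage')."""
    regex = re.compile(pattern)
    return lambda text: regex.sub(replacement, text)


def compose(*stages):
    """Chain stages left to right, mimicking composition of transducers."""
    return lambda text: reduce(lambda acc, stage: stage(acc), stages, text)


# The full cascade is the composition of all individual rule stages.
letter_to_sound = compose(*(compile_rule(p, r) for p, r in RULES))

if __name__ == "__main__":
    for word in ["chave", "casa", "caça", "passo", "filho", "vinho"]:
        print(word, "->", letter_to_sound(word))
```

Because every rule is a separate stage, porting to another language in this sketch would amount to editing, reordering or replacing entries in `RULES` and re-running the cascade, which is the kind of modularity the abstract attributes to the finite state approach.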