Skip to main content

From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs

  • Conference paper
  • First Online:
Computational Processing of the Portuguese Language (PROPOR 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2721))

Abstract

This paper describes our efforts in porting our letter-to-sound module from European Portuguese to Mirandese, the second official language in Portugal. We describe the rule formalism and the composition of the various transducers involved in the letter-to-sound conversion. We propose a set of extra SAMPA symbols to be used in the phonetic transcription of Mirandese, and we briefly cover the set of rules and results obtained for the two languages. Although at a very preliminary stage, we also describe our efforts at building a waveform generation module also based on finite state transducers. The use of finite state transducers allowed a very flexible and modular framework for deriving and testing new rule sets. Our experience led us to believe that letter-to-sound modules could be helpful tools for researchers involved in the establishment of orthographic conventions for lesser spoken languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. Barros-Ferreira and D. Raposo, editors. Convenção Ortográfica da Língua Mirandesa. Câmara Municipal de Miranda do Douro — Centro de Linguística da Universidade de Lisboa, 1999.

    Google Scholar 

  2. I. Trancoso, M. Viana, F. Silva, G. Marques, and L. Oliveira. Rule-based vs. neural network based approaches to letter-to-phone conversion for portuguese common and proper names. In Proc. ICSLP’ 94, Yokohama, Japan, September 1994.

    Google Scholar 

  3. L. Oliveira, M.C. Viana, A.I. Mata, and I. Trancoso. Progress report of project dixi+: A portuguese text-to-speech synthesizer for alternative and augmentative communication. Technical report, FCT, January 2001.

    Google Scholar 

  4. D. Caseiro, I. Trancoso, L. Oliveira, and C. Viana. Grapheme-to-phone using finite state transducers. In Proc. 2002 IEEE Workshop on Speech Synthesis, Santa Monica, CA, USA, September 2002.

    Google Scholar 

  5. L. Oliveira, M. Viana, and I. Trancoso. A rule-based text-to-speech system for portuguese. In Proc. ICASSP’ 1992, San Francisco, USA, March 1992.

    Google Scholar 

  6. K. Koskenniemi. Two-Level morphology: A general Computational Model for Word-Form Recognition and Production. PhD thesis, University of Helsinki, 1983.

    Google Scholar 

  7. E.L. Antworth. Pc-kimmo: A two-level processor for morphological analysis. Technical report, Occasional Publications in Academic Computing No 16. Dallas, TX: Summer Institute of Linguistics, 1990.

    Google Scholar 

  8. M. Mohri and R. Sproat. An efficient compiler for weighted rewrite rules. In 34th Annual Meeting of the Association for Computational Linguistics, Santa Cruz, USA, 1996.

    Google Scholar 

  9. J. Vasconcellos. Estudos de Philologia Mirandesa. Imprensa Nacional, Lisboa, 1900.

    Google Scholar 

  10. D. Caseiro, I. Trancoso, C. Viana, and M. Barros. A comparative description of gtop modules for portuguese and mirandese using finite state transducers. In Proc. ICPhS’ 2003, Barcelona, Spain, August 2003.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Trancoso, I., Viana, C., Barros, M., Caseiro, D., Paulo, S. (2003). From Portuguese to Mirandese: Fast Porting of a Letter-to-Sound Module Using FSTs. In: Mamede, N.J., Trancoso, I., Baptista, J., das Graças Volpe Nunes, M. (eds) Computational Processing of the Portuguese Language. PROPOR 2003. Lecture Notes in Computer Science(), vol 2721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45011-4_7

Download citation

  • DOI: https://doi.org/10.1007/3-540-45011-4_7

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40436-1

  • Online ISBN: 978-3-540-45011-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics