Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion

  • Walter M. P. Daelemans
  • Antal P. J. van den Bosch
Chapter

Abstract

We describe an approach to grapheme-to-phoneme conversion that is both language-independent and data-oriented. Given a set of examples in a language (spelled words paired with their phonetic representations), a grapheme-to-phoneme conversion system for that language is produced automatically. The system takes the spelling of a word as its input and produces as its output the phonetic transcription according to the rules implicit in the training data. We describe the design of the system and compare its performance with that of knowledge-based and alternative data-oriented approaches.
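To make the idea of producing a converter from examples concrete, here is a minimal sketch of one simple data-oriented setup. It is not the system described in the chapter. It assumes that each training word is already aligned one letter to one phoneme, that a fixed window of neighbouring letters is enough context, and that an unseen window falls back to the most frequent phoneme for the bare letter. The names train, transcribe, windows, PAD and the toy EXAMPLES are all hypothetical.

```python
from collections import Counter, defaultdict

PAD = "_"  # word-boundary padding symbol (an assumption of this sketch)


def windows(spelling, width=1):
    """Yield one letter window per position: `width` letters of context on each side."""
    padded = PAD * width + spelling + PAD * width
    for i in range(len(spelling)):
        yield padded[i:i + 2 * width + 1]


def train(examples, width=1):
    """Count which phoneme each letter window maps to in the examples.

    `examples` holds (spelling, phonemes) pairs that are assumed to be
    aligned one letter to one phoneme (hypothetical toy format).
    """
    table = defaultdict(Counter)
    for spelling, phonemes in examples:
        assert len(spelling) == len(phonemes), "sketch assumes 1:1 alignment"
        for window, phoneme in zip(windows(spelling, width), phonemes):
            table[window][phoneme] += 1
            # Also count the bare focus letter, used as a fallback
            # when a full window was never seen in training.
            table[window[width]][phoneme] += 1
    return table


def transcribe(table, spelling, width=1):
    """Pick the most frequent phoneme for each window, else for the bare letter."""
    output = []
    for window in windows(spelling, width):
        counts = table.get(window) or table.get(window[width])
        output.append(counts.most_common(1)[0][0] if counts else "?")
    return output


# Invented toy data, not taken from the chapter.
EXAMPLES = [
    ("kat", ["k", "A", "t"]),
    ("kit", ["k", "I", "t"]),
    ("tak", ["t", "A", "k"]),
]

model = train(EXAMPLES)
print(transcribe(model, "tik"))
```

Nothing in the sketch is specific to one language: swapping in aligned examples from another language yields a converter for that language, which is the sense in which such an approach is both data-oriented and language-independent.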

Copyright information

© Springer Science+Business Media New York 1997
