Synthesizing Allophonic Glottalization

  • Chapter
Progress in Speech Synthesis

Abstract

This chapter presents a method for synthesizing allophonic glottalization. The method is motivated by empirical studies of the phonological context for glottalization and of its acoustic consequences. A baseline study of production explored glottalization in two situations: (1) vowel-vowel hiatus across a word boundary, and (2) voiceless stops before sonorants. The study showed that allophonic glottalization depends on the segmental context, syllabic position, and phrasal prosody. Successful synthesis of contextually appropriate glottalization requires an architecture with a running window over a fully parsed phonological structure, or its effective equivalent. The signal coding used was based on the source model and cascade formant synthesis presented by [Kla87]. Synthesis of glottalization can be achieved by lowering the fundamental frequency (F0), keeping all other factors in formant synthesis constant. Thus, any synthesis procedure that can directly control F0 will be able to reproduce glottalization in a similar manner. For fully natural, theoretically correct synthesis, additional control parameters are needed to control the length of the glottal pulse and the spectral tilt.
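
As a rough illustration of the F0-lowering idea described in the abstract, the following Python sketch builds a per-frame F0 contour that dips over a glottalized span and converts it into glottal pulse instants. Everything specific here is an assumption made for illustration and is not taken from the chapter: the function names `f0_contour` and `pulse_times`, the raised-cosine shape of the dip, the 5 ms frame, and the numeric values. The sketch produces pulse timing only; it does not implement the [Kla87] source model or the formant filtering, and it leaves out the pulse-length and spectral-tilt controls that the abstract says are needed for fully natural synthesis.

```python
import math


def f0_contour(n_frames, base_f0, start, end, dip_f0):
    """Per-frame F0 in Hz (hypothetical helper).

    F0 equals base_f0 outside frames [start, end) and dips smoothly
    toward dip_f0 at the centre of that span.
    """
    contour = []
    for i in range(n_frames):
        if start <= i < end:
            # Raised-cosine dip: 0 at the edges of the span, 1 at its centre.
            pos = (i - start + 0.5) / (end - start)
            depth = 0.5 * (1.0 - math.cos(2.0 * math.pi * pos))
            contour.append(base_f0 - depth * (base_f0 - dip_f0))
        else:
            contour.append(base_f0)
    return contour


def pulse_times(contour, frame_dur):
    """Glottal pulse instants in seconds (hypothetical helper).

    Phase is accumulated frame by frame; a pulse is emitted each time
    the running integral of F0 completes one full cycle.
    """
    times = []
    phase = 0.0
    for i, f0 in enumerate(contour):
        phase += f0 * frame_dur
        while phase >= 1.0:
            phase -= 1.0
            # The leftover phase gives how long before the frame end
            # the cycle boundary was crossed.
            times.append((i + 1) * frame_dur - phase / f0)
    return times


# Assumed values: 60 frames of 5 ms, modal F0 of 120 Hz,
# glottalized span over frames 20..39 dipping toward 40 Hz.
contour = f0_contour(60, 120.0, 20, 40, 40.0)
times = pulse_times(contour, 0.005)
intervals = [b - a for a, b in zip(times, times[1:])]
```

In this sketch the pulse intervals lengthen inside the dipped span while every other setting is untouched, which is the sense in which a synthesizer with direct F0 control could approximate glottalization. The pulse instants could then drive whatever source and filter stages a given synthesizer provides.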

References

  1. T. Ananthapadmanabha. Acoustic analysis of voice source dynamics. Speech Transmission Laboratory 2-3:1–24, 1984.
  2. H. T. Bunnell, D. Yarrington, and K. E. Barner. Pitch control in diphone synthesis. In Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 127–130, 1994.
  3. J. Coleman. The phonetic interpretation of headed phonological structures containing overlapping constituents. Phonology 9(1):1–44, 1994.
  4. A. Dirksen and J. Coleman. All-prosodic synthesis architecture. In Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 232–235, 1994.
  5. C. Gobl. A preliminary study of acoustic voice quality correlates. Speech Transmission Laboratory 4:9–27, 1989.
  6. I. Karlsson. Voice source dynamics for female speakers. In Proceedings of ICSLP ’90, 69–72, 1990.
  7. D. H. Klatt. Text-to-speech conversion. J. Acoust. Soc. Amer. 82(3):737–793, 1987.
  8. K. J. Kohler. Glottal stops and glottalization in German. Phonetica 51:38–51, 1994.
  9. P. Ladefoged. A Course in Phonetics, 3rd edition. Harcourt Brace Jovanovich, Fort Worth, 1993.
  10. J. Local. Modelling assimilation in a non-segmental rule-free phonology. In Papers in Laboratory Phonology II, G. Docherty and D. Ladd, eds. Cambridge University Press, Cambridge, 190–223, 1992.
  11. E. Moulines and F. Charpentier. Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Comm. 9:453–467, 1990.
  12. J. Pierrehumbert and M. Beckman. Japanese Tone Structure. MIT Press, Cambridge, Mass., 1988.
  13. J. Pitrelli, M. Beckman, and J. Hirschberg. Evaluation of prosodic transcription labelling reliability in the ToBI framework. In Proceedings of ICSLP ’94, 18–22, 1994.
  14. J. Pierrehumbert. Prosodic effects on glottal allophones. In Vocal Fold Physiology 8, O. Fujimura, ed. Singular Publishing Group, San Diego, 39–60, 1995.
  15. J. Pierrehumbert. Knowledge of variation. In Papers from the 30th Regional Meeting of the Chicago Linguistic Society. University of Chicago, Chicago, 1995.
  16. J. Pierrehumbert and D. Talkin. Lenition of /h/ and glottal stop. In Papers in Laboratory Phonology II, G. Docherty and D. Ladd, eds. Cambridge University Press, Cambridge, 90–116, 1992.
  17. A. E. Rosenberg. Effect of glottal pulse shape on the quality of natural vowels. J. Acoust. Soc. Amer. 49:583–590, 1971.

Copyright information

© 1997 Springer Science+Business Media New York

About this chapter

Cite this chapter

Pierrehumbert, J.B., Frisch, S. (1997). Synthesizing Allophonic Glottalization. In: van Santen, J.P.H., Olive, J.P., Sproat, R.W., Hirschberg, J. (eds) Progress in Speech Synthesis. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-1894-4_2

  • DOI: https://doi.org/10.1007/978-1-4612-1894-4_2

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4612-7328-8

  • Online ISBN: 978-1-4612-1894-4

  • eBook Packages: Springer Book Archive
