Skip to main content

Prosodic Features in German Speech: Stress Assignment by Man and Machine

  • Conference paper

Part of the book series: NATO ASI Series ((NATO ASI F,volume 46))

Abstract

We present a method for automatically assigning a stress score to every syllable in an utterance. The syllables are detected by looking at the energy in three energy bands. Based on features representing the prosodic parameters duration, intensity, and pitch, a stress assignment number for each syllable is calculated. The algorithm is tested on material collected from real dialogs. The automatic stress assignment is compared with the result of a stress perception experiment by human listeners.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Bibliography

  1. Dik, S. G: “Functional Grammar”, Foris Publications, Dordrecht, 1981.

    Google Scholar 

  2. Ehrlich, U., Niemann, H.: “Using Semantic and Pragmatic Knowledge for the Interpretation of Syntactic Constituents”, in this volume.

    Google Scholar 

  3. Kunzmann, S., Kuhn, T., Niemann, H.: “An Experimental Environment for Generating Word Hypotheses in Continuous Speech”, in this volume.

    Google Scholar 

  4. Lea, W.: “Prosodic Aids to Speech Recognition”, in Lea, W.: “Trends in Speech Recognition”, Prentice Hall, N. J., 1980.

    Google Scholar 

  5. Niemann, H., Brietzmann, A., Mühlfeld, R., Regel, P., Schukat, G.: “The Speech Understanding and Dialog System EVAR”, in De Mori, R., Suen, G: “New Systems and Architectures for Automatic Speech Recognition and Synthesis”, Springer Verlag, Berlin, 1985.

    Google Scholar 

  6. Schmölz, S.: “Zur automatischen Bestimmung der Satzbetonung”, Diplomarbeit, Lehrstuhl für Informatik 5 (Mustererkennung), Universität Erlangen, 1987.

    Google Scholar 

  7. [SENSennef, S.: “Real-time harmonic pitch detector”, IEEE Trans. Acoust, Speech, Signal Processing, vol. ASSP-26, S. 358–364, 1978.

    Article  Google Scholar 

  8. Vaissiere, J.: “A Suprasegmental Component in a French Speech Recognition System: Reducing the Number of Lexical Hypotheses and Detecting the Main Boundary”, Recherches/Acoustique, CNET, Vol. 7, 1982/83.

    Google Scholar 

  9. Waibel, A.: “Prosody and Speech Recognition”, Ph.D. thesis, Carnegie-Mellon University, Pittsburgh, USA, 1986.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1988 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nöth, E., Niemann, H., Schmölz, S. (1988). Prosodic Features in German Speech: Stress Assignment by Man and Machine. In: Niemann, H., Lang, M., Sagerer, G. (eds) Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series, vol 46. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83476-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-83476-9_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-83478-3

  • Online ISBN: 978-3-642-83476-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics