Abstract
We present a method for automatically assigning a stress score to every syllable in an utterance. Syllables are detected from the energy in three bands. A stress score is then calculated for each syllable from features representing the prosodic parameters duration, intensity, and pitch. The algorithm is tested on material collected from real dialogs, and the automatic stress assignment is compared with the results of a stress perception experiment with human listeners.
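The abstract does not say how the three prosodic parameters are combined into a single number, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes syllable boundaries are already known, that one duration, intensity, and pitch value is available per syllable, and that a plausible score is a weighted mean of per-utterance z-scores. The names `Syllable`, `zscores`, `stress_scores`, and the equal default weights are all hypothetical.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class Syllable:
    """One detected syllable with a single value per prosodic parameter (assumed inputs)."""
    duration_ms: float
    intensity_db: float
    pitch_hz: float


def zscores(values):
    """Normalise values to zero mean and unit variance within one utterance."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        # All syllables share the same value, so this feature carries no contrast.
        return [0.0 for _ in values]
    return [(v - m) / s for v in values]


def stress_scores(syllables, weights=(1.0, 1.0, 1.0)):
    """Return one relative stress score per syllable.

    Assumption: the score is a weighted mean of the normalised duration,
    intensity, and pitch. The paper's actual combination rule is not
    given in the abstract.
    """
    if not syllables:
        return []
    dur = zscores([s.duration_ms for s in syllables])
    inten = zscores([s.intensity_db for s in syllables])
    pitch = zscores([s.pitch_hz for s in syllables])
    w_dur, w_int, w_pitch = weights
    total = w_dur + w_int + w_pitch
    return [
        (w_dur * d + w_int * i + w_pitch * p) / total
        for d, i, p in zip(dur, inten, pitch)
    ]


# Illustrative values only, not data from the paper.
utterance = [
    Syllable(100.0, 60.0, 110.0),
    Syllable(250.0, 70.0, 150.0),
    Syllable(120.0, 62.0, 115.0),
]
scores = stress_scores(utterance)
most_stressed = scores.index(max(scores))
```

Normalising within the utterance makes the scores relative: a syllable counts as stressed only in comparison with its neighbours, which is one common way to reduce the influence of speaker and speaking rate. Whether the paper does the same cannot be determined from the abstract.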
Copyright information
© 1988 Springer-Verlag Berlin Heidelberg
Cite this paper
Nöth, E., Niemann, H., Schmölz, S. (1988). Prosodic Features in German Speech: Stress Assignment by Man and Machine. In: Niemann, H., Lang, M., Sagerer, G. (eds) Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series, vol 46. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83476-9_5
DOI: https://doi.org/10.1007/978-3-642-83476-9_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-83478-3
Online ISBN: 978-3-642-83476-9
eBook Packages: Springer Book Archive