Abstract
In this paper, the various pitch determination methods and algorithms (PDAs) are grouped into two major classes: time-domain PDAs and short-term analysis PDAs. The short-term analysis PDAs leave the signal domain by a short-term transformation. They supply a sequence of average pitch estimates from consecutive frames. The individual algorithm is characterized by the short-term transform it applies. The time-domain methods, on the other hand, track the signal period by period. Extraction and isolation of the fundamental harmonic, and investigation of the temporal signal structure are the two extremes between which most of these PDAs are found. After a short review of these principles, the paper finally discusses different application-oriented aspects, i.e. the role of the PDA in phonetics, education, phoniatrics, and speech communication systems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
Bibliography
BAUDRY M., DUPEYRAT P. (1976): Analyse du signal vocal — Utilisation des extrema du signal et de leurs amplitudes — detection du fondamental et recherche des formants. Proc.Journ. Etud.Parole (Nancy), vol. 7, pp. 247...257.
DE MORI R., LAFACE P., MAKHONINE V.A., MEZZALAMA M. (1977): A syntactic procedure for the recognition of glottal pulses in continuous speech. Pattern Recognition, vol. 9, pp. 181...189
DIBBERN U. (1972): Grundfrequenzmessung bei der menschlichen Sprache. Fortschritte der Akustik. Bericht von der Gemeinschaftstagung Stuttgart, pp. 345...348
DOLANSKY L.O. (1955): An instantaneous pitch-period indicator. J.Acoust.Soc.Am., vol. 27, pp. 67...72
DUBNOWSKI J.J., SCHAFER R.W., RABINER L.R. (1976): Real-time digital hardware pitch detector. IEEE-T-ASSP, vol. 24, pp. 2... 8
ERB H.J. (1972.2): Untersuchung und Vergleich von Verfahren zur Erkennung der Sprachgrundfrequenz. Darmstadt: TH Darmstadt; Res.Rept. 41, Inst. f. Übertragungstechnik, 41 pp.
FANT C.G.M. (1958): Modern Instruments and Methods for Acoustic Studies of Speech. Proc. 8th Congress of Linguistics, Oslo, pp. 282...358
FILIP M. (1969): Envelope periodicity detection. J.Acoust.Soc. Am., vol. 45, pp. 719...732
FLANAGAN J.L. (1972.1): Speech analysis, synthesis, and perception. Berlin/New York: Springer.
FLANAGAN J.L., SASLOW M.G. (1958): Pitch discrimination for synthetic vowels. J.Acoust.Soc.Am., vol. 30, pp. 435...442
FRIEDMAN D.H. (1977): Pseudo-maximum-likelihood pitch extraction. IEEE-T-ASSP, vol. 25, pp. 213...221
FRIEDMAN D.H. (1978): Multidimensional pseudo-maximum-likelihood pitch estimation. IEEE-T-ASSP, vol. 26, pp. 185...196
GALAND C, ESTEBAN D., DUBUS F. (1976): Détection de la mélodie par autocorrélation non linéaire. Proc.Journ.Etud.Parole (Nancy), vol. 7, pp. 333...345.
GOLD B. (1977): Digital Speech Networks. Proc.IEEE, vol. 65, pp. 1636...1658.
GOLD P., RABINER L.R. (1969): Parallel processing techniques for estimating pitch periods of speech in the time domain. J.Acoust.Soc.Am., vol. 46, pp. 442...448.
GRÜTZMACHER M., LOTTERMOSER W. (1937): Über ein Verfahren zur trägheitsfreien Aufzeichnung von Melodiekurven. Akustische Zeitschrift, vol. 2, pp. 242...248
HESS W.J. (1976): A pitch-synchronous digital feature extraction system for phonemic recognition of speech. IEEE-T-ASSP, vol. 24, pp. 14...25.
HESS W.J. (1979): Time-domain pitch period extraction of speech signals using three nonlinear digital filters. Proc.IEEE ICASSP-79, Washington DC, pp. 773...776
LEON P., MARTIN PH. (1969): Prolégomènes à l’étude des structures intonatives. Paris, Montréal: Didier; Studia Phonetica, No. 2, 225 pp.
MCGONEGAL C.A., RABINER L.R., ROSENBERG A.E. (1975): A semiautomatic pitch detector (SAPD). IEEE-T-ASSP, vol. 23, pp. 570...574
MCGONEGAL C.A., RABINER L.R., ROSENBERG A.E. (1977): A subjective evaluation of pitch detection methods using LPC synthesized speech. IEEE-T-ASSP, vol. 25, pp. 221...229
MCKINNEY N.P. (1965): Laryngeal frequency analysis for linguistic research. Ann Arbor MI: Communic. Sciences Lab., Univ. of Michigan; Res.Rept. No. 14, 340 pp.
MILLER N.J. (1975): Pitch detection by data reduction. IEEE-T-ASSP, vol. 23, pp. 72...79
MILLER R.L. (1970): Performance characteristics of an experimental harmonic identification pitch extraction system (HIPEX). J.Acoust.Soc.Am., vol. 47, pp. 1593...1601
MOORER J.A. (1974): The optimum comb method of pitch period analysis of continuous digitized speech. IEEE-T-ASSP, vol. 22, pp. 330...338
NOLL A.M. (1967): Cepstrum Pitch Determination. J.Acoust.Soc.Am., vol. 41, pp. 293...309.
NOLL A.M. (1970): Pitch determination of human speech by the harmonic product spectrum, the harmonic sum spectrum, and a maximum likelihood estimate. Microwave Inst.Conf.Proc. — Symposium on computer processing in communications, april 1969, vol. 19, pp. 779...797
RABINER L.R. (1977): On the use of autocorrelation analysis for pitch detection. IEEE-T-ASSP, vol. 25, pp. 24...33
RABINER L.R., CHENG M.J., ROSENBERG A.E., MCGONEGAL A. (1976): A comparative study of several pitch detection algorithms. IEEE-T-ASSP, vol. 24, pp. 399...413
REDDY D.R. (1967): Pitch period determination of speech sounds. Comm.ACM, vol. 10, pp. 343...348
RISBERG A., MÖLLER A., FUJISAKI H. (1960): Voice Fundamental Frequency Tracking. STL-QPSR, No. 1, pp. 3...5
ROSS M.J., SHAFFER H.L., COHEN A., FREUDBERG R., MANLEY H.J. (1974): Average magnitude difference function pitch extractor. IEEE-T-ASSP, vol. 22, pp. 353...361
SCHROEDER M.R. (1968): Period histogram and product spectrum: New methods for fundamental-frequency measurement. J.Acoust. Soc.Am., vol. 43, pp. 829...834.
SENEFF S. (1978): Real-time harmonic pitch detector. IEEE-T-ASSP, vol. 26, pp. 358...364
SONDHI M.M. (1968): New methods of pitch extraction. IEEE-T-AU, vol. 16, pp. 262...266
TERHARDT E. (1979): Calculating virtual pitch. Hearing Research, vol. 1.
TILLMANN H.G. (1978): Bestimmung der Stimmperiode in der Zeitfunktion des digitalen Sprachschallsignals. Forschungsberichte des IPSK, München, vol. 9, pp. 207...213
TUCKER W.H., BATES R.T.H. (1978): A pitch estimation algorithm for speech and music. IEEE-T-ASSP, vol. 26, pp. 597...604
UN C.K., YANG S.C. (1977): A Pitch Extraction Algorithm Based on LPC Inverse Filtering and AMDF. IEEE-T-ASSP, vol. 25, pp. 565...572
WINCKEL F. (1963): Tonhöhenextraktor zur Messung und Steuerung von Stimme und Sprache. Arch.Ohren-Nasen-Kehlkopfheilkunde, vol. 182, pp. 651...655.
WISE J.D., CAPRIO J.R., PARKS T.W. (1976): Maximum likelihood pitch estimation. IEEE-T-ASSP, vol. 24, pp. 418...423
ZURCHER J.F. (1977): La mesure du fondamental par la détection de crêtes. Techniques employés, résultats. Proc.Journ. Etud. Parole (Aix-en-P.), vol. 8, pp. 119...126.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1980 D. Reidel Publishing Company
About this paper
Cite this paper
Hess, W.J. (1980). Pitch Determination of Speech Signals — A Survey. In: Simon, J.C. (eds) Spoken Language Generation and Understanding. NATO Advanced Study Institutes Series, vol 59. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-9091-3_12
Download citation
DOI: https://doi.org/10.1007/978-94-009-9091-3_12
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-009-9093-7
Online ISBN: 978-94-009-9091-3
eBook Packages: Springer Book Archive