A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies

Thornburg, Harvey D.; Leistikow, Randal J.

doi:10.1007/978-3-540-31807-1_3

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3310)

Included in the following conference series:

International Symposium on Computer Music Modeling and Retrieval

Abstract

We propose a probabilistic pitch (f₀) estimator that remains robust under interference and low-SNR conditions, without the computational requirements of optimal time-domain methods. Our analysis is driven by sinusoidal peaks extracted by a windowed STFT. Given f₀ and a reference amplitude (A₀), peak frequency/amplitude observations are modeled probabilistically in order to be robust to undetected harmonics, spurious peaks, skewed peak estimates, and inherent deviations from ideal or other assumed harmonic structure. Parameters f₀ and A₀ are estimated by maximizing the observations' likelihood (here A₀ is treated as a nuisance parameter). Some previous spectral pitch estimation methods, most notably the work of Goldstein [3], introduce a probabilistic framework with a corresponding maximum likelihood approach. However, our method significantly extends the latter in order to guarantee robustness under adverse conditions, facilitating possible extensions to the polyphonic context. For instance, by addressing spurious as well as undetected peaks, our method averts a sudden breakdown under low-SNR conditions. Furthermore, our assimilation of peak amplitudes facilitates the incorporation of timbral knowledge. Our method utilizes a hidden, discrete-valued descriptor variable identifying spurious/undetected peaks. The likelihood evaluation, which requires a computationally unwieldy summation over all descriptor states, is successfully approximated by an MCMC traversal chiefly amongst high-probability states. The MCMC traversal obtains virtually identical evaluations for the entire likelihood surface at a fraction of the computational cost.
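The summation over descriptor states can be illustrated with a small toy model. The sketch below is not the authors' formulation: it ignores peak amplitudes and A₀ entirely, and the Gaussian frequency model, the constants `SIGMA`, `P_DETECT` and `SPUR_DENSITY`, and all function names are assumptions chosen for illustration. A descriptor state links each observed peak (sorted by frequency) either to a harmonic number or to "spurious"; harmonics left unlinked count as undetected. The exact likelihood sums over every order-preserving linkage, while the approximate version sums only over the distinct states evaluated by a simple Metropolis chain, which is one plausible reading of a traversal "chiefly amongst high-probability states".

```python
import math
import random
from itertools import combinations

# All numeric values are illustrative assumptions, not taken from the paper.
SIGMA = 5.0          # std. dev. (Hz) of a detected peak around its harmonic
P_DETECT = 0.8       # probability that a given harmonic produces a peak
SPUR_DENSITY = 1e-3  # density (per Hz) assigned to a spurious peak


def state_weight(assign, peaks, f0, n_harm):
    """Joint weight of the sorted peaks and one descriptor state.

    assign[i] is 0 if peak i is spurious, else the harmonic number it links to.
    """
    w, detected = 1.0, 0
    for y, k in zip(peaks, assign):
        if k == 0:
            w *= SPUR_DENSITY
        else:
            z = (y - k * f0) / SIGMA
            w *= P_DETECT * math.exp(-0.5 * z * z) / (SIGMA * math.sqrt(2 * math.pi))
            detected += 1
    # Harmonics with no linked peak are treated as undetected.
    return w * (1.0 - P_DETECT) ** (n_harm - detected)


def all_states(n_peaks, n_harm):
    """Yield every order-preserving linkage of peaks to harmonics."""
    for m in range(min(n_peaks, n_harm) + 1):
        for idx in combinations(range(n_peaks), m):
            for harms in combinations(range(1, n_harm + 1), m):
                assign = [0] * n_peaks
                for i, k in zip(idx, harms):
                    assign[i] = k
                yield tuple(assign)


def exact_likelihood(peaks, f0, n_harm):
    """Sum the state weights over all descriptor states."""
    return sum(state_weight(s, peaks, f0, n_harm)
               for s in all_states(len(peaks), n_harm))


def mcmc_likelihood(peaks, f0, n_harm, n_steps=2000, seed=0):
    """Sum weights over the distinct states a Metropolis chain evaluates."""
    rng = random.Random(seed)
    state = [0] * len(peaks)              # start with every peak spurious
    w = state_weight(state, peaks, f0, n_harm)
    seen = {tuple(state): w}
    for _ in range(n_steps):
        i = rng.randrange(len(peaks))
        # Harmonic numbers allowed for peak i given its neighbours' links.
        lo = max((k for k in state[:i] if k), default=0)
        hi = min((k for k in state[i + 1:] if k), default=n_harm + 1)
        prop = list(state)
        prop[i] = rng.choice([0] + list(range(lo + 1, hi)))
        w_new = state_weight(prop, peaks, f0, n_harm)
        seen[tuple(prop)] = w_new         # evaluated states count even if rejected
        if rng.random() * w < w_new:      # accept with probability min(1, w_new / w)
            state, w = prop, w_new
    return sum(seen.values())


def estimate_f0(peaks, grid, n_harm):
    """Grid-search maximum-likelihood f0 using the exact summation."""
    return max(grid, key=lambda f0: exact_likelihood(peaks, f0, n_harm))


if __name__ == "__main__":
    # Hypothetical peaks: harmonics 1, 2 and 4 of roughly 200 Hz plus one spurious peak.
    peaks = [201.0, 399.0, 530.0, 802.0]
    print(estimate_f0(peaks, range(100, 401), 5))
```

In this toy model the number of descriptor states for N peaks and K harmonics is the binomial coefficient C(N+K, N), so exhaustive summation quickly becomes expensive as either count grows. Because the chain only ever adds weights of states that the exact sum also contains, the approximate value never exceeds the exact one; how close it gets depends on how much of the total weight sits in the states the chain reaches.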

References

1. Cinlar, E.: Introduction to Stochastic Processes. Prentice-Hall, Englewood Cliffs (1975)
2. Fitzgerald, W.J.: Markov chain Monte Carlo methods with applications to signal processing. Elsevier Signal Processing 81(1), 3–18 (2001)
3. Goldstein, J.: An optimum processor theory for the central formation of the pitch of complex tones. J. Acoust. Soc. Amer. 54, 1496–1516 (1973)
4. Hory, C., Martin, N., Chehikian, A.: Spectrogram segmentation by means of statistical features for non-stationary signal interpretation. IEEE Trans. ASSP 50(12), 2915–2925 (2002)
5. Knuth, D., Vardi, I., Richberg, R.: 6581 (The asymptotic expansion of the middle binomial coefficient). American Mathematical Monthly 97(7), 626–630 (1990)
6. McAulay, R.J., Quatieri, T.F.: Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. ASSP 34(4), 744–754 (1986)
7. Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
8. Leistikow, R., Thornburg, H., et al.: Bayesian Identification of Closely-Spaced Chords from Single-Frame STFT Peaks. In: Proc. 7th International Conference on Digital Audio Effects (DAFx 2004), Naples (2004)

Author information

Authors and Affiliations

Center for Computer Research in Music and Acoustics (CCRMA), Department of Music, Stanford University, Stanford, CA, USA
Harvey D. Thornburg & Randal J. Leistikow

Editor information

Editors and Affiliations

Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campus 55, 5230, Odense M, Denmark
Uffe Kock Wiil

Cite this paper

Thornburg, H.D., Leistikow, R.J. (2005). A New Probabilistic Spectral Pitch Estimator: Exact and MCMC-approximate Strategies. In: Wiil, U.K. (eds) Computer Music Modeling and Retrieval. CMMR 2004. Lecture Notes in Computer Science, vol 3310. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31807-1_3

DOI: https://doi.org/10.1007/978-3-540-31807-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24458-5
Online ISBN: 978-3-540-31807-1
eBook Packages: Computer Science, Computer Science (R0)
