Abstract
Pitch is one of the most key parameter in speech coding, speech synthesis and so on, the traditional methods for pitch detection are prone to error at a low SNR at present. A pitch detection method based on pitch harmonic (PH) and the harmonic number based on PH is proposed in this paper. At first, the pitch harmonic is roughly estimated by pitch estimation filter with amplitude compression (PEFAC). Secondly, the weighted algorithm based on modified circular average magnitude difference function (MCAMDF) and pulse sequence is used to compute the pitch harmonic number. At last a pitch tracking method is applied to compute the pitch period candidates accurately. By simulation experiments, it is shown that the proposed pitch detection method has more accurate and more low algorithm complexity than the traditional methods at both high and low SNR.
XU. Jingyun—Project supported by the National Nature Science Foundation of China (No. 61271248), Natural Science Foundation of Huzhou City (No. 2015YZ04).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hong, W.: Low Bit Rate Speech Coding. National Defense Industry Press, Beijing (2005)
Gonzalez, S., Brookes, M.: PEFAC-a pitch estimation algorithm robust to high levels of noise. IEEE Trans. Audio Speech Lang. Process. 22(2), 518–530 (2014)
Jingyun, X., Xiaoqun, Z.: Voiced/unvoiced classification and pitch estimation based on amplitude compression filter. J. Electron. Inf. Technol. 38(3), 586–593 (2016)
Xu, J.D., Chang, L., Cui, H.J., et al.: A pitch period detection algorithm using time and frequency analyses. J. Tsinghua Univ. 52(3), 413–415, 420 (2012)
Shahnaz, C., Zhu, W.P., Omair, M.: Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme. IEEE Trans. Acoust. Speech Sig. Process. 20(1), 322–335 (2012)
Huang, F., Lee, T.: Pitch estimation in noisy speech using accumulated peak spectrum and sparse estimation technique. IEEE Trans. Audio Speech Lang. Process. 21(1), 99–109 (2013)
Byrne, D., Dillon, H., Tran, K., et al.: An international comparison of long term average speech spectra. J. Acoust. Soc. Am. 96(4), 2108–2120 (1994)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Jingyun, X., Xiaoqun, Z., Zhiduan, C. (2017). A Pitch Estimation Method Robust to High Levels of Noise. In: Xin-lin, H. (eds) Machine Learning and Intelligent Communications. MLICOM 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-319-52730-7_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-52730-7_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52729-1
Online ISBN: 978-3-319-52730-7
eBook Packages: Computer ScienceComputer Science (R0)