Skip to main content
Log in

An Algorithm for Measurement of the Pitch Frequency of Speech Signals Based on Complementary Ensemble Decomposition Into Empirical Modes

  • Published:
Measurement Techniques Aims and scope

The problem of increasing the precision with which the pitch frequency of speech signals is measured is considered. Existing algorithms for determining this frequency are presented and a new algorithm based on complementary ensemble empirical mode decomposition is developed. The results of the investigations confirm the robustness of the algorithm in the presence of frequency modulation of the pitch of speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. V. G. Mikhailov and L.V. Zlatousova, Measurement of the Parameters of Speech, Radio i Syyaz, Moscow (1987).

    Google Scholar 

  2. A. Camacho and J. G. Harris, “A sawtooth waveform inspired pitch estimator for speech and music,” J. Acoust. Soc. Amer., 123, No. 4, No. 9, 1638–1652 (2008).

  3. E. G. Zhilyakov and A. A. Firsova, “Estimation of the period of the pitch of the sounds of human speech,” Nauch. Vedom. Belgor. Gos. Univ., Ser. Istor. Politol. Ekon. Informat., No. 1 (144), Iss. 25/1, 173–181 (2013).

  4. E. Azarov, M. Vashkevich, and A. Petrovsky, “Instantaneous pitch estimation based on RAPT framework,” in: Proc. 20th Europ. Signal Proc. Conf., EUSIPCO (2012), pp. 2787–2791.

  5. T. Abe, T. Kobayashi, and S. Imai, “Robust pitch estimation with harmonics enhancement in noisy environment based on instantaneous frequency,” in: Proc. ICSLP96 (1996), Vol. 2, pp. 1277–1290.

  6. M. K. Hasan, C. Shahnaz, and S. A. Fattah, “Determination of pitch of noisy speech using dominant harmonic frequency,” in: Proc. IEEE Int. Symp. Circuits and Systems (2003), Vol. 2, pp. 556–559.

  7. A. Chevigne and H. Kawahara, “YIN, a fundamental frequency estimator for speech and music,” J. Acoust. Soc. Amer., 111, No. 4, 1917–1930 (2002).

    Article  ADS  Google Scholar 

  8. J. R. Yeh, J. S. Shieh, and N. E. Huang, “Complementary ensemble empirical mode decomposition: A novel noise enhanced data analysis method,” Adv. Adapt. Data Analysis, 2, No. 2, 135–156 (2010).

    Article  MathSciNet  Google Scholar 

  9. A. K. Alimuradov and F. Sh. Murtazov, “Methods of increasing the efficiency of recognition of speech signals in voice control systems,” Izmer. Tekhn., No. 10, 20–24 (2015).

  10. A. K. Alimuradov and P. P. Churakov, “Noise-robust speech signals processing for the voice control system based on the complementary ensemble empirical mode decomposition,” in: Int. Sib. Conf. Control and Communications, SIBCON 2015, Omsk, Russia, May 21–23, 2015.

  11. N. E. Huang, S. Zheng, and R. L. Steven, “The empirical mode decomposition and the Hilbert spectrum for nonlinear and nonstationary time series analysis,” Proc. Roy. Soc. London. A, 454, 903–995 (1998).

    Article  ADS  MATH  Google Scholar 

  12. W. Zhaohua and N. E. Huang, “Ensemble empirical mode decomposition: A noise-assisted data analysis method,” Adv. Adapt. Data Analysis, 1, No. 1, 1–41 (2009).

    Article  Google Scholar 

  13. Sh. Bhawna and K. Sukhvinder, “Distnction between EMD & EEMD algorithm for pitch detection in speech processing,” Int. J. Eng. Trends and Technol., 7, No. 3, 119–125 (2014).

    Article  Google Scholar 

  14. G. Schlotthauer, M. E. Torres, and H. L. Rufi ner, “A new algorithm for instantaneous f0 speech extraction based on ensemble empirical mode decomposition,” in: 17th Europ. Signal Proc. Conf., EUSIPCO 2009, Glasgow, Scotland (2009), pp. 2347–2351.

  15. G. Priyanka and P. Mahendra Kumar, “Determine the pitch markers in speech signal using ensemble empirical mode decomposition,” Int. J. Adv. Res. Comp. Sci. and Software Eng., 2, No. 7, 90–96 (2012).

    Google Scholar 

  16. A. K. Alimuradov, “Investigation of frequency-selective properties of methods of decomposition into empirical modes for the purpose of estimating the pitch frequency of speech signals,” Tr. MFTI, 7, No. 3, 56–68 (2015).

    Google Scholar 

  17. X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing. Guide to Algorithms and System Development, Prentice-Hall, Upper Saddle River (2001).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. K. Alimuradov.

Additional information

Translated from Izmeritel’naya Tekhnika, No. 12, pp. 53–57, December, 2016.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alimuradov, A.K. An Algorithm for Measurement of the Pitch Frequency of Speech Signals Based on Complementary Ensemble Decomposition Into Empirical Modes. Meas Tech 59, 1316–1323 (2017). https://doi.org/10.1007/s11018-017-1135-1

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11018-017-1135-1

Keywords

Navigation