Skip to main content

Optimization of Pitch Tracking and Quantization

  • Conference paper
  • First Online:
Book cover Speech and Computer (SPECOM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9319))

Included in the following conference series:

Abstract

The article presents the results of the research focused on procedures for allocating and quantizing values of the pitch. Corresponding optimization tasks are set and accomplished. The results of experimental study of the developed algorithm for determining the pitch lag and its optimal quantizer are presented. The gain in noise immunity and signal to noise ratio compared to the known solutions is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ayuso, A.J.R., Soler, J.M.L. (eds.): Speech Recognition and Coding: New Advances and Trends. Nato ASI Subseries F: vol. 147. Springer, Heidelberg (1995)

    MATH  Google Scholar 

  2. Azarov, E., Vashkevich, M.I., Likhachov, D.S., Petrovsky, A.A.: Pitch modification of speech signal using harmonic model with time-varying parameters. Tr. SPIIRAN 32, 5–26 (2014)

    Google Scholar 

  3. Babkin, V.: Basic channels of interpersonal communication and their projection on the infocommunications systems. In: 7th International Conference of Moscow Institute of Control Sciences RAS, pp. 175–178. Moscow Institute of Control Sciences RAS (2005)

    Google Scholar 

  4. Basov, O.O., Nosov, M.V., Shalaginov, V.A.: Pitch-jitter analysis of the speech signal. Tr. SPIIRAN 32, 27–44 (2014)

    Google Scholar 

  5. Basov, O.O., Saitov, I.A.: Basic channels of interpersonal communication and their projection on the infocommunications systems. Tr. SPIIRAN 30, 122–140 (2013)

    Google Scholar 

  6. Basov, O.: Principles of construction of polymodal info-communication systems based on multimodal architectures of subscriber’s terminals. Tr. SPIIRAN 2(39), 109–122 (2015)

    Google Scholar 

  7. Chu, W.C.: Speech Coding Algorithms: Foundation and Evolution of Standardized Coders. Wiley, Hoboken (2004)

    Google Scholar 

  8. Huang, X., Acero, A., Hon, H.W.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River (2001). Foreword By-Reddy, R

    Google Scholar 

  9. Kocharov, D., Skrelin, P., Volskaya, N.: F0 declination patterns in russian. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 217–226. Springer, Heidelberg (2014)

    Google Scholar 

  10. Kondaurova, M.V., Francis, A.L.: The relationship between native allophonic experience with vowel duration and perception of the english tense/lax vowel contrast by spanish and russian listeners. J. Acoust. Soc. Am. 124(6), 3959–3971 (2008)

    Article  Google Scholar 

  11. Makarova, V.: Perceptual correlates of sentence-type intonation in russian and japanese. J. Phonetics 29(2), 137–154 (2001)

    Article  Google Scholar 

  12. Max, J.: Quantizing for minimum distortion. IRE Trans. Inf. Theory 6(1), 7–12 (1960)

    Article  MathSciNet  Google Scholar 

  13. McCree, A., Stachurski, J., Unno, T., Ertan, E., Paksoy, E., Viswanathan, V., Heikkinen, A., Rämö, A., Himanen, S., Blöcher, P., et al.: A 4 kb/s hybrid melp/celp speech coding candidate for ITU standardization. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 629–632. IEEE (2002)

    Google Scholar 

  14. Meshcheryakov, R., Bondarenko, V.: Dialogue as a basis for construction of speech systems. Cybern. Syst. Anal. 44(2), 175–184 (2008)

    Article  MATH  Google Scholar 

  15. Prokhorov, Y.: Statistical models and recurrent prediction of speech signals. Radio and Communications (1984)

    Google Scholar 

  16. Ramishvili, G.: Automatic speaker recognition by voice. Radio and Communications (1981)

    Google Scholar 

  17. Ronzhin, A.L., Budkov, V.Y., Karpov, A.A.: Multichannel system of audio-visual support of remote mobile participant at e-meeting. In: Balandin, S., Dunaytsev, R., Koucheryavy, Y. (eds.) ruSMART 2010. LNCS, vol. 6294, pp. 62–71. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  18. Ronzhin, A., Budkov, V., Kipyatkova, I.: Parad-r: speech analysis software for meeting support. In: 9th International Conference on Information, Communications and Signal Processing (ICICS), pp. 1–4. IEEE (2013)

    Google Scholar 

  19. Ronzhin, A., Budkov, V.: Speaker turn detection based on multimodal situation analysis. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 302–309. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  20. Vary, P., Martin, R.: Digital Speech Transmission: Enhancement, Coding and Error Concealment. Wiley, Chichester (2006)

    Book  Google Scholar 

Download references

Acknowledgments

This work is partially supported by the Russian Foundation for Basic Research (grants № 15-07-06744-a,13-08-0741-a).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrey Ronzhin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Basov, O., Ronzhin, A., Budkov, V. (2015). Optimization of Pitch Tracking and Quantization. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23132-7_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23131-0

  • Online ISBN: 978-3-319-23132-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics