Abstract
The article presents the results of the research focused on procedures for allocating and quantizing values of the pitch. Corresponding optimization tasks are set and accomplished. The results of experimental study of the developed algorithm for determining the pitch lag and its optimal quantizer are presented. The gain in noise immunity and signal to noise ratio compared to the known solutions is shown.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ayuso, A.J.R., Soler, J.M.L. (eds.): Speech Recognition and Coding: New Advances and Trends. Nato ASI Subseries F: vol. 147. Springer, Heidelberg (1995)
Azarov, E., Vashkevich, M.I., Likhachov, D.S., Petrovsky, A.A.: Pitch modification of speech signal using harmonic model with time-varying parameters. Tr. SPIIRAN 32, 5–26 (2014)
Babkin, V.: Basic channels of interpersonal communication and their projection on the infocommunications systems. In: 7th International Conference of Moscow Institute of Control Sciences RAS, pp. 175–178. Moscow Institute of Control Sciences RAS (2005)
Basov, O.O., Nosov, M.V., Shalaginov, V.A.: Pitch-jitter analysis of the speech signal. Tr. SPIIRAN 32, 27–44 (2014)
Basov, O.O., Saitov, I.A.: Basic channels of interpersonal communication and their projection on the infocommunications systems. Tr. SPIIRAN 30, 122–140 (2013)
Basov, O.: Principles of construction of polymodal info-communication systems based on multimodal architectures of subscriber’s terminals. Tr. SPIIRAN 2(39), 109–122 (2015)
Chu, W.C.: Speech Coding Algorithms: Foundation and Evolution of Standardized Coders. Wiley, Hoboken (2004)
Huang, X., Acero, A., Hon, H.W.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River (2001). Foreword By-Reddy, R
Kocharov, D., Skrelin, P., Volskaya, N.: F0 declination patterns in russian. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 217–226. Springer, Heidelberg (2014)
Kondaurova, M.V., Francis, A.L.: The relationship between native allophonic experience with vowel duration and perception of the english tense/lax vowel contrast by spanish and russian listeners. J. Acoust. Soc. Am. 124(6), 3959–3971 (2008)
Makarova, V.: Perceptual correlates of sentence-type intonation in russian and japanese. J. Phonetics 29(2), 137–154 (2001)
Max, J.: Quantizing for minimum distortion. IRE Trans. Inf. Theory 6(1), 7–12 (1960)
McCree, A., Stachurski, J., Unno, T., Ertan, E., Paksoy, E., Viswanathan, V., Heikkinen, A., Rämö, A., Himanen, S., Blöcher, P., et al.: A 4 kb/s hybrid melp/celp speech coding candidate for ITU standardization. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 629–632. IEEE (2002)
Meshcheryakov, R., Bondarenko, V.: Dialogue as a basis for construction of speech systems. Cybern. Syst. Anal. 44(2), 175–184 (2008)
Prokhorov, Y.: Statistical models and recurrent prediction of speech signals. Radio and Communications (1984)
Ramishvili, G.: Automatic speaker recognition by voice. Radio and Communications (1981)
Ronzhin, A.L., Budkov, V.Y., Karpov, A.A.: Multichannel system of audio-visual support of remote mobile participant at e-meeting. In: Balandin, S., Dunaytsev, R., Koucheryavy, Y. (eds.) ruSMART 2010. LNCS, vol. 6294, pp. 62–71. Springer, Heidelberg (2010)
Ronzhin, A., Budkov, V., Kipyatkova, I.: Parad-r: speech analysis software for meeting support. In: 9th International Conference on Information, Communications and Signal Processing (ICICS), pp. 1–4. IEEE (2013)
Ronzhin, A., Budkov, V.: Speaker turn detection based on multimodal situation analysis. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 302–309. Springer, Heidelberg (2013)
Vary, P., Martin, R.: Digital Speech Transmission: Enhancement, Coding and Error Concealment. Wiley, Chichester (2006)
Acknowledgments
This work is partially supported by the Russian Foundation for Basic Research (grants № 15-07-06744-a,13-08-0741-a).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Basov, O., Ronzhin, A., Budkov, V. (2015). Optimization of Pitch Tracking and Quantization. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-23132-7_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23131-0
Online ISBN: 978-3-319-23132-7
eBook Packages: Computer ScienceComputer Science (R0)