Abstract
This paper studies the phonetic approach to voice processing. We develop a method for automatic speech recognition in which each quasi-stationary segment of the signal is associated with a fuzzy set of phonemes. The fuzzy set of the input frame is combined with that of the nearest reference phoneme using the probabilistic triangular norm. Experiments show that the method reduces the probability of erroneous recognition by 1.5–5% compared with known analogues.
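The abstract does not say how the fuzzy sets are constructed or how the combined set is used afterwards. As a minimal sketch, assume each fuzzy set is a mapping from phoneme labels to membership grades in [0, 1], and take the probabilistic triangular norm to be the standard product t-norm T(a, b) = a·b. The function name, the phoneme labels, and the membership values below are hypothetical and serve only to illustrate the operation:

```python
def probabilistic_t_norm(mu_a, mu_b):
    """Combine two fuzzy sets element-wise with the product t-norm T(a, b) = a * b.

    Both arguments map phoneme labels to membership grades in [0, 1].
    This representation is an assumption made for illustration.
    """
    if mu_a.keys() != mu_b.keys():
        raise ValueError("both fuzzy sets must be defined over the same phonemes")
    for grades in (mu_a, mu_b):
        if any(not 0.0 <= g <= 1.0 for g in grades.values()):
            raise ValueError("membership grades must lie in [0, 1]")
    return {phoneme: mu_a[phoneme] * mu_b[phoneme] for phoneme in mu_a}


# Hypothetical membership grades for one input frame and for the
# nearest reference phoneme (illustrative values, not data from the paper).
frame = {"a": 0.5, "o": 0.25, "u": 0.0}
reference = {"a": 1.0, "o": 0.5, "u": 0.5}

combined = probabilistic_t_norm(frame, reference)

# Picking the phoneme with the largest combined grade is one possible
# decision rule; the abstract does not state which rule is actually used.
best = max(combined, key=combined.get)
```

The product t-norm keeps a phoneme's grade high only when both fuzzy sets support it, so a phoneme with zero membership in either set is excluded from the result.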
ACKNOWLEDGMENTS
The work was prepared within the framework of the Basic Research Program at the National Research University Higher School of Economics (HSE) and supported within the framework of a subsidy by the Russian Academic Excellence Project “5-100”.
Additional information
Translated by A. Ivanov
About this article
Cite this article
Savchenko, L.V., Savchenko, A.V. Fuzzy Phonetic Encoding of Speech Signals in Voice Processing Systems. J. Commun. Technol. Electron. 64, 238–244 (2019). https://doi.org/10.1134/S1064226919030173
DOI: https://doi.org/10.1134/S1064226919030173