Abstract
The paper addresses the accuracy of approximate solutions to the inverse problem of retrieving the shape of the voice source from a speech signal with a known signal-to-noise ratio (SNR). It is shown that if the source is sought as a function of time by the Tikhonov regularization method, the error of the resulting approximation is an order of magnitude larger than the error of the recorded speech signal. In contrast, an adequate parameterization of the source yields an approximate solution whose accuracy is comparable with that of the problem data. A corresponding algorithm is considered. Estimates of the accuracy of the approximate parametric solution that are linear in the data errors make it possible to choose the parametric models with the best accuracy. Such a comparison has been carried out for two known voice source models, model [17] and the LF model [18], and the advantages of the latter are shown. For SNR = 40 dB, the relative error of the approximate solution found with this algorithm is about 1% for the LF model and about 2% for model [17], compared with 7–8% for the regularization method. The role of the obtained accuracy estimates in speaker identification problems is discussed.
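The comparison with "the accuracy of the problem data" can be made concrete with a short calculation. The sketch below is not from the paper: it assumes the data error is measured as the ratio of noise norm to signal norm, so that SNR in dB equals 20 log10 of the inverse of that ratio. The function name `snr_db_to_relative_error` is hypothetical, and the value 0.075 is simply the midpoint of the 7–8% range quoted in the abstract.

```python
def snr_db_to_relative_error(snr_db: float) -> float:
    """Relative data error (noise norm / signal norm) for an SNR given in dB.

    Assumes the amplitude convention SNR_dB = 20 * log10(signal / noise).
    """
    return 10.0 ** (-snr_db / 20.0)


# Data error implied by the SNR used in the abstract (40 dB).
data_error = snr_db_to_relative_error(40.0)

# Relative solution errors quoted in the abstract; 0.075 is the midpoint
# of the reported 7-8% range (an illustrative choice, not a reported value).
reported = {
    "LF model [18]": 0.01,
    "model [17]": 0.02,
    "Tikhonov regularization": 0.075,
}

for name, err in reported.items():
    ratio = err / data_error
    print(f"{name}: {err:.1%} solution error, {ratio:.1f} times the data error")
```

Under this assumption, 40 dB corresponds to a data error of about 1%, so the LF model reaches roughly the accuracy of the data, while the regularized solution is several times less accurate.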
References
1. V. N. Sorokin, Speech inversion: Problems and solutions, in Dynamics of Speech Production and Perception, Ed. by P. Divenyi, S. Greenberg, and G. Meyer (IOS, 2006), p. 263, [in Russian].
2. V. N. Sorokin, Speech inversion in physiology and technology, in Speech Communication at the Leading Edge, Ed. by R. H. Yablonski (Nova Science, 2008), [in Russian].
3. A. S. Leonov and V. N. Sorokin, Dokl. Mathem. 84, 740 (2011).
4. A. S. Leonov and V. N. Sorokin, Dokl. Mathem. 85, 432 (2012).
5. A. N. Tikhonov and V. Ya. Arsenin, Methods of Solution of Ill-Posed Problems (Nauka, Moscow, 1979), 2nd ed., [in Russian].
6. M. M. Lavrent’ev, Some Ill-Posed Problems of Mathematical Physics (Sib. Otd. Akad. Nauk SSSR, Novosibirsk, 1962), [in Russian].
7. V. K. Ivanov, V. V. Vasin, and V. P. Tanana, Theory of Linear Ill-Posed Problems and Its Applications (Nauka, Moscow, 1978), [in Russian].
8. G. M. Vainikko and A. Yu. Veretennikov, Iteration Procedures in Ill-Posed Problems (Nauka, Moscow, 1986), [in Russian].
9. A. B. Bakushinskii and A. V. Goncharskii, Iteration Methods of Ill-Posed Problem Solution (Nauka, Moscow, 1989), [in Russian].
10. J. L. Flanagan, Analysis, Synthesis and Perception of Speech (Svyaz’, Moscow, 1968), [in Russian].
11. D. Kewley-Port and C. S. Watson, J. Acoust. Soc. Am. 95, 485 (1994).
12. V. N. Sorokin and V. P. Trifonenkov, Acoust. Phys. 42, 368 (1996).
13. G. K. Vallabha and B. Tuller, Speech Commun. 38, 141 (2002).
14. A. S. Leonov, Solution of Ill-Posed Inverse Problems. Review of Theory, Practical Algorithms and Demonstrations in MATLAB (URSS, Moscow, 2010), [in Russian].
15. V. A. Yurko, Introduction into Inverse Spectral Problems (Nauka, Moscow, 2007), [in Russian].
16. V. A. Morozov, Regular Methods for Solving of Ill-Posed Problems (Nauka, Moscow, 1987), [in Russian].
17. T. Ananthapadmanabha, Speech Trans. Lab.-Quarterly Progress Status Report 2–3, 1 (1984).
18. G. Fant, J. Liljencrants, and Q. A. Lin, Speech Trans. Lab.-Quarterly Progress Status Report 4, 1 (1985).
19. A. S. Leonov, Comput. Mathem. Mathem. Phys. 54, 575 (2014).
20. A. N. Tikhonov, Dokl. Akad. Nauk SSSR 39, 195 (1943).
21. T. G. Kolda, R. M. Lewis, and V. Torczon, SIAM Review 45, 385 (2003).
22. D. H. Klatt, J. Acoust. Soc. Am. 82, 737 (1987).
23. P. Milenkovic, J. Acoust. Soc. Am. 93, 1087 (1993).
24. D. Childers and H. Hu, J. Acoust. Soc. Am. 96, 2026 (1994).
25. G. Fant, Speech Trans. Lab.-Quarterly Progress Status Report 1, 85 (1979).
26. J. Schoentgen, Speech Commun. 11, 499 (1992).
27. J. Schoentgen, J. Acoust. Soc. Am. 114, 2906 (2003).
28. E. Rank and G. Kubin, Speech Commun. 48, 775 (2006).
29. I. R. Titze, The Myoelastic Aerodynamic Theory of Phonation (National Center for Voice and Speech, Iowa City, 2006).
30. A. S. Leonov and V. N. Sorokin, Acoust. Phys. 60, 323 (2014).
Original Russian Text © A.S. Leonov, V.N. Sorokin, 2014, published in Akusticheskii Zhurnal, 2014, Vol. 60, No. 6, pp. 656–662.
Cite this article
Leonov, A.S., Sorokin, V.N. Accuracy in determining voice source parameters. Acoust. Phys. 60, 687–693 (2014). https://doi.org/10.1134/S1063771014050078
DOI: https://doi.org/10.1134/S1063771014050078