Abstract
The paper considers the inverse problem of recovering the shape of a voice-source pulse from a given segment of a speech signal, using a mathematical model that relates the two. A variational method is proposed for solving this inverse problem for two new parametric classes of sources: a piecewise-linear source and an A-source. The error of the resulting approximate solutions is analyzed, and a technique for estimating it numerically is proposed, based on the theory of a posteriori accuracy estimates for solutions of ill-posed problems. Computer experiments on various types of voice signals were used to assess both the adequacy of the proposed source models and the behavior of the a posteriori accuracy estimates for the corresponding inverse problems. For speech signals, the numerical experiments showed that these a posteriori estimates, which are upper bounds on the possible error of the inverse-problem solution, behave satisfactorily. For the speech material investigated, the estimated most probable error in determining the source-pulse shape averages about 7%. The a posteriori accuracy estimates can also serve as a quality criterion for the recovered voice-source pulse shape in the speaker-identification problem.
Additional information
Original Russian Text © A.S. Leonov, V.N. Sorokin, 2017, published in Akusticheskii Zhurnal, 2017, Vol. 63, No. 5, pp. 532–545.
About this article
Cite this article
Leonov, A.S., Sorokin, V.N. Upper bound of errors in solving the inverse problem of identifying a voice source. Acoust. Phys. 63, 570–582 (2017). https://doi.org/10.1134/S1063771017050074
DOI: https://doi.org/10.1134/S1063771017050074