Skip to main content
Log in

Accuracy in determining voice source parameters

  • Acoustic Signal Processing. Computer Simulation
  • Published:
Acoustical Physics Aims and scope Submit manuscript

Abstract

The paper addresses the accuracy of an approximate solution to the inverse problem of retrieving the shape of a voice source from a speech signal for a known signal-to-noise ratio (SNR). It is shown that if the source is found as a function of time with the A.N. Tikhonov regularization method, the accuracy of the found approximation is worse than the accuracy of speech signal recording by an order of magnitude. In contrast, adequate parameterization of the source ensures approximate solution accuracy comparable with the accuracy of the problem data. A corresponding algorithm is considered. On the basis of linear (in terms of data errors) estimates of approximate parametric solution accuracy, parametric models with the best accuracy can be chosen. This comparison has been carried out for the known voice source models, i.e., model [17] and the LF model [18]. The advantages of the latter are shown. Thus, for SNR = 40 dB, the relative accuracy of an approximate solution found with this algorithm is about 1% for the LF model and about 2% for model [17] as compared to an accuracy of 7–8% in the regularization method. The role of accuracy estimates found in speaker identification problems is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. V. N. Sorokin, Speech inversion: Problems and solutions, in Dynamics of Speech Production and Perception, Ed. by P. Divenyu, S. Greenberg, and G. Meyer, (IOS, 2006), p. 263, [in Russian].

    Google Scholar 

  2. V. N. Sorokin, Speech inversion in physiology and technology, in Speech Communication at the Leading Edge, Ed. by R. H. Yablonski, (Nova Science, 2008), [in Russian].

    Google Scholar 

  3. A. S. Leonov and V. N. Sorokin, Dokl. Mathem. 84, 740 (2011).

    Article  MATH  MathSciNet  Google Scholar 

  4. A. S. Leonov and V. N. Sorokin, Dokl. Mathem. 85, 432 (2012).

    Article  MATH  MathSciNet  Google Scholar 

  5. A. N. Tikhonov and V. Ya. Arsenin, Methods of Solution of Ill-Posed Problems, (Nauka, Moscow, 1979), 2nd ed., [in Russian].

    MATH  Google Scholar 

  6. M. M. Lavrent’ev, Some Ill-Posed Problems of Mathematical Physics, (Sib. Otd. Akad. Nauk SSSR, Novosibirsk, 1962), [in Russian].

    Google Scholar 

  7. V. K. Ivanov, V. V. Vasin, and V. P. Tanana, Theory of Linear Ill-Posed Problems and Its Applications, (Nauka, Moscow, 1978), [in Russian].

    MATH  Google Scholar 

  8. G. M. Vainikko and A. Yu. Veretennikov, Iteration Procedures in Ill-Posed Problems, (Nauka, Moscow, 1986), [in Russian].

    Google Scholar 

  9. A. B. Bakushinskii and A. V. Goncharskii, Iteration Methods of Ill-Posed Problem Solution, (Nauka, Moscow, 1989), [in Russian].

    Google Scholar 

  10. D. Flanagan, Analysis, Synthesis and Perception of Speech, (Svyaz’, Moscow, 1968), [in Russian].

    Google Scholar 

  11. D. Kewly-Port and C. S. Watson, J. Acoust. Soc. Am. 95, 485 (1994).

    Article  ADS  Google Scholar 

  12. V. N. Sorokin and V. P. Trifonenkov, Acoust. Phys. 42, 368 (1996).

    ADS  Google Scholar 

  13. G. K. Vallabha and B. Tuller, Speech Commun. 38, 141 (2002).

    Article  MATH  Google Scholar 

  14. A. S. Leonov, Solution of Ill-Posed Inverse Problems. Review of Theory, Practical Algorithms and Demonstrations in MatnLab (URSS, Moscow, 2010), [in Russian].

    Google Scholar 

  15. V. A. Yurko, Introduction into Inverse Spectral Problems (Nauka, Moscow, 2007), [in Russian].

    Google Scholar 

  16. V. A. Morozov, Regular Methods for Solving of Ill-Posed Problems (Nauka, Moscow, 1987), [in Russian].

    Google Scholar 

  17. T. Ananthapadmanabha, Speech Trans. Lab.-Quarterly Progress Status Report 2–3, 1 (1984).

    Google Scholar 

  18. G. Fant, J. Liljencrants, and Q. A. Lin, Speech Trans. Lab.-Quarterly Progress Status Report 4, 1 (1985).

    Google Scholar 

  19. A. S. Leonov, Compt. Mathem. Mathem. Phys. 54, 575 (2014).

    Article  Google Scholar 

  20. A. N. Tikhonov, Dokl. Akad. Nauk SSSR 39, 195 (1943).

    MathSciNet  Google Scholar 

  21. T. G. Kolda, R. M. Lewis, and V. Torczon, SIAM Review 45, 385 (2003).

    Article  ADS  MATH  MathSciNet  Google Scholar 

  22. D. H. Klatt, JASA 82, 737 (1987).

    Article  Google Scholar 

  23. P. Milenkovic, J. Acoust. Soc. Am. 93, 1087 (1993).

    Article  ADS  Google Scholar 

  24. D. Childers and H. Hu, J. Acoust. Soc. Am. 96, 2026 (1994).

    Article  ADS  Google Scholar 

  25. G. Fant, Speech Trans. Lab.-Quarterly Progress Status Report 1, 85 (1979).

    Google Scholar 

  26. J. Schoentgen, Speech Commun. 11, 499 (1992).

    Article  Google Scholar 

  27. J. Schoentgen, J. Acoust. Soc. Am. 114, 2906 (2003).

    Article  ADS  Google Scholar 

  28. E. Rank and G. Kubin, Speech Commun. 48, 775 (2006).

    Article  Google Scholar 

  29. I. R. Titze, The Myoelastic Aerodynamic Theory of Phonation (National Center for Voice and Speech, Iowa City, 2006).

    Google Scholar 

  30. A. S. Leonov and V. N. Sorokin, Acoust. Phys. 60, 323 (2014).

    Article  ADS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. S. Leonov.

Additional information

Original Russian Text © A.S. Leonov, V.N. Sorokin, 2014, published in Akusticheskii Zhurnal, 2014, Vol. 60, No. 6, pp. 656–662.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Leonov, A.S., Sorokin, V.N. Accuracy in determining voice source parameters. Acoust. Phys. 60, 687–693 (2014). https://doi.org/10.1134/S1063771014050078

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1063771014050078

Keywords

Navigation