Advertisement

International Journal of Speech Technology

, Volume 10, Issue 2–3, pp 89–94 | Cite as

Parameters evaluation of SOLA algorithm for time scale modification

  • Zhou JunEmail author
  • Tan Wei
  • Chen Yanpu
  • Gao Yue
Article
  • 123 Downloads

Abstract

Successful operation of the Synchronous Overlap and Add (SOLA) algorithm for Time Scale Modification (TSM) of speech is closely tied to the proper choice of parameters. This paper investigates the quality of time scale modified speech under different values of primary parameters. Based on Mean Opinion Score (MOS) tests and Bark Spectral Distortion (BSD) measure, the proper choices of synthesis shift (Ss) and the duration of the shift search interval (K max ) are given experimentally. The conclusions can be helpful for operating the SOLA algorithm for time scale modification of speech.

Keywords

Speech signal Time scale modification SOLA Quality measure 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wong, W., & Au, O. C. (2002). Fast SOLA-based time scale modification using modified envelope matching[A]. In: Proc. of IEEE international conference on acoustics, speech and signal processing[C] (pp. 3188–3191). Orlando, FL. Google Scholar
  2. Roucos, S., & Wilgus, A. M. (1985). High quality time scale modification for speech. In: Proc. IEEE int. conf. acoustics, speech., signal processing (vol. 1, pp. 493–496). Google Scholar
  3. Griffin, D. W., & Lin, J. S. (1984). Signal estimation from modified short-time Fourier transform. IEEE Trans. Acoust., Speech, Signal Processing, ASSP-32(2), 236–243. CrossRefGoogle Scholar
  4. McAulay, R. J., & Quatieri, T. F. (1986). Speech analysis–synthesis based on a sinusoidal representation. IEEE Trans. Acoust., Speech, Signal Prosess., ASSP-34, 744–754. CrossRefGoogle Scholar
  5. Du, S.-F. (2005). Adaptive synchronous overlap and add algorithm for time scale modification of speech (In Chinese). Google Scholar
  6. Su, Y. (1997), A novel approach for hi-fi audio signal processing. Patent No. CN 1145519A (In Chinese). Google Scholar
  7. Hejna, D. J. (1990). Real-time time-scale modification of speech via the synchronized overlap-add algorithm. Master Thesis Google Scholar
  8. Chen, Y.-P. (2001). Study on auditory perception and its applications in speech enhancement. PhD Thesis (In Chinese). Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  1. 1.The Army Aviation Research Institute of Headquarters of General StaffBeiJingChina
  2. 2.Electronic and Information LabsXi’an Communications CollegeXi’anChina

Personalised recommendations