
Complexity Reduction of WSOLA-Based Time-Scale Modification Using Signal Period Estimation

  • Duk Su Kim
  • Young Han Lee
  • Hong Kook Kim
  • Song Ha Choi
  • Ji Woon Kim
  • Myeong Bo Kim
Part of the Communications in Computer and Information Science book series (CCIS, volume 120)

Abstract

In this paper, we propose a method that uses signal period estimation to reduce the computational complexity of a waveform similarity overlap-and-add (WSOLA) based time-scale modification (TSM) algorithm. In the proposed method, the signal period is estimated from the normalized cross-correlation. The optimal shift of WSOLA for the current frame, that is, the point of maximum similarity, is then predicted from the period estimated in the previous frame. The normalized cross-correlation is computed only over a reduced search range around this predicted shift rather than over the full search range. This lowers the cost of the normalized cross-correlation computations, which account for most of the complexity of WSOLA. Experiments show that the proposed method achieves a relative complexity reduction of 56% for the WSOLA-based TSM algorithm while maintaining speech quality.
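The idea of searching only near a predicted shift can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the period estimator or the rule that maps a period to a predicted shift, so `estimate_period` and `predict_shift` below are assumptions made for illustration, as are all function names, the synthetic sine input, and the window sizes.

```python
import math


def ncc(x, a, b, n):
    """Normalized cross-correlation of x[a:a+n] and x[b:b+n]."""
    num = e_a = e_b = 0.0
    for i in range(n):
        num += x[a + i] * x[b + i]
        e_a += x[a + i] ** 2
        e_b += x[b + i] ** 2
    denom = math.sqrt(e_a * e_b)
    return num / denom if denom > 0.0 else 0.0


def search(x, template, center, shifts, n):
    """Return (best_shift, ncc_evaluations) over the candidate shifts."""
    best, best_score, evals = None, -2.0, 0
    for k in shifts:
        start = center + k
        if start < 0 or start + n > len(x):
            continue  # candidate segment would fall outside the signal
        score = ncc(x, template, start, n)
        evals += 1
        if score > best_score:
            best, best_score = k, score
    return best, evals


def estimate_period(x, start, n, min_lag, max_lag):
    """Assumed estimator: the lag whose segment best matches x[start:start+n]."""
    lags = range(min_lag, max_lag + 1)
    return max(lags, key=lambda lag: ncc(x, start, start + lag, n))


def predict_shift(template, center, period):
    """Assumed predictor: the shift closest to zero that places the candidate
    a whole number of periods away from the template."""
    half = period // 2
    return (template - center + half) % period - half


def demo():
    # Synthetic periodic input (period of 50 samples), chosen for illustration.
    x = [math.sin(2.0 * math.pi * i / 50.0) for i in range(2000)]
    n, template, center = 100, 300, 510
    period = estimate_period(x, template, n, 20, 80)
    full = search(x, template, center, range(-24, 25), n)
    k0 = predict_shift(template, center, period)
    # Reduced search: only a few candidates around the predicted shift.
    reduced = search(x, template, center, range(k0 - 3, k0 + 4), n)
    return period, full, reduced
```

Calling `demo()` returns the estimated period together with the `(best_shift, ncc_evaluations)` pairs for the full and the reduced search. On this toy signal both searches find the same shift, and the reduced search evaluates far fewer candidates. The evaluation counts depend entirely on the window sizes chosen here and say nothing about the 56% figure reported in the abstract.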

Keywords

Time-scale modification, WSOLA, complexity reduction, signal period estimation

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Duk Su Kim (1)
  • Young Han Lee (1)
  • Hong Kook Kim (1)
  • Song Ha Choi (2)
  • Ji Woon Kim (2)
  • Myeong Bo Kim (2)
  1. School of Information and Communications, Gwangju Institute of Science and Technology, Gwangju, Korea
  2. Camcorder Business Team, Digital Media Business, Samsung Electronics, Suwon-si, Korea
