Complexity Reduction of WSOLA-Based Time-Scale Modification Using Signal Period Estimation
In this paper, we propose a computational complexity reduction method for a waveform similarity overlap-and-add (WSOLA) based time-scale modification (TSM) algorithm using signal period estimation. In the proposed method, a signal period is estimated from the normalized cross-correlation. An optimal shift, a maximally similar point, of WSOLA for the current frame can be estimated from the estimated period obtained from the previous frame. Then, we reduce the search range for calculating the normalized cross-correlation around the estimated optimal shift instead of calculating for the full search range. In this manner, we can reduce the computational complexity required for normalized cross-correlations, which dominates most of the complexity in WSOLA. It is shown from experiments that the proposed method gives a relative complexity reduction of 56% for the WSOLA-based TSM algorithm while maintaining speech quality.
KeywordsTime-scale modification WSOLA complexity reduction signal period estimation
Unable to display preview. Download preview PDF.
- 4.Roucos, S., Wilgus, A.: High quality time-scale modification of speech. In: Proceedings of ICASSP, pp. 236–239 (1985)Google Scholar
- 5.Verhelst, W., Roelands, M.: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech. In: Proceedings of ICASSP, pp. 554–557 (1985)Google Scholar
- 6.Choi, W.Y.: Audio signal time-scale modification method using variable length synthesis and reduced cross-correlation computations. U.S. Patent Application 2005/0273321 (2005)Google Scholar