Abstract
This paper uses a sinusoidal representation of speech together with a cochlear model to extract speech parameters, and develops a speech analysis/synthesis system controlled by the auditory spectrum. Computer simulation shows that speech can be synthesized with only 12 parameters per frame on average. The method offers few parameters, low complexity, and high-performance speech representation, and the synthetic speech is highly intelligible.
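For readers unfamiliar with the term, a sinusoidal representation models each speech frame as a sum of sinusoids, s(n) = Σ A_k cos(ω_k n + φ_k). The sketch below illustrates only that general idea. It is not the paper's system: the function name `synthesize_frame`, the (amplitude, frequency, phase) layout, the 8 kHz sampling rate, and the three example partials are all assumptions made for illustration. The cochlear-model analysis stage is not shown.

```python
import math


def synthesize_frame(partials, num_samples, sample_rate=8000.0):
    """Synthesize one frame as a sum of sinusoids.

    partials: list of (amplitude, frequency_hz, phase_rad) triples.
    This layout is hypothetical and is not the paper's parameter set.
    """
    frame = []
    for n in range(num_samples):
        t = n / sample_rate  # time of sample n in seconds
        sample = sum(
            a * math.cos(2.0 * math.pi * f * t + p) for a, f, p in partials
        )
        frame.append(sample)
    return frame


# Illustrative values only; none of these numbers come from the paper.
example_partials = [(1.0, 200.0, 0.0), (0.5, 400.0, 0.0), (0.25, 600.0, 0.0)]
example_frame = synthesize_frame(example_partials, num_samples=160)
```

In a full analysis/synthesis system the partial parameters would be estimated from the input speech frame by frame. Here they are fixed by hand so that the synthesis step can be read in isolation.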
Additional information
Project supported by the National Natural Science Foundation of China (69501007)
About this article
Cite this article
Yuan, J., Wan, W. & Yu, X. Application of cochlear model in speech analysis/synthesis using sinusoidal representation. J. of Shanghai Univ. 3, 47–52 (1999). https://doi.org/10.1007/s11741-999-0028-1
DOI: https://doi.org/10.1007/s11741-999-0028-1