Skip to main content
Log in

A comprehensive model for voice activity in conversational speech-development and application to performance analysis of new-generation wireless communication systems

  • Published:
Wireless Networks Aims and scope Submit manuscript

Abstract

Proposed new wireless communication systems such as third generation cellular and PCN will utilize speech inferpolation, disconnecting the user from the spectral resource during pauses in speech in order to reduce radiated emissions and improve spectral efficiency. An accurate model of the on-off characteristics of conversational speech is thus necessary to analyze system performance, particularly if the system utilizes a time and/or frequency division multiple access technique. Previously developed speech activity models are deficient because they either do not reproduce short silent pauses of less than 200 ms. (representative of the silence gaps between syllables or words) or else they do not replicate the dynamics between the two conversing parties. Starting with the P.T. Brady model and developing appropriate modifications, this paper formulates a simple, accurate, comprehensive 8-state Markov model for voice activity in conversational speech. The new model can easily be incorporated into simulations or analyses assessing the performance of various new-generation wireless networks, thus improving the accuracy of the performance assessments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. P.T. Brady, A model for generating on-off speech patterns in two-way conversation, Bell Syst. Tech. J. (Sept. 1969) 2445–2472.

  2. D.J. Goodman, Trends in Cellular and Cordless Communications, IEEE Commun. Mag. (June 1991) 31–40.

  3. B.Z. Kobb, Personal wireless, IEEE Spectrum (June 1993) 20–25.

  4. K. Bullington and J.M. Fraser, Engineering aspects of TASI, Bell Syst. Tech. J. (Mar. 1959) 353–364.

  5. S.J. Campanella, Digital speech interpolation, Comsat Tech. Rev. (Spring 1976) 127–158.

  6. S. Mahmoud, W. Chan, S. Riordan and S. Aidrous, An integrated voice/data system for VHF/UHF radio, IEEE J. Select. Areas Commun. (Dec. 1983) 1098–1111.

  7. D.J. Goodman, R.A. Valenzuela, K.T. Gaylaird and B. Ramamurthi, Packet reservation multiple access for local wireless communications, IEEE Trans. Commun. (Aug. 1989) 885–890.

  8. H.P. Stern and H. Sobol, Design and performance analysis of an advanced, narrowband integrated voice/data mobile radio system, IEEE Trans. Commun. (Jan. 1995) 107–116.

  9. N. Amitay, Distributed switching and control with fast resource assignment/handoff for personal communication systems, IEEE J. Select. Areas Commun. (Aug. 1993) 842–849.

  10. R.L. Pickholtz, L.B. Milstein and D.L. Schilling, Spread spectrum for mobile communications, IEEE Trans. Veh. Techn. (May 1991) 313–321.

  11. J.G. Gruber, A comparison of measured and calculated temporal patterns relevant to speech activity detection, IEEE Trans. Commun. (April 1982) 728–738.

  12. H.P. Stern, Design and performance analysis of an advanced, narrowband, integrated voice/data mobile radio system, Doctoral Dissertation, Univ. Texas, Arlington, TX (Aug. 1991).

    Google Scholar 

  13. H.H. Lee and C.K. Un, A study of on-off characteristics of conversational speech, IEEE Trans. Commun. (May 1987) 630–637.

  14. P.T. Brady, A technique for investigating on-off patterns of speech, Bell Syst. Tech. J. (Jan. 1965) 1–22.

  15. P.T. Brady, A statistical analysis of on-off patterns in 16 conversations, Bell Syst. Tech. J. (Jan. 1968) 73–91.

  16. J.G. Gruber, Delay related issues in integrated voice and data networks, IEEE Trans. Commun. (June 1981) 786–800.

  17. J.S. Turner, Design of an integrated services packet network, IEEE JSAC (Nov. 1986) 1373–1380.

Download references

Author information

Authors and Affiliations

Authors

Additional information

This work has been sponsored by the Telecommunications Research Institute of Ontario (TRIO).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Stern, H.P., Mahmoud, S.A. & Wong, KK. A comprehensive model for voice activity in conversational speech-development and application to performance analysis of new-generation wireless communication systems. Wireless Netw 2, 359–367 (1996). https://doi.org/10.1007/BF01262053

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01262053

Keywords

Navigation