Wireless Personal Communications

, Volume 45, Issue 2, pp 189–207 | Cite as

A New Buffer Algorithm for Speech Quality Improvement in VoIP Systems

  • Zizhi Qiao
  • Ramesh K. Venkatasubramanian
  • Lingfen Sun
  • Emmanuel C. Ifeachor


Jitter buffer plays an important role in Voice over IP (VoIP) applications because it provides a key mechanism for achieving good speech quality to meet technical and commercial requirements. The main objective of this paper is to propose a new, simple-to-use jitter buffer algorithm as a front-end to conventional static or adaptive jitter buffer algorithms to provide improved performance, in terms of enhanced user-perceived speech quality and reduced end-to-end delay. Supported by signal processing features, the new algorithm, the so-called Play Late Algorithm, alters the playout delay inside a speech talkspurt without introducing unnecessary extra end-to-end delay. The results show that the new algorithm achieves the best performance under different network conditions when compared to conventional static and adaptive jitter buffer algorithms. The results reported here are based on live tests and emulated network conditions on real mobile phone prototypes. The mobile phone prototypes use AMR codec and support full IP/UDP/RTP stack with IPSec function in some of the tests. The method for perceived speech quality measurement is based on the ITU-T standard for speech quality evaluation (PESQ).


Jitter buffer Voice over IP Quality of service (QoS) Perceived speech quality PESQ AMR codec 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    European Telecommunications Standards Institute. (1999). Specification and measurement of speech transmission quality; Part 1: Introduction to objective comparison measurement methods for one-way speech quality across networks. ETSI Guide, EG 201 377–1 V1.1.1, April 1999.Google Scholar
  2. 2.
    Yamamoto, L., & Beerends, J. G. (1997). Impact of network performance parameters on the end-to-end perceived speech quality. In Proceedings of Expert ATM Traffic Symposium. Mykonos, Greece, Sep. 1997.Google Scholar
  3. 3.
    International Telecommunication Union. (1996). Methods for subjective determination of transmission quality. ITU Recommendation P.800, August 1996.Google Scholar
  4. 4.
    International Telecommunication Union. (2001). Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs. ITU-T Recommendation P.862, Feb. 2001.Google Scholar
  5. 5.
    International Telecommunication Union. (2003). Mapping function for transforming P.862 raw result scores to MOS-LQO. ITU-T Recommendation P.862.1, Nov. 2003Google Scholar
  6. 6.
    Moon S.B., Kurose J., Towsley D. (1998). Packet audio playout delay adjustment: performance bounds and algorithms. ACM/Spring Multimedia Systems 5:17–28CrossRefGoogle Scholar
  7. 7.
    Ramjee, R., Kurose, J., Towsley, D. & Schulzrinne, H. (1994). Adaptive playout mechanism for packetised audio application in wide-area networks. IEEE INFOCOM, Toronto, Canada.Google Scholar
  8. 8.
    Kramer, Kris W., Forrester, Chris C., et al. (2003). Jitter buffer management. US patent, US 6,658,027 B1, Dec 2003.Google Scholar
  9. 9.
    Liu, F., Kim, J., Kuo, C.-C. J. (2001). Adaptive delay concealment for Internet voice applications with packet-based time-scale modification. In IEEE International Conference on Acoustics Speech and Signal Processing, Proceedings of ICASSP 01 (vol. 3, pp. 1461–1464). May 2001.Google Scholar
  10. 10.
    Liang Y.J., Färber N., Girod B. (2003). Adaptive playout scheduling and loss concealment for voice communication over IP networks. IEEE Transactions on Multimedia 5(4):532–543CrossRefGoogle Scholar
  11. 11.
    Sun, L., & Ifeachor, E. (2004). New models for perceived voice quality prediction and their applications in playout buffer optimization for VoIP networks. In Proceedings of IEEE International Conference on Communications (IEEE ICC 2004). Paris, France, June 2004.Google Scholar
  12. 12.
    European Telecommunications Standards Institute. (2000). Digital Cellular Telecommunications System (Phase 2+); Adaptive Multi-Rate (AMR) speech transcoding. ETSI-EN-301–704 V7.2.1, April 2000.Google Scholar
  13. 13.
    Liu, E., Shen, G., et al. (2005). Self-adaptive jitter buffer adjustment method for packet-switched network. US patent application publication, US 2005/0058146A1, Mar, 2005.Google Scholar
  14. 14.
    Adrian Emmanuel Eckberg. (2003). Techniques for jitter buffer delay management. US patent application publication, US 2003/0202528A1, Oct. 2003.Google Scholar
  15. 15.
    Fujimoto, K., Ata, S. & Murata, M. (2004). Adaptive playout buffer algorithm for enhancing perceived quality of streaming applications. Telecommunication Systems, 25(3–4), Spinger.Google Scholar
  16. 16.
    Zafiropoulos, F., & Xydeas, C. S. (2003). Model based packet loss concealment for AMR coders. In Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (vol. I, 112–115), April 2003.Google Scholar
  17. 17.
    NIST Internetworking Technology Group (ITG). (2003). NIST net—a linux-based network emulation tool. Computer Communication Review, http://www.x.antd.nist.gov/nistnet/.
  18. 18.
    Sun, L., & Wade, G., et al. (2001). Impact of packet loss location on perceived speech quality. In Proceedings of 2nd IP-Telephony Workshop (IPTEL ’01) (pp. 114–122). Columbia University, New York, 2–3 April, 2001.Google Scholar

Copyright information

© Springer Science+Business Media, LLC. 2007

Authors and Affiliations

  • Zizhi Qiao
    • 1
    • 2
  • Ramesh K. Venkatasubramanian
    • 1
  • Lingfen Sun
    • 2
  • Emmanuel C. Ifeachor
    • 2
  1. 1.Motorola Inc.BasingstokeUK
  2. 2.Signal Processing & Multimedia Communications GroupUniversity of PlymouthPlymouthUK

Personalised recommendations