Low complexity forward error correction for CELP-type speech coding over erasure channel transmission

Benamirouche, Nadir; Boudraa, Bachir; López-Oller, Domingo; Pérez-Córdoba, José L.

doi:10.1007/s10772-016-9365-1

Low complexity forward error correction for CELP-type speech coding over erasure channel transmission

Published: 29 August 2016

Volume 19, pages 717–730, (2016)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Nadir Benamirouche¹,
Bachir Boudraa²,
Domingo López-Oller³ &
…
José L. Pérez-Córdoba³

238 Accesses
1 Citation
Explore all metrics

Abstract

One of the well-known problems of Code-Excited Linear Prediction (CELP)-type codec is its vulnerability to a frame erasure. When a frame is erased, the inter-frame dependency introduced by the Long Term Prediction causes a desynchronization of the Adaptive Codebook (ACB) which introduces in its turn an error propagation through the correctly received frames. In this paper, we propose a media-specific Forward Error Correction (FEC) method using a Pitch-Pulse Codebook (PPCB)-based approach to model the ACB contribution for voiced frame (frame onset) determined under Zero Crossing Rate constraint. The PPCB uses a single pulse optimized by Multipulse Maximum Likelihood Quantization algorithm to model the pitch-like contribution at the encoder side while the quantized version of that pulse will be sent as FEC information to resynchronize the ACB at the decoder side after a frame erasure. Through this approach a noticeable improvement of the synthesis speech quality is achieved under adverse channel conditions with the advantage of low computational complexity while the legacy bit-rate of the codec is kept unchanged.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Quality Enhancement of Low Bit Rate Speech Coder with Nonlinear Prediction

A Simulation-Based Comparison on Code Excited Linear Prediction (CELP) Coder at Different Bit Rates

Stability Analysis of Speech Synthesis Filter of CELP-Based AMR-WB Codec

References

Al-Rousan, M., & Nawasrah, A. (2012). Adaptive FEC technique for multimedia applications over the internet. Journal of Emerging Technologies in Web Intelligence, 4(2), 142–147.
Google Scholar
Andersen, S. V., Kleijn, W. B., Hagen, R., Linden, J., Murthi, M. N., & Skoglund, J. (2002). iLBCA linear predictive coder with robustness to packet losses. Proceedings of IEEE Speech Coding Workshop, Tsukuba (pp. 23–25).
Anselam, A. S., & Pillai, S. S. (2014). Performance evaluation of code excited linear prediction speech coders at various bit rates. In IEEE 2014 International Conference on Computation of Power, Energy, Information and Communication (ICCPEIC) (pp. 93–98).
Assem, H., Malone, D., Dunne, J., & O’Sullivan, P. (2013). A new adaptive redundancy control algorithm for VoIP applications. In IEEE Global Communications Conference (GLOBECOM) (pp. 1323–1328).
Bhebhe, L., & Parkkali, R. (2011). VoIP performance over HSPA with different VoIP clients. Wireless Personal Communications, 58(3), 613–626.
Article Google Scholar
Blake, I. F., & Mullin, R. C. (2014). An introduction to algebraic and combinatorial coding theory. London: Academic Press.
MATH Google Scholar
Carmona, J. L., Pérez-Córdoba, J. L., Peinado, A. M., Gomez, A. M., & Gonzalez, J. A. (2008). A scalable coding scheme based on interframe dependency limitation. IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 4805–4808).
Casu, F., Cabrera, J., Jaureguizar, F., & Garca, N. (2015). A protection scheme for multimedia packet streams in bursty packet loss networks based on small block low-density parity-check codes. EURASIP Journal on Wireless Communications and Networking, 1, 1.
Google Scholar
Chandra, M., & Ray, M. (2015). Comparative study of PCM, LPC, and CELP speech coders used for VoIP applications. Intelligent computing, communication and devices (pp. 579–587). India: Springer.
Chapter Google Scholar
Garofolo, J. S. The Structure and Format of the DARPA TIMIT”, CD-ROM Prototype, Documentation of DARPA TIMIT.
Gomez, A. M., Carmona, J. L., Peinado, A., & Sanchez, V. (2010). A multipulse-based forward error correction technique for robust CELP-coded speech transmission over erasure channels. IEEE Transaction on Audio, Speech, Language Process, 18(6), 1258–1268.
Article Google Scholar
Gomez, A. M., Carmona, J. L., Peinado, A., & Sanchez, V. (2011). One-pulse fec coding for robust celp-coded speech transmission over erasure channels. IEEE Transactions on Multimedia, 13(5), 894–904.
Article Google Scholar
Gupta, V., Dharmaraja, S., & Arunachalam, V. (2015). Stochastic modeling for delay analysis of a VoIP network. Annals of Operations Research, 233(1), 171–180.
Article MathSciNet MATH Google Scholar
ITU Rec., (1996). G.723.1, Dual rate speech coder for multimedia communication transmitting at 5.3kbit/s and 6.3kbit/s.
Jalil, S., Abbad, M., & El Azouzi, R. (2015). Hybrid FEC/ARQ schemes for real-time traffic in wireless networks. In: 2015 International Conference on Wireless Networks and Mobile Communications (WINCOM) (pp. 1–6).
Jalil, M., Butt, F. A., & Malik, A. (2013). Short-time energy, magnitude, zero crossing rate and autocorrelation measurement for discriminating voiced and unvoiced segments of speech signals. In IEEE International Conference In Technological Advances in Electrical, Electronics and Computer Engineering (TAEECE) (pp. 208–212).
Jiang, W., & Schulzrinne, H. (2000). Modeling of packet loss anddelay and their effect on real-time multimedia service quality. In Proceedings of NOSSDAV 2000.
Kang, J. A., & Kim, H. K. (2011). An adaptive packet loss recovery method based on real-time speech quality assessment and redundant speech transmission. International Journal of Innovative Computing, Information and Control, 7(12), 6773–6783.
MathSciNet Google Scholar
Kheddar, H., & Boudraa, B. (2015). Implementation of interleaving methods on MELP 2.4 coder to reduce packet loss in the Voice over IP (VoIP) transmission. International Journal of Engineering Research and Applications, 5(3), 1–4.
Google Scholar
Kim, B. H., Kim, H. G., Jeong, J., & Kim, J. Y. (2013). VoIP receiver-based adaptive playout scheduling and packet loss concealment technique. IEEE Transactions on Consumer Electronics, 59(1), 250–258.
Article Google Scholar
Kuo, C. F., Tseng, H. W., & Pang, A. C. (2013). A fragment-based retransmission scheme with quality-of-service considerations for wireless networks. Wireless Communications and Mobile Computing, 13(16), 1450–1463.
Google Scholar
Lamel, L., Kassel, R., & Seneff, S. (1986). Speech database development: Design and analysis of the acoustic-phonetic corpus. In Proceedings on Speech Recognition Workshop (DARPA) (pp. 100–110).
Liu, J., Zhao, S., Wang, J., & Kuang, J. (2011). FEC-based packet loss recovery for AVS-M audio codec. In: 2011 IEEE International Conference on Multimedia Technology (ICMT) (pp. 3069–3072).
López-Oller, D., Gomez, A. M., & Pérez-Córdoba, J. L. (2014). Source-based error mitigation for speech transmissions over erasure channels. In 2014 22nd European Signal Processing Conference (EUSIPCO) (pp. 1242–1246).
Ma, Z., Martin, R., Guo, J., & Zhang, H. (2014). Nonlinear estimation of missing \(\delta\) LSF parameters by a mixture of Dirichlet distributions. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6929–6933).
Mehran, F. (2011). Interleaver choice in serially concatenated codes for soft real-time wireless speech transmission applications. In IEEE 2011 19th Iranian Conference on Electrical Engineering (ICEE) (pp. 1–6).
Merazka, F. (2013). Packet loss concealment by interpolation for speech over IP network services. In 2013 Constantinides International Workshop on Signal Processing (CIWSP 2013) (pp. 1–4).
Merazka, F. (2014). A comparison of packet loss concealment and control for voice transmission over IP network services. In IEEE 9th International Symposium: Communication Systems, Networks & Digital Signal Processing (CSNDSP) (pp. 497–501).
Miralavi, S. R., Ghorshi, S., Mortazavi, M., & Choupan, J. (2011). Packet loss replacement in voip using a recursive low-order autoregressive model-based speech. In 8th International Multi-Conference on Systems, Signals and Devices (SSD) (pp. 1–4).
Nagano, T., & Ito, A. (2013). A Packet Loss Recovery of G.729 speech using discriminative model and N-gram. In 2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (pp. 267–270).
Nath, D., & Kalita, S. K. (2014). An effective age detection method based on short time energy and zero crossing rate. In IEEE 2nd International Conference In Business and Information Management (ICBIM) (pp. 99–103).
Oh, S. M., & Kim, J. H. (2012). Application-aware retransmission design for VoIP services in BWA networks. In IEEE 14th International Conference on Advanced Communication Technology (ICACT) (pp. 122–131).
Park, N. I., Kim, H. K., Jung, M. A., Lee, S. R., & Choi, S. H. (2010). A packet loss concealment algorithm robust to burst packet loss using multiple codebooks and comfort noise for CELP-type speech coders. Communication and Networking (pp. 138–147). Berlin, Heidelberg: Springer.
Chapter Google Scholar
Perceptual Evaluation of Speech Quality (PESQ). (2001). An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. ITU-T P.862 Recommendation.
Perkins, C., Hodson, O. & Hardman, V. (2001). A survey of packet loss recovery techniques for streaming audio. Readings in Multimedia Computing and Networking, 607–615.
Silveira, F., & Silva, E. D. S. (2012). Predicting packet loss statistics with hidden Markov models for FEC control. Computer Networks, 56(2), 628–641.
Article Google Scholar
Singh, H. P., Singh, S., Singh, J., & Khan, S. A. (2014). VoIP: State of art for global connectivity a critical review. Journal of Network and Computer Applications, 37, 365–379.
Article Google Scholar
Taleb, A., (2011). Low-complexity code excited linear prediction encoding. U.S. Patent No. 8,000,967. Washington, DC: U.S. Patent and Trademark Office.
Toral-Cruz, H., Pathan, A. S. K., & Ramirez Pacheco, J. C. (2013). Accurate modeling of VoIP traffic QoS parameters in current and future networks with multifractal and Markov models. Mathematical and Computer Modelling, 2832–2845(57), 11.
MathSciNet MATH Google Scholar
Toyoshima, M. & Shimamura, T. (2014). Packet loss concealment for VoIP based on pitch waveform replication and linear predictive coding. In IEEE Asia Pacific Conference on Circuits and Systems (APCCAS) (pp. 89–92).
TS 26.190, Adaptive multi-rate-wideband (AMR-WB) speech codec: Transcoding functions, 3GPP Tech. Spec.

Download references

Author information

Authors and Affiliations

Laboratoire de Génie Electrique, Faculté de Technologie, Université de Bejaia, 06000, Bejaia, Algeria
Nadir Benamirouche
Faculty of Electronics and Computer Science, University of Sciences and Technology H. B., 16000, Algiers, Algeria
Bachir Boudraa
Department of Signal Theory, Networking and Communications, University of Granada, 18071, Granada, Spain
Domingo López-Oller & José L. Pérez-Córdoba

Authors

Nadir Benamirouche
View author publications
You can also search for this author in PubMed Google Scholar
Bachir Boudraa
View author publications
You can also search for this author in PubMed Google Scholar
Domingo López-Oller
View author publications
You can also search for this author in PubMed Google Scholar
José L. Pérez-Córdoba
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nadir Benamirouche.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Benamirouche, N., Boudraa, B., López-Oller, D. et al. Low complexity forward error correction for CELP-type speech coding over erasure channel transmission. Int J Speech Technol 19, 717–730 (2016). https://doi.org/10.1007/s10772-016-9365-1

Download citation

Received: 04 March 2016
Accepted: 16 August 2016
Published: 29 August 2016
Issue Date: December 2016
DOI: https://doi.org/10.1007/s10772-016-9365-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Low complexity forward error correction for CELP-type speech coding over erasure channel transmission

Abstract

Access this article

Similar content being viewed by others

Quality Enhancement of Low Bit Rate Speech Coder with Nonlinear Prediction

A Simulation-Based Comparison on Code Excited Linear Prediction (CELP) Coder at Different Bit Rates

Stability Analysis of Speech Synthesis Filter of CELP-Based AMR-WB Codec

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Low complexity forward error correction for CELP-type speech coding over erasure channel transmission

Abstract

Access this article

Similar content being viewed by others

Quality Enhancement of Low Bit Rate Speech Coder with Nonlinear Prediction

A Simulation-Based Comparison on Code Excited Linear Prediction (CELP) Coder at Different Bit Rates

Stability Analysis of Speech Synthesis Filter of CELP-Based AMR-WB Codec

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation