Abstract
Audio bandwidth extension (BWE) has emerged as an important tool for the satisfactory performance of low bitrate audio and speech codecs. In the existing BWE method, the high frequency (HF) excitation signals are generated by replicating the low frequency (LF) band directly. However, the coding perception quality will degrade if the correlation between LF and HF bands becoming weak. In this paper, we proposed a new algorithm to restore the HF excitation signals using audio super-resolution. The experiments shown the new algorithm have an outstanding performance for rebuilding HF excitation signals compare with the conventional replication method. In addition, we also provided a new BWE scheme based on audio super-resolution. According to our experimental results, in compare with LPC-based BWE, the subjective listening quality increased by 13% under the same bitrates; in compare with eSBR, the bitrates drop by 63.7% and have the approximate subjective listening quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB + : a new audio coding standard for 3rd generation mobile audio services. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, PA, USA, vol. 2, pp. 1109–1112 (2005)
Dietz, M., Liljeryd, L., Kjörling, K., Kunz, O.: Spectral band replication, a novel approach in audio coding. In: 112th Convention of the Audio Engineering Society, Munich, Germany (2002)
Epps, J., Holmes, W.: A new technique for wideband enhancement of coded narrowband speech. In: Proceedings of IEEE Workshop on Speech Coding, pp. 174–176 (1999)
Fuemmeler, J.A., Hardie, R.C., Gardner, W.R.: Techniques for the regeneration of wideband speech from narrowband speech. EURASIP J. Appl. Signal Process. 2001(4), 266–274 (2001)
Neukam, C., Nagel, F., Schuller, G., et al.: A MDCT based harmonic spectral bandwidth extension method. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, 26–31 May 2013. IEEE press, pp. 566–570 (2013)
Jiang, L., Hu, R., Wang, X., Zhang, M.: Low bitrates audio bandwidth extension using a deep auto-encoder. In: Ho, Y.-S., Sang, J., Ro, Y.M., Kim, J., Wu, F. (eds.) PCM 2015. LNCS, vol. 9314, pp. 528–537. Springer, Heidelberg (2015). doi:10.1007/978-3-319-24075-6_51
Keegan, B.P., Steven, K.T., Liu, K.J.: Super-resolution of musical signals using approximate matching pursuit. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 81–84 (2011)
Dong, J., Wang, W., Chambers, J.: Audio super-resolution using analysis dictionary learning. In: IEEE International Conference on Digital Signal Processing (DSP), pp. 604–608 (2015)
Mandel, M.I., Young, S.C.: Audio super-resolution using concatenative resynthesis. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–5 (2015)
Park, S.C., Park, M.K., Kang, M.G.: Super-resolution image reconstruction: a technical overview. IEEE Signal Process. Mag. 20(3), 21–36 (2003)
Yang, J., Wright, J., Huang, T.S., Ma, Y.: Image super-resolution via sparse representation. IEEE Trans. Image Process. 19(11), 2861–2873 (2010)
Zhang, T., Liu, C.-T., Quan, H.-J.: AVS-M audio: algorithm and implementation. EURASIP J. Adv. Signal Process. 2011(1), 1–16 (2011)
GB/T 20090.10-2013. Information technology—advanced coding of audio and video—Part 10: Mobile speech and audio. China standard publishing house (2014)
Jie, Z., Choo, K., Oh, E.: Bandwidth extension for China AVS-M standard. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4149–4152 (2009)
ITU-R Rec. BS 1387, Methods for objective measurements of perceptual audio quality (1999)
ITU-R, Recommendation BS. 1534-1, Method for the subjective assessment of intermediate quality levels of coding systems (MUSHRA). International Telecommunication Union (2003)
Acknowledgments
The research was supported by National Nature Science Foundation of China (No. 61231015, No. 61102127, 61201340, 61201169, 61471271), National High Technology Research and Development Program of China (863 Program) No. 2015AA016306, the Science and Technology Plan in Jiangxi Province Department of Education (GJJ150585).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Lin, J., Ruimin, H., Xiaochen, W., Weiping, T. (2016). Audio Bandwidth Extension Using Audio Super-Resolution. In: Chen, E., Gong, Y., Tie, Y. (eds) Advances in Multimedia Information Processing - PCM 2016. PCM 2016. Lecture Notes in Computer Science(), vol 9917. Springer, Cham. https://doi.org/10.1007/978-3-319-48896-7_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-48896-7_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48895-0
Online ISBN: 978-3-319-48896-7
eBook Packages: Computer ScienceComputer Science (R0)