Skip to main content

Audio Bandwidth Extension Using Audio Super-Resolution

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing - PCM 2016 (PCM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9917))

Included in the following conference series:

Abstract

Audio bandwidth extension (BWE) has emerged as an important tool for the satisfactory performance of low bitrate audio and speech codecs. In the existing BWE method, the high frequency (HF) excitation signals are generated by replicating the low frequency (LF) band directly. However, the coding perception quality will degrade if the correlation between LF and HF bands becoming weak. In this paper, we proposed a new algorithm to restore the HF excitation signals using audio super-resolution. The experiments shown the new algorithm have an outstanding performance for rebuilding HF excitation signals compare with the conventional replication method. In addition, we also provided a new BWE scheme based on audio super-resolution. According to our experimental results, in compare with LPC-based BWE, the subjective listening quality increased by 13% under the same bitrates; in compare with eSBR, the bitrates drop by 63.7% and have the approximate subjective listening quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB + : a new audio coding standard for 3rd generation mobile audio services. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, PA, USA, vol. 2, pp. 1109–1112 (2005)

    Google Scholar 

  2. Dietz, M., Liljeryd, L., Kjörling, K., Kunz, O.: Spectral band replication, a novel approach in audio coding. In: 112th Convention of the Audio Engineering Society, Munich, Germany (2002)

    Google Scholar 

  3. Epps, J., Holmes, W.: A new technique for wideband enhancement of coded narrowband speech. In: Proceedings of IEEE Workshop on Speech Coding, pp. 174–176 (1999)

    Google Scholar 

  4. Fuemmeler, J.A., Hardie, R.C., Gardner, W.R.: Techniques for the regeneration of wideband speech from narrowband speech. EURASIP J. Appl. Signal Process. 2001(4), 266–274 (2001)

    Google Scholar 

  5. Neukam, C., Nagel, F., Schuller, G., et al.: A MDCT based harmonic spectral bandwidth extension method. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, 26–31 May 2013. IEEE press, pp. 566–570 (2013)

    Google Scholar 

  6. Jiang, L., Hu, R., Wang, X., Zhang, M.: Low bitrates audio bandwidth extension using a deep auto-encoder. In: Ho, Y.-S., Sang, J., Ro, Y.M., Kim, J., Wu, F. (eds.) PCM 2015. LNCS, vol. 9314, pp. 528–537. Springer, Heidelberg (2015). doi:10.1007/978-3-319-24075-6_51

    Chapter  Google Scholar 

  7. Keegan, B.P., Steven, K.T., Liu, K.J.: Super-resolution of musical signals using approximate matching pursuit. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 81–84 (2011)

    Google Scholar 

  8. Dong, J., Wang, W., Chambers, J.: Audio super-resolution using analysis dictionary learning. In: IEEE International Conference on Digital Signal Processing (DSP), pp. 604–608 (2015)

    Google Scholar 

  9. Mandel, M.I., Young, S.C.: Audio super-resolution using concatenative resynthesis. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–5 (2015)

    Google Scholar 

  10. Park, S.C., Park, M.K., Kang, M.G.: Super-resolution image reconstruction: a technical overview. IEEE Signal Process. Mag. 20(3), 21–36 (2003)

    Article  Google Scholar 

  11. Yang, J., Wright, J., Huang, T.S., Ma, Y.: Image super-resolution via sparse representation. IEEE Trans. Image Process. 19(11), 2861–2873 (2010)

    Article  MathSciNet  Google Scholar 

  12. Zhang, T., Liu, C.-T., Quan, H.-J.: AVS-M audio: algorithm and implementation. EURASIP J. Adv. Signal Process. 2011(1), 1–16 (2011)

    Article  Google Scholar 

  13. GB/T 20090.10-2013. Information technology—advanced coding of audio and video—Part 10: Mobile speech and audio. China standard publishing house (2014)

    Google Scholar 

  14. Jie, Z., Choo, K., Oh, E.: Bandwidth extension for China AVS-M standard. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4149–4152 (2009)

    Google Scholar 

  15. ITU-R Rec. BS 1387, Methods for objective measurements of perceptual audio quality (1999)

    Google Scholar 

  16. ITU-R, Recommendation BS. 1534-1, Method for the subjective assessment of intermediate quality levels of coding systems (MUSHRA). International Telecommunication Union (2003)

    Google Scholar 

Download references

Acknowledgments

The research was supported by National Nature Science Foundation of China (No. 61231015, No. 61102127, 61201340, 61201169, 61471271), National High Technology Research and Development Program of China (863 Program) No. 2015AA016306, the Science and Technology Plan in Jiangxi Province Department of Education (GJJ150585).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hu Ruimin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Lin, J., Ruimin, H., Xiaochen, W., Weiping, T. (2016). Audio Bandwidth Extension Using Audio Super-Resolution. In: Chen, E., Gong, Y., Tie, Y. (eds) Advances in Multimedia Information Processing - PCM 2016. PCM 2016. Lecture Notes in Computer Science(), vol 9917. Springer, Cham. https://doi.org/10.1007/978-3-319-48896-7_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48896-7_53

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48895-0

  • Online ISBN: 978-3-319-48896-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics