Audio Bandwidth Extension Using Audio Super-Resolution

Lin, Jiang; Ruimin, Hu; Xiaochen, Wang; Weiping, Tu

doi:10.1007/978-3-319-48896-7_53


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9917)

Included in the following conference series:

Pacific Rim Conference on Multimedia


Abstract

Audio bandwidth extension (BWE) has emerged as an important tool for achieving satisfactory performance in low-bitrate audio and speech codecs. In existing BWE methods, the high-frequency (HF) excitation signal is generated by replicating the low-frequency (LF) band directly. However, the perceived coding quality degrades when the correlation between the LF and HF bands is weak. In this paper, we propose a new algorithm that restores the HF excitation signal using audio super-resolution. Experiments show that the new algorithm outperforms the conventional replication method in rebuilding HF excitation signals. In addition, we present a new BWE scheme based on audio super-resolution. According to our experimental results, compared with LPC-based BWE, subjective listening quality increases by 13% at the same bitrate; compared with eSBR, the bitrate drops by 63.7% at approximately the same subjective listening quality.
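
The conventional replication idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm or any codec's implementation: the function names `replicate_lf_to_hf` and `pearson`, the list-of-magnitudes representation of a frame, and the toy values are all assumptions made for illustration. The sketch tiles the LF band into the HF range and measures how well the copy correlates with a hypothetical true HF band, which is the situation the abstract describes as problematic when that correlation is weak.

```python
import math


def replicate_lf_to_hf(lf_band, n_hf_bins):
    """Build an HF excitation by tiling the LF band until n_hf_bins are filled.

    Hypothetical helper: a frame is assumed to be a list of spectral magnitudes.
    """
    if not lf_band:
        raise ValueError("lf_band must be non-empty")
    if n_hf_bins < 0:
        raise ValueError("n_hf_bins must be non-negative")
    return [lf_band[i % len(lf_band)] for i in range(n_hf_bins)]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("sequences must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Toy magnitudes, invented for illustration only.
    lf = [1.0, 0.8, 0.6, 0.4]
    true_hf = [0.1, 0.3, 0.1, 0.3]
    copied_hf = replicate_lf_to_hf(lf, len(true_hf))
    # A low or negative value means the copied band is a poor stand-in
    # for the true HF band of this frame.
    print(pearson(copied_hf, true_hf))
```

When the printed correlation is close to 1, direct replication is a reasonable approximation for that frame; when it is low, a method that predicts the HF excitation rather than copying it has more room to help.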



Acknowledgments

The research was supported by the National Natural Science Foundation of China (No. 61231015, 61102127, 61201340, 61201169, 61471271), the National High Technology Research and Development Program of China (863 Program, No. 2015AA016306), and the Science and Technology Plan of the Jiangxi Province Department of Education (GJJ150585).

Author information

Authors and Affiliations

State Key Lab of Software Engineering, Computer School of Wuhan University, Wuhan, China
Jiang Lin, Hu Ruimin, Wang Xiaochen & Tu Weiping
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China
Hu Ruimin, Wang Xiaochen & Tu Weiping
Software School, East China University of Technology, Nanchang, China
Jiang Lin

Corresponding author

Correspondence to Hu Ruimin.

Editor information

Editors and Affiliations

Zhengzhou University, Zhengzhou, China
Enqing Chen
Jiaotong University, Xi’an, China
Yihong Gong
Zhengzhou University, Zhengzhou, China
Yun Tie


About this paper

Cite this paper

Lin, J., Ruimin, H., Xiaochen, W., Weiping, T. (2016). Audio Bandwidth Extension Using Audio Super-Resolution. In: Chen, E., Gong, Y., Tie, Y. (eds) Advances in Multimedia Information Processing - PCM 2016. PCM 2016. Lecture Notes in Computer Science, vol 9917. Springer, Cham. https://doi.org/10.1007/978-3-319-48896-7_53


DOI: https://doi.org/10.1007/978-3-319-48896-7_53
Published: 27 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48895-0
Online ISBN: 978-3-319-48896-7
eBook Packages: Computer Science, Computer Science (R0)
