Abstract
This paper proposes a parallelization method for a deblocking filter in a high-efficiency video coding (HEVC) decoder based on complexity estimation. A deblocking filter of HEVC is generally considered to be appropriate for data-level parallelism (DLP) because there are no data-level dependencies among adjacent blocks in the horizontal and vertical filtering processes. However, an imbalanced workload can increase the idle time on some of the threads, and thus, the maximum parallel performance cannot be achieved with only DLP. To alleviate this problem, the proposed method estimates the computational complexity by utilizing the coding unit (CU) segment information and the on/off flag of the deblocking filter in advance. Then, the workload is distributed equally across all threads. The experimental results indicate that the proposed method can accelerate the decoding speed by a factor of 3.97, with six threads on top of the sequential deblocking filtering. In addition, the ratio of the maximum elapsed time to ideal elapsed time is reduced to approximately 21 %, as compared to conventional DLP-based parallel deblocking filtering methods that do not implement a complexity estimation method.
Similar content being viewed by others
References
Sullivan, G.J., Ohm, J.-R., Han, W.-J., Wiegand, T.: Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1649–1668 (2012)
Mrak, M., Baroncini, V., Ramzan, N.: HEVC verification test results. In: ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 JCTVC-Q0204, Valencia, ES (2014)
Ahn, Y., Hwang, T., Sim, D., Han, W.: Implementation of fast HEVC encoder based on SIMD and data-level parallelism. EURASIP J. Image Video Process. 2014(1), 1–19 (2014)
Ahn, Y., Han, W., Sim, D.: Study of decoder complexity for HEVC and AVC standards based on tool-by-tool comparison. In: Proceedings of SPIE 8499, Applications of Digital Image Processing XXXV, San Diego, CA (2012)
Norkin, A., Bjøntegaard, G., Fuldseth, A., Narroschke, M., Ikeda, M., Andersson, K., Zhou, M., Auwera, G.V.: HEVC deblocking filter. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1746–1754 (2012)
Fu, C.-M., Alshina, E., Alshin, A., Huang, Y.-W., Chen, C.-Y., Tsai, C.-Y., Hsu, C.-W., Lei, S.-M., Park, J.-H., Han, W.-J.: Sample adaptive offset in the HEVC standard. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1755–1764 (2012)
Budagavi, M., Fuldseth, A., Bjøntegaard, G., Sze, V., Sadafale, M.: Core transform design in the high efficiency video coding (HEVC) standard. IEEE J. Sel. Topics Signal Process. 7(6), 1029–1041 (2013)
Ugur, K., Alshin, A., Alshina, E., Bossen, F., Han, W.-J., Park, J.-H., Lainema, J.: Motion compensated prediction and interpolation filter design in H.265/HEVC. IEEE J. Sel. Topics Signal Process. 7(6), 946–956 (2013)
Alvarez-Mesa, M., George, V., Schierl, T., Juurlink, B.: Improving parallelization efficiency of WPP using overlapped wavefront. In: ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 JCTVC-J425, Stockholm, SE (2012)
Misra, K., Segall, A., Horowitz, M., Xu, S., Fuldseth, A., Zhou, M.: An overview of tiles in HEVC. IEEE J. Sel. Topics Signal Process. 7(6), 969–977 (2013)
Duan, Y., Sun, J., Yan, L., Chen, K., Guo, Z.: Novel efficient HEVC decoding solution on general-purpose processors. IEEE Trans. Multimed. 16(7), 1915–1928 (2014)
Ikeda, M., Tanaka, J., Suzuki, T.: Parallel deblocking filter. In: ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 JCTVC-D263, CH (2011)
Jo, H., Sim, D., Jeon, B.: Hybrid parallelization for HEVC decoder. In: 2013 6th International Congress on Image and Signal Processing, Hangzhou, CN (2013)
Kotra, A.M., Raulet, M., Deforges, O.: Comparison of different parallel implementations for deblocking filter of HEVC. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC (2013)
Zhang, Y., Yan, C., Dai, F., Ma, Y.: Efficient parallel framework for H.264/AVC deblocking filter on many-core platform. IEEE Trans. Multimed. 12(3), 510–524 (2012)
Bossen, F., Bross, B., Suhring, K., Flynn, D.: HEVC complexity and implementation analysis. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1685–1696 (2012)
Bossen, F.: Common test conditions and software reference configurations. In: ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 JCTVC-H1100, San Jose, CA (2012)
HM-12.1 software
Alvarez-Mesa, M., Ramirez, A., Azevedo, A., Meenderinck, C., Juurlink, B., Valero, M.: Scalability of macroblock-level parallelism for H.264 decoding. In: 15th International Conference on Parallel and Distributed Systems, Shenzhen, CN (2009)
Dagum, L., Menon, R.: OpenMP: an industry standard API for shared-memory programming. IEEE Comput. Sci. Eng. 5(1), 46–55 (1988)
Acknowledgments
This research was supported by the Commercializations Promotion Agency for R&D Outcomes (COMPA) funded by the Ministry of Science, ICT and Future Planning (MISP) and Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2014R1A2A1A11052210).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jo, H., Park, S. & Sim, D. Parallelized deblocking filtering of HEVC decoders based on complexity estimation. J Real-Time Image Proc 12, 369–382 (2016). https://doi.org/10.1007/s11554-015-0556-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-015-0556-9