Parallel Full HD Video Decoding for Multicore Architecture

  • S. Sankaraiah
  • Hai Shuan Lam
  • C. Eswaran
  • Junaidi Abdullah
Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 285)

Abstract

Nowadays, the multicore architecture is adopted everywhere in the design of contemporary processors in order to boost up the performance of multitasking applications. This paper mainly exploits the multicore capability for full HD video decoding speedup to meet realtime display. Hantro 6100 H.264 decoder is chosen as the reference decoder. The serial decoding algorithm in the Hantro 6100 H.264 decoder is replaced with a parallel decoding algorithm. In this research work, macroblock level parallelism is implemented using the enhanced version of macroblock region partitioning (MBRP) is implemented for the parallel video decoding of H.264 video. The results show that the workloads are well-balanced among the processor cores. It is observed that the maximum speedup values are attained when the decoder is running with 4 threads on a 4 core system and 8 logical core system configuration. Moreover, it is also observed that there is no degradation of visual quality throughout the decoding process.

Keywords

Multicore processor Speed-up Macroblock Parallel Load balancing Hantro decoder 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    I. E. Richardson.: The H.264 Advanced Video Compression Standard. 2nd ed, Wiley, UK (2010)Google Scholar
  3. 3.
    M. J. Quinn.: Parallel Programming in C with MPI and OpenMP. McGraw-Hill, USA (2003)Google Scholar
  4. 4.
    Cor Meenderinck, Arnaldo Azevedo, Ben Juurlink, Mauricio Alvarez Mesa, Alex Ramirez.: Parallel Scalability of Video Decoders. Journal of Signal Processing Systems. 57,(2), 173–194 (2009)Google Scholar
  5. 5.
    E. B. Van der Tol, E. Jasper, and R. H. Gelderblom.: Mapping of H.264 Decoding on a Multiprocessor Architecture. In: Proceeding of SPIE Conference on Image and Video Communications, pp. 707–709. SPIE Press, New York (2003)Google Scholar
  6. 6.
    A. Gurhanli, C. C. Chen, and S. Hung.: GOP-level parallelization of the H.264 decoder without a start-code scanner. In: 2nd International Conference on Signal Processing Systems (ICSPS), pp. (2010)Google Scholar
  7. 7.
    S. Sankaraiah, H. S. Lam, C. Eswaran and Junaidi Abdullah.: GOP Level Parallelism on H.264 Video Encoder for Multicore Architecture. In: IPCSIT, pp. 127–132. IACSIT Press, New York (2011)Google Scholar
  8. 8.
    S. Sankaraiah, H. S. Lam, C. Eswaran, and J. Abdullah.: Performance Optimization of Video Coding Process on Multi-Core Platform Using Gop Level Parallelism. International Journal of Parallel Programming. Springer, 1–17 (2013)Google Scholar
  9. 9.
    A. Gurhanli, C. C. Chen, and S. Hung.: Coarse Grain Parallelization of H.264 Video De- coder and Memory Bottleneck in Multi-Core Architectures. International Journal of Computer Theory and Engineering. 3,(3), 375–381 (2011Google Scholar
  10. 10.
    Y. Chen, E. Li, X. Zhou, and S. Ge.: Implementation of H. 264 Encoder and Decoder on Personal Computers. Journal of Visual Communications and Image Representation. 17, (2006)Google Scholar
  11. 11.
    Y. Chen, X. Tian, S. Ge, and M. Girkar.: Towards Efficient MultiLevel Threading of H.264 Encoder on Intel Hyper-Threading Architectures. In: Proceedings of the 18th International Parallel and Distributed Processing Symposium, pp. 63–67. IEEE Press, New York (2004)Google Scholar
  12. 12.
    A. Azevedo, C. Meenderinck, B. Juurlink, A. Terechko, J. Hoogerbrugge, M. Alvarez, and A. Rammirez.: Parallel H.264 Decoding on an Embedded Multicore Processor. In: 4th International Conference on High Performance and Embedded Architectures and Compilers (2009)Google Scholar
  13. 13.
    S. Sun, D.Wang, and S. Chen.: A Highly Efficient Parallel Algorithm for H.264 Encoder Based on Macroblock region partition. In: 3rd international conference on High Performance Computing and Communications, pp.577–585 181–184. IEEE Press, New York (2007)Google Scholar
  14. 14.
    M. A. Mesa, A. Ramirez, A. Azevedo, C. Meenderinck, B. Juurlink, and M. Valero.: Scalability of Macroblock-level Parallelism for H.264 Decoding. In: 15th International Conference on Parallel and Distributed Systems, pp.236-243 (2009)Google Scholar
  15. 15.
    Simple DirectMedia Layer, http://www.libsdl.org
  16. 16.
    Y. Kim, J. Kim, S. Bae, H. Baik, and H. Song.: H.264/AVC decoder parallelization and optimization on asymmetric multicore platform using dynamic load balancing. In: IEEE International Conference on Multimedia and Expo, pp.1001-1004 (2008)Google Scholar
  17. 17.
    Cor Meenderinck, Arnaldo Azevedo, Mauricio Alvarez, Ben Juurlink, Alex Ramirez.: Parallel Scalability of H.264. In: first Workshop on Programmability Issues for Multi-Core Computers, pp.1–12. Goteborg, Sweden (2008)Google Scholar
  18. 18.
    Mauricio Alvarez Mesa, Alex Ramirez, Arnaldo Azevedo, Cor Meenderinck, Ben Juurlink, Mateo Valero.: Scalability of Macroblock-level Parallelism for H.264 Decoding. In: 15th International Conference on Parallel and Distributed Systems, pp.236–243. Shenzhen, China (2009)Google Scholar
  19. 19.
    M. Kim, J. Song, D. H. Kim, and S. Lee.: H.264 Decode or Embedded code with Dynamically Load-balanced Functional Partitioning. In: 17th IEEE International Conference on Image Processing, IEEE Press, New York (2010)Google Scholar

Copyright information

© Springer Science+Business Media Singapore 2014

Authors and Affiliations

  • S. Sankaraiah
    • 1
  • Hai Shuan Lam
    • 1
  • C. Eswaran
    • 1
  • Junaidi Abdullah
    • 1
  1. 1.Centre for Visual ComputingMultimedia UniversityCyberjayaMalaysia

Personalised recommendations