A Real-Time Content Adaptation Framework for Exploiting ROI Scalability in H.264/AVC

  • Peter Lambert
  • Davy De Schrijver
  • Davy Van Deursen
  • Wesley De Neve
  • Yves Dhondt
  • Rik Van de Walle
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4179)


In many application scenarios, the use of Regions of Interest (ROIs) within video sequences is a useful concept. This paper shows how Flexible Macroblock Ordering (FMO), defined in H.264/AVC as an error resilience tool, can be used for coding arbitrary-shaped ROIs. To exploit the coding of ROIs in an H.264/AVC bitstream, a description-driven content adaptation framework is introduced that can extract the ROIs from a given bitstream.
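The idea of using FMO for ROI coding can be illustrated with a minimal sketch. It is not the framework described in the paper: it assumes rectangular ROIs mapped to slice groups in the manner of the H.264/AVC "foreground with leftover" slice group map type, and it represents slices as plain dictionaries keyed by `first_mb` (named after the `first_mb_in_slice` syntax element). The function names `slice_group_map` and `extract_roi_slices` are hypothetical, and a real extractor would also have to keep the remaining bitstream decodable, which is ignored here.

```python
from typing import Dict, List, Set, Tuple

# (left, top, right, bottom) in macroblock units, inclusive
Rect = Tuple[int, int, int, int]


def slice_group_map(width_mbs: int, height_mbs: int,
                    rois: List[Rect]) -> List[List[int]]:
    """Assign every macroblock to a slice group.

    ROI i becomes slice group i; macroblocks outside all ROIs fall into
    the last ("leftover") slice group. Where ROIs overlap, the lower
    slice group number takes precedence.
    """
    background = len(rois)
    grid = [[background] * width_mbs for _ in range(height_mbs)]
    # Walk the ROIs in reverse so lower-numbered groups overwrite higher ones.
    for group in reversed(range(len(rois))):
        left, top, right, bottom = rois[group]
        for y in range(top, bottom + 1):
            for x in range(left, right + 1):
                grid[y][x] = group
    return grid


def extract_roi_slices(slices: List[Dict[str, int]],
                       grid: List[List[int]],
                       roi_groups: Set[int]) -> List[Dict[str, int]]:
    """Keep only the slices whose first macroblock lies in an ROI slice group.

    Assumption: each slice holds macroblocks of a single slice group, so
    the group of its first macroblock identifies the group of the slice.
    """
    width_mbs = len(grid[0])
    kept = []
    for s in slices:
        y, x = divmod(s["first_mb"], width_mbs)
        if grid[y][x] in roi_groups:
            kept.append(s)
    return kept


if __name__ == "__main__":
    # A 4x3 macroblock picture with one ROI covering columns 1-2, rows 0-1.
    grid = slice_group_map(4, 3, [(1, 0, 2, 1)])
    for row in grid:
        print(row)
    slices = [{"first_mb": 0}, {"first_mb": 1}]
    print(extract_roi_slices(slices, grid, {0}))
```

Dropping the background slices in this way is what reduces the bit rate; the visual effect depends on how the decoder or adaptation step fills in the missing background.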

The results of a series of tests indicate that the ROI extraction process significantly reduces the bit rate of the bitstreams and increases the decoding speed. In the case of a fixed camera and a static background, the impact of this reduction on the visual quality of the video sequence is negligible. As for the adaptation framework itself, it is shown to operate in real time in all cases and to be suited to streaming scenarios by design.


Keywords (added by machine, not by the authors): Video Sequence, Scalable Video Code, Syntax Element, Adaptation Framework, Flexible Macroblock Ordering





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

All authors: Department of Electronics and Information Systems – Multimedia Lab, Ghent University – IBBT, Ledeberg-Ghent, Belgium
