Fovea Based Coding for Video Streaming

  • Çağatay Dikici
  • H. Işıl Bozma
  • Reha Civanlar
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3211)

Abstract

Attentive robots, inspired by human-like vision – are required to have visual systems with fovea-periphery distinction and saccadic motion capability. Thus, each frame in the incoming image sequence has nonuniform sampling and consecutive saccadic images have temporal redundancy. In this paper, we propose a novel video coding and streaming algorithm for low bandwidth networks that exploits these two features simultaneously. Our experimental results reveal improved video streaming in applications like robotic teleoperation. Furthermore, since the algorithm employs the Gaussian-like resolution of human visual system and is extremely simple to integrate with the standard coding schemes, it can also be used in applications such as cellular phones with video.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Akins, K. (ed.): Perception, pp. 290–316. Oxford University Press, Oxford (1996)Google Scholar
  2. 2.
    Ballard, D.H., Brown, C.M.: Principles of Animate Vision. CVIP: Image Understanding 56 (1992)Google Scholar
  3. 3.
    Koch, C., Itti, L.: A Saliency-Based Search Mechanism for Overt and Covert Shifts of Visual Attention. Vision Research (2000)Google Scholar
  4. 4.
    ITU-T Recommendation H.263: Video coding for low bit rate communication (1996) Google Scholar
  5. 5.
    Kowler, E. (ed.): Eye Movements and Their Role in Visual and Cognitive Processes. Elsevier, Amsterdam (1990)Google Scholar
  6. 6.
    Gouras, P., Bailey, C.H.: The retina and phototransduction. In: Schwartz, J.H., Kandel, E.R. (eds.) Principles of Neural Science, Elsevier, Amsterdam (1986)Google Scholar
  7. 7.
    Takacs, D., Wechsler, H.: A Dynamic and Multiresolution Model of Visual Attention and Its Application to Facial Landmark Detection. Computer Vision and Image Understanding 70(1), 63–73 (1998)CrossRefGoogle Scholar
  8. 8.
    Wiebe, K., Basu, A.: Improving image and video transmission quality over ATM with foveal priorization and priority dithering. Pattern Recognition Letters 22 (2001)Google Scholar
  9. 9.
    Reeves, T.H., Robinson, J.A.: Rate Control of Foveated MPEG Video. In: CCECE (1997)Google Scholar
  10. 10.
    Geisler, W.S., Perry, J.S.: A Real-time Foveated Multi resolution System for Low band width Video Comunication. In: SPIE Proceedings: Human Vision and Electronic Imaging, vol. 3299, pp. 294–305 (1998)Google Scholar
  11. 11.
    Grosso, E., Manzotti, R., Tiso, R., Sandini, G.: A Space-Variant Approach to Oculomotor Control. In: Proceedings of International Symposium on Computer Vision, pp. 509–514 (1995)Google Scholar
  12. 12.
    Soyer, C., Bozma, H.I., Istefanopulos, Y.: Apes: Actively Perceiving Robot. In: Proceedings of IEEE/RSJ International Conference on Robots and Systems, Lausanne, Switzerland (2002)Google Scholar
  13. 13.
    Soyer, C., Bozma, H.I., Istefanopulos, Y.: Attentional Sequence Based Recognition: Markovian and Eviential Reasoning. IEEE Transactions on Systems, Man and Cybernatics 33(6), 937–950 (2003)CrossRefGoogle Scholar
  14. 14.
  15. 15.
    Whang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image Quality Assessment:From Error Measurement to Structural Similarity. IEEE Transactions on Image Processing 13(1) (2004)Google Scholar
  16. 16.

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Çağatay Dikici
    • 1
  • H. Işıl Bozma
    • 1
  • Reha Civanlar
    • 2
  1. 1.Intelligent Systems Laboratory, Electrical and Electronics Engineering DepartmentBoğaziçi UniversityBebek, IstanbulTurkey
  2. 2.Computer Engineering DepartmentKoç UniversityIstanbulTurkey

Personalised recommendations