Skip to main content
Log in

A visual attention model for adapting images on small displays

  • OriginalPaper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract.

Image adaptation, one of the essential problems in adaptive content delivery for universal access, has been actively explored for some time. Most existing approaches have focused on generic adaptation with a view to saving file size under constraints in client environment and have hardly paid attention to user perceptions of the adapted result. Meanwhile, the major limitation on the user’s delivery context is moving away from data volume (or time-to-wait) to screen size because of the galloping development of hardware technologies. In this paper, we propose a novel method for adapting images based on user attention. A generic and extensible image attention model is introduced based on three attributes (region of interest, attention value, and minimal perceptible size) associated with each attention object. A set of automatic modeling methods are presented to support this approach. A branch-and-bound algorithm is also developed to find the optimal adaptation efficiently. Experimental results demonstrate the usefulness of the proposed scheme and its potential application in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Chandra S, Gehani A, Ellis CS, Vahdat A (2001) Transcoding characteristics of Web images. Proc SPIE (Multimedia Comput Network 2001) 4312:135-149

    Google Scholar 

  2. Chen JL, Zhou BY, Shi J, Zhang HJ, Wu QF (2001) Function-based object model towards website adaptation. In: Proceedings of the 10th international World Wide Web conference, Hong Kong, May 2001, pp 587-596

  3. Chen XR, Zhang HJ (2001) Text area detection from video frames. In: Proceedings of the 2nd IEEE Pacific-Rim conference on multimedia (PCM2001), Beijing, October 2001, pp 222-228

  4. Christopoulos C, Skodras A, Ebrahimi T (2000) The JPEG2000 still image coding system: an overview. IEEE Trans Consumer Electron 46(4):1103-1127

    Google Scholar 

  5. Fan X, Xie X, Ma WY, Zhang HJ, Zhou HQ (2003) Visual attention based image browsing on mobile devices. In: Proceedings of the IEEE international conference on multimedia and expo (ICME 03), Baltimore, July 2003

  6. Fox A, Gribble S, Brewer EA, Amir E (1996) Adapting to network and client variability via on-demand dynamic distillation. In: Proceedings of the 7th international conference on architectural support for programming languages and operating systems. Cambridge, MA, October 1996, pp 160-170

  7. Han R, Bhagwat P, Lamaire R, Mummert T, Perret V, Rubas J (1998) Dynamic adaptation in an image transcoding proxy for mobile Web access. IEEE Pers Commun 5(6):8-17

    Google Scholar 

  8. ISO/IEC JTC1/SC29/WG11/N4242 (2001) ISO/IEC 15938-5 FDIS Information technology - multimedia content description interface - Part 5: Multimedia description schemes. Sydney, July 2001

  9. ISO/IEC JTC1/SC29/WG11/N4674 (2002) MPEG-7 Overview. Jeju, Korea, March 2002

  10. ISO/IEC JTC1/SC29/WG11/N4819 (2002) MPEG-21 Digital item adaptation. Fairfax, VA, May 2002

  11. Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Analysis Mach Intell 20(11):1254-1259

    Google Scholar 

  12. Itti L, Koch C (1999) A comparison of feature combination strategies for saliency-based visual attention system. Proc SPIE (Hum Vis Electron Imag IV) 3644:473-482

    Google Scholar 

  13. Itti L, Koch C (2001) Computational modeling of visual attention. Nat Rev Neurosci 2(3):194-203

    Google Scholar 

  14. Lee K, Chang HS, Chun SS, Choi L, Sull S (2001) Perception-based image transcoding for universal multimedia access. In: Proceedings of the 8th international conference on image processing (ICIP-2001), Thessaloniki, Greece, October 2001, 2:475-478

  15. Li SZ, Zhu L, Zhang ZQ, Blake A, Zhang HJ, Shum H (2002) Statistical learning of multi-view face detection. In: Proceedings of the 7th European conference on computer vision (ECCV 2002), Copenhagen, May 2002, 4:67-81

  16. Ma WY, Bedner I, Chang G, Kuchinsky A, Zhang HJ (2000) A framework for adaptive content delivery in heterogeneous network environments. Proc SPIE (Multimedia Comput Network 2000) 3969:86-100

    Google Scholar 

  17. Ma YF, Lu L, Zhang HJ, Li MJ (2002) A user attention model for video summarization. In: Proceedings of the 10th ACM international conference on multimedia, Juan-les-Pins, France, December 2002, pp 533-542

  18. Mohan R, Smith JR, Li CS (1999) Adapting multimedia internet content for universal access. IEEE Trans Multimedia 1(1):104-114

    Google Scholar 

  19. Salah AA, Alpaydin E, Akarun L (2002) A selective attention-based method for visual pattern recognition with application to handwritten digit recognition and face recognition. IEEE Trans Pattern Analysis Mach Intell 24(3):420-425

    Google Scholar 

  20. Smith JR, Mohan R, Li CS (1998) Content-based transcoding of images in the Internet. In: Proceedings of the 5th international conference on image processing (ICIP-98), Chicago, October 1998, 3:7-11

  21. World Wide Web Consortium (1999) Web content accessibility guidelines 1.0. May 1999, http://www.w3.org/tr/wai-webcontent/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xing Xie.

Additional information

Xing Xie: Correspondence to:

Xin Fan: This work was conducted while the first and third authors were visiting students at Microsoft Research Asia.

Abbreviations: AO, attention object; ROI, region-of-interest; AV, attention value; MPS, minimal perceptible size; IF, information fidelity; DS, description scheme// Part of this work was originally presented at the 9th International Conference on Multimedia Modeling (MMM’03).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, LQ., Xie, X., Fan, X. et al. A visual attention model for adapting images on small displays. Multimedia Systems 9, 353–364 (2003). https://doi.org/10.1007/s00530-003-0105-4

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-003-0105-4

Keywords:

Navigation