A visual attention model for adapting images on small displays

Chen, Li-Qun; Xie, Xing; Fan, Xin; Ma, Wei-Ying; Zhang, Hong-Jiang; Zhou, He-Qin

doi:10.1007/s00530-003-0105-4

A visual attention model for adapting images on small displays

OriginalPaper
Published: October 2003

Volume 9, pages 353–364, (2003)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Li-Qun Chen¹,
Xing Xie²,
Xin Fan¹,
Wei-Ying Ma²,
Hong-Jiang Zhang² &
…
He-Qin Zhou¹

1370 Accesses
321 Citations
9 Altmetric
Explore all metrics

Abstract.

Image adaptation, one of the essential problems in adaptive content delivery for universal access, has been actively explored for some time. Most existing approaches have focused on generic adaptation with a view to saving file size under constraints in client environment and have hardly paid attention to user perceptions of the adapted result. Meanwhile, the major limitation on the user’s delivery context is moving away from data volume (or time-to-wait) to screen size because of the galloping development of hardware technologies. In this paper, we propose a novel method for adapting images based on user attention. A generic and extensible image attention model is introduced based on three attributes (region of interest, attention value, and minimal perceptible size) associated with each attention object. A set of automatic modeling methods are presented to support this approach. A branch-and-bound algorithm is also developed to find the optimal adaptation efficiently. Experimental results demonstrate the usefulness of the proposed scheme and its potential application in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Attention mechanisms in computer vision: A survey

Article Open access 15 March 2022

Visual attention network

Article Open access 28 July 2023

A survey of the vision transformers and their CNN-transformer based variants

Article 04 October 2023

References

Chandra S, Gehani A, Ellis CS, Vahdat A (2001) Transcoding characteristics of Web images. Proc SPIE (Multimedia Comput Network 2001) 4312:135-149
Google Scholar
Chen JL, Zhou BY, Shi J, Zhang HJ, Wu QF (2001) Function-based object model towards website adaptation. In: Proceedings of the 10th international World Wide Web conference, Hong Kong, May 2001, pp 587-596
Chen XR, Zhang HJ (2001) Text area detection from video frames. In: Proceedings of the 2nd IEEE Pacific-Rim conference on multimedia (PCM2001), Beijing, October 2001, pp 222-228
Christopoulos C, Skodras A, Ebrahimi T (2000) The JPEG2000 still image coding system: an overview. IEEE Trans Consumer Electron 46(4):1103-1127
Google Scholar
Fan X, Xie X, Ma WY, Zhang HJ, Zhou HQ (2003) Visual attention based image browsing on mobile devices. In: Proceedings of the IEEE international conference on multimedia and expo (ICME 03), Baltimore, July 2003
Fox A, Gribble S, Brewer EA, Amir E (1996) Adapting to network and client variability via on-demand dynamic distillation. In: Proceedings of the 7th international conference on architectural support for programming languages and operating systems. Cambridge, MA, October 1996, pp 160-170
Han R, Bhagwat P, Lamaire R, Mummert T, Perret V, Rubas J (1998) Dynamic adaptation in an image transcoding proxy for mobile Web access. IEEE Pers Commun 5(6):8-17
Google Scholar
ISO/IEC JTC1/SC29/WG11/N4242 (2001) ISO/IEC 15938-5 FDIS Information technology - multimedia content description interface - Part 5: Multimedia description schemes. Sydney, July 2001
ISO/IEC JTC1/SC29/WG11/N4674 (2002) MPEG-7 Overview. Jeju, Korea, March 2002
ISO/IEC JTC1/SC29/WG11/N4819 (2002) MPEG-21 Digital item adaptation. Fairfax, VA, May 2002
Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Analysis Mach Intell 20(11):1254-1259
Google Scholar
Itti L, Koch C (1999) A comparison of feature combination strategies for saliency-based visual attention system. Proc SPIE (Hum Vis Electron Imag IV) 3644:473-482
Google Scholar
Itti L, Koch C (2001) Computational modeling of visual attention. Nat Rev Neurosci 2(3):194-203
Google Scholar
Lee K, Chang HS, Chun SS, Choi L, Sull S (2001) Perception-based image transcoding for universal multimedia access. In: Proceedings of the 8th international conference on image processing (ICIP-2001), Thessaloniki, Greece, October 2001, 2:475-478
Li SZ, Zhu L, Zhang ZQ, Blake A, Zhang HJ, Shum H (2002) Statistical learning of multi-view face detection. In: Proceedings of the 7th European conference on computer vision (ECCV 2002), Copenhagen, May 2002, 4:67-81
Ma WY, Bedner I, Chang G, Kuchinsky A, Zhang HJ (2000) A framework for adaptive content delivery in heterogeneous network environments. Proc SPIE (Multimedia Comput Network 2000) 3969:86-100
Google Scholar
Ma YF, Lu L, Zhang HJ, Li MJ (2002) A user attention model for video summarization. In: Proceedings of the 10th ACM international conference on multimedia, Juan-les-Pins, France, December 2002, pp 533-542
Mohan R, Smith JR, Li CS (1999) Adapting multimedia internet content for universal access. IEEE Trans Multimedia 1(1):104-114
Google Scholar
Salah AA, Alpaydin E, Akarun L (2002) A selective attention-based method for visual pattern recognition with application to handwritten digit recognition and face recognition. IEEE Trans Pattern Analysis Mach Intell 24(3):420-425
Google Scholar
Smith JR, Mohan R, Li CS (1998) Content-based transcoding of images in the Internet. In: Proceedings of the 5th international conference on image processing (ICIP-98), Chicago, October 1998, 3:7-11
World Wide Web Consortium (1999) Web content accessibility guidelines 1.0. May 1999, http://www.w3.org/tr/wai-webcontent/

Download references

Author information

Authors and Affiliations

Dept. of Automation, University of Science and Technology of China, 230027, Hefei, P.R. China
Li-Qun Chen, Xin Fan & He-Qin Zhou
5/F Sigma Center, Microsoft Research Asia, No. 49 Zhichun Road, 100080, Beijing, P.R. China
Xing Xie, Wei-Ying Ma & Hong-Jiang Zhang

Authors

Li-Qun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xing Xie
View author publications
You can also search for this author in PubMed Google Scholar
Xin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Ying Ma
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Jiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
He-Qin Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xing Xie.

Additional information

Xing Xie: Correspondence to:

Xin Fan: This work was conducted while the first and third authors were visiting students at Microsoft Research Asia.

Abbreviations: AO, attention object; ROI, region-of-interest; AV, attention value; MPS, minimal perceptible size; IF, information fidelity; DS, description scheme// Part of this work was originally presented at the 9th International Conference on Multimedia Modeling (MMM’03).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, LQ., Xie, X., Fan, X. et al. A visual attention model for adapting images on small displays. Multimedia Systems 9, 353–364 (2003). https://doi.org/10.1007/s00530-003-0105-4

Download citation

Issue Date: October 2003
DOI: https://doi.org/10.1007/s00530-003-0105-4

Keywords:

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A visual attention model for adapting images on small displays

Abstract.

Access this article

Similar content being viewed by others

Attention mechanisms in computer vision: A survey

Visual attention network

A survey of the vision transformers and their CNN-transformer based variants

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords:

Navigation

A visual attention model for adapting images on small displays

Abstract.

Access this article

Similar content being viewed by others

Attention mechanisms in computer vision: A survey

Visual attention network

A survey of the vision transformers and their CNN-transformer based variants

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords:

Search

Navigation