Abstract.
Image adaptation, one of the essential problems in adaptive content delivery for universal access, has been actively explored for some time. Most existing approaches have focused on generic adaptation with a view to saving file size under constraints in client environment and have hardly paid attention to user perceptions of the adapted result. Meanwhile, the major limitation on the user’s delivery context is moving away from data volume (or time-to-wait) to screen size because of the galloping development of hardware technologies. In this paper, we propose a novel method for adapting images based on user attention. A generic and extensible image attention model is introduced based on three attributes (region of interest, attention value, and minimal perceptible size) associated with each attention object. A set of automatic modeling methods are presented to support this approach. A branch-and-bound algorithm is also developed to find the optimal adaptation efficiently. Experimental results demonstrate the usefulness of the proposed scheme and its potential application in the future.
Similar content being viewed by others
References
Chandra S, Gehani A, Ellis CS, Vahdat A (2001) Transcoding characteristics of Web images. Proc SPIE (Multimedia Comput Network 2001) 4312:135-149
Chen JL, Zhou BY, Shi J, Zhang HJ, Wu QF (2001) Function-based object model towards website adaptation. In: Proceedings of the 10th international World Wide Web conference, Hong Kong, May 2001, pp 587-596
Chen XR, Zhang HJ (2001) Text area detection from video frames. In: Proceedings of the 2nd IEEE Pacific-Rim conference on multimedia (PCM2001), Beijing, October 2001, pp 222-228
Christopoulos C, Skodras A, Ebrahimi T (2000) The JPEG2000 still image coding system: an overview. IEEE Trans Consumer Electron 46(4):1103-1127
Fan X, Xie X, Ma WY, Zhang HJ, Zhou HQ (2003) Visual attention based image browsing on mobile devices. In: Proceedings of the IEEE international conference on multimedia and expo (ICME 03), Baltimore, July 2003
Fox A, Gribble S, Brewer EA, Amir E (1996) Adapting to network and client variability via on-demand dynamic distillation. In: Proceedings of the 7th international conference on architectural support for programming languages and operating systems. Cambridge, MA, October 1996, pp 160-170
Han R, Bhagwat P, Lamaire R, Mummert T, Perret V, Rubas J (1998) Dynamic adaptation in an image transcoding proxy for mobile Web access. IEEE Pers Commun 5(6):8-17
ISO/IEC JTC1/SC29/WG11/N4242 (2001) ISO/IEC 15938-5 FDIS Information technology - multimedia content description interface - Part 5: Multimedia description schemes. Sydney, July 2001
ISO/IEC JTC1/SC29/WG11/N4674 (2002) MPEG-7 Overview. Jeju, Korea, March 2002
ISO/IEC JTC1/SC29/WG11/N4819 (2002) MPEG-21 Digital item adaptation. Fairfax, VA, May 2002
Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Analysis Mach Intell 20(11):1254-1259
Itti L, Koch C (1999) A comparison of feature combination strategies for saliency-based visual attention system. Proc SPIE (Hum Vis Electron Imag IV) 3644:473-482
Itti L, Koch C (2001) Computational modeling of visual attention. Nat Rev Neurosci 2(3):194-203
Lee K, Chang HS, Chun SS, Choi L, Sull S (2001) Perception-based image transcoding for universal multimedia access. In: Proceedings of the 8th international conference on image processing (ICIP-2001), Thessaloniki, Greece, October 2001, 2:475-478
Li SZ, Zhu L, Zhang ZQ, Blake A, Zhang HJ, Shum H (2002) Statistical learning of multi-view face detection. In: Proceedings of the 7th European conference on computer vision (ECCV 2002), Copenhagen, May 2002, 4:67-81
Ma WY, Bedner I, Chang G, Kuchinsky A, Zhang HJ (2000) A framework for adaptive content delivery in heterogeneous network environments. Proc SPIE (Multimedia Comput Network 2000) 3969:86-100
Ma YF, Lu L, Zhang HJ, Li MJ (2002) A user attention model for video summarization. In: Proceedings of the 10th ACM international conference on multimedia, Juan-les-Pins, France, December 2002, pp 533-542
Mohan R, Smith JR, Li CS (1999) Adapting multimedia internet content for universal access. IEEE Trans Multimedia 1(1):104-114
Salah AA, Alpaydin E, Akarun L (2002) A selective attention-based method for visual pattern recognition with application to handwritten digit recognition and face recognition. IEEE Trans Pattern Analysis Mach Intell 24(3):420-425
Smith JR, Mohan R, Li CS (1998) Content-based transcoding of images in the Internet. In: Proceedings of the 5th international conference on image processing (ICIP-98), Chicago, October 1998, 3:7-11
World Wide Web Consortium (1999) Web content accessibility guidelines 1.0. May 1999, http://www.w3.org/tr/wai-webcontent/
Author information
Authors and Affiliations
Corresponding author
Additional information
Xing Xie: Correspondence to:
Xin Fan: This work was conducted while the first and third authors were visiting students at Microsoft Research Asia.
Abbreviations: AO, attention object; ROI, region-of-interest; AV, attention value; MPS, minimal perceptible size; IF, information fidelity; DS, description scheme// Part of this work was originally presented at the 9th International Conference on Multimedia Modeling (MMM’03).
Rights and permissions
About this article
Cite this article
Chen, LQ., Xie, X., Fan, X. et al. A visual attention model for adapting images on small displays. Multimedia Systems 9, 353–364 (2003). https://doi.org/10.1007/s00530-003-0105-4
Issue Date:
DOI: https://doi.org/10.1007/s00530-003-0105-4