Abstract
Image classification and automatic annotation could be treated as effective solutions to enable keyword-based semantic image retrieval. The importance of automatic image annotation has increased with the growth of the digital images collections being of great interest as it allows indexing, retrieving, and understanding of large collections of image data. In this chapter, we are presenting an overview of the existing methods for the annotation task from several perspectives: unsupervised/supervised learning, parametric/nonparametric unsupervised learning models, or text/image-based. For medical images annotation, we have chosen an extension of the cross-media relevance model based on an object-oriented approach. This method is presented in detail together with the steps that should be applied to annotate a new image. An evaluation of the annotation process and the experimental results are presented in the final part of this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang Y (2008) Automatic image annotation and categorization, PhD thesis, University of London, September
Mori Y, Takahashi H, Oka R (1999) Image-to-word transformation based on dividing and vector quantizing images with words. In: MISRM’99 first international workshop on multimedia intelligent storage and retrieval management, Orlando, 1999
Duygulu P, Barnard K, de Freitas N, Forsyth D (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In: Seventh European conference on computer vision, Copenhagen, 2002, pp 97–112
Brown P, Pietra SD, Pietra VD, Mercer R (1993) The mathematics of statistical machine translation: parameter estimation. Comput Linguist 19(2):263–311
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Ser B (Methodological) 39(1):1–38
Lawrence RR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2):257–286
Arnab G, Pavel I, Sanjeev K (2005) Hidden Markov models for automatic annotation and content-based retrieval of images and video. In: Proceedings of ACM SIGIR international conference on research and development in information retrieval (SIGIR), Salvador, 2005, pp 544–551
Barnard K, Forsyth DA (2001) Learning the semantics of words and pictures. In: Proceedings of IEEE international conference on computer vision (ICCV), Vancouver, 2001, pp 408–415
Blei DM, Jordan MI (2003) Modeling annotated data. In: Proceedings of ACM SIGIR international conference on research and development in information retrieval (SIGIR), Toronto, 2003, pp 127–134
Zhang R, Zhang Z(M.), Li M, Ma WY, Zhang HJ (2005) A probabilistic semantic model for image annotation and multi-modal image retrieval. In: Proceedings of IEEE international conference on computer vision (ICCV), Beijing, 2005, pp 846–851
Monay F, Gatica-Perez D (2004) PlSA-based image auto-annotation: constraining the latent space. In: Proceedings of ACM international conference on multimedia (ACM MULTIMEDIA), New York, 2004, pp 348–351
Deerwester SC, Dumais ST, Landauer TK, Furnas GW, Harshman RA (1990) Indexing by latent semantic analysis. J Soc Inform Sci 41(6):391–407
Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1–2):177–196
Jeon J, Lavrenko V, Manmatha R (2003) Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of ACM SIGIR international conference on research and development in information retrieval (SIGIR), Toronto, 2003, pp 119–126
Lavrenko V, Manmatha R, Jeon J (2004) A model for learning the semantics of pictures. In: Proceedings of advances in neural information processing systems (NIPS), Vancouver, 2004
Feng SL et al. (2004) Multiple bernoulli relevance models for image and video annotation. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), Washington, DC, 2004, pp 1242–1245
Jin R, Chai JY, Si L (2004) Effective automatic image annotation via a coherent language model and active learning. In: Proceedings of ACM international conference on multimedia (ACM MULTIMEDIA), New York, 2004, pp 892–899
Pan JY, Yang HJ, Faloutsos C (2004) Duygulu P GCap: graph-based automatic image captioning. In: Proceedings of IEEE international conference on computer vision and pattern recognition workshop, Washington, DC, 2004, pp 146–149
Tong H, Faloutsos C, Pan JY (2006) Fast random walk with restart and its applications. In: Proceedings of the international conference on data mining, Hong Kong, 2006, pp 613–622
Liu J, Li M, Ma WY, Liu Q, Lu H (2006) An adaptive graph model for automatic image annotation. In: Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (MIR), Santa Barbara, 2006, pp 873–877
Zhou D, Weston J, Gretton A, Bousquet O, Scholkopf B (2004) Ranking on data manifolds. In: Proceedings of advances in neural information processing systems (NIPS), Vancouver and Whistler, 2004
Li J, Wang J (2003) Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans Pattern Anal Mach Intell 25:1075–1088
Lipson P, Grimson E, Sinha P (1997) Configuration based scene classification and image indexing. In: Proceedings of the 1997 conference on computer vision and pattern recognition, San Juan, 1997, pp 1007–1010
Vailaya A, Jain A, Zhang HJ (1998) On image classification: city images vs. landscapes. Pattern Recognit 31(12):1921–1935
Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, New York
Chapelle O, Haffner P, Vapnik V (1999) Support vector machines for histogram-based image classification. IEEE Trans Neural Netw 10(5):1055–1064
Yiu Fung C, Fock Loe K (1999) Learning primitive and scene semantics of images for classification and retrieval. In: Proceedings of ACM international conference on multimedia (ACM MULTIMEDIA), Orlando, 1999, pp 9–12
Vailaya A, Figueiredo M, Jain A, Zhang H (2001) Image classification for content-based indexing. IEEE Trans Image Process 10(1):117–130
Gray R (1986) Vector quantization. IEEE Signal Process Mag 1(2):4–29
Gorkani MM, Picard RW (1994) Texture orientation for sorting photos “at a glance”. In: Proceedings of IEEE international conference in pattern recognition, Jerusalem, 1994, pp 459–464
Carneiro G, Vasconcelos N (2005) Formulating semantic image annotation as a supervised learning problem. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), San Diego, 2005, pp 163–168
Vogel J, Schiele B (2007) Semantic modeling of natural scenes for content-based image retrieval. Int J Comput Vis (IJCV) 72(2):133–157
Herbrich R, Graepel T, Campbell C (2001) Bayes point machines. J Mach Learn Res (JMLR) 1:245–279
Chang EY, Goh K, Sychay G, Wu G (2003) CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Trans Circuits Syst Video Technol (CSVT) 13(1):26–38
Fei-Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), San Diego, 2005, pp 523–525
Bosch A, Zisserman A, Munoz X (2006) Scene classification via PLSA. In: Proceedings of European conference on computer vision (ECCV), Graz, 2006, pp 1134–1137
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), New York, 2006, pp 988–991
Dietterich TG, Lathrop RH, Lozano-Perez T (1997) Solving the multiple-instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71
Maron O, Ratan AL (1998) Multiple-instance learning for natural scene classification. In: Proceedings of IEEE international conference on machine learning (ICML), Madison, 1998, pp 341–349
Yang C, Dong M, Hua J (2006) Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. In: Proceedings of IEEE conference on computer vision and pattern recognition (CVPR), New York, 2006, pp 1057–1063
Fan J, Gao Y, Luo H (2004) Multi-level annotation of natural scenes using dominant image components and semantic concepts. In: Proceedings of ACM international conference on multimedia (ACM MULTIMEDIA), New York, 2004, pp 540–547
Fan J, Gao Y, Luo H, Xu G (2004) Automatic image annotation by using concept-sensitive salient objects for image content representation. In: Proceedings of ACM SIGIR international conference on research and development in information retrieval (SIGIR), Sheffield, 2004, pp 361–368
Gao Y, Fan J (2006) Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation. In: Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (MIR), Santa Barbara, 2006, pp 79–88
Gao Y, Fan J, Xue X, Jain R (2006) Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers. In: Proceedings of ACM international conference on multimedia (ACM MULTIMEDIA), Santa Barbara, 2006, pp 901–910
Russell BC, Torralba A, Murphy KP, Freeman WT (2005) LabelMe: a database and web-based tool for image annotation. Technical report, Massachusetts Institute of Technology, MIT AI Lab Memo AIM-2005–025
Miller GA (1992) WordNet: a lexical database for English. In: Proceedings of the workshop on speech and natural language, San Mateo, 1992, pp 483–483
Shah B, Benton R, Wu Z, Raghavan V (2007) Chapter VI: Automatic and semi-automatic techniques for image annotation. In: Zhang Y-J (ed) Semantic-based visual information retrieval. IRM Press, Hershey
Hu B, Dasmahapatra S, Lewis P, Shadbolt N (2003) Ontology-based medical image annotation with description logics. In: Proceedings of the IEEE conference on tools with artificial intelligence, Sacramento, 2003, p 77
Soo V, Lee C, Li C, Chen S, Chen C (2003) Automatic semantic annotation and retrieval based on sharable ontology and case-based learning techniques. In: Proceedings of the joint conference on digital libraries, Houston, 2003, pp 61–72
Shen H, Ooi B, Tan K (2000) Giving meanings to WWW images. In: Proceedings of the ACM conference on multimedia, Marina del Rey, 2000, pp 39–47
Lieberman H, Rosenzweig E, Singh P (2001) Aria: an agent for annotating and retrieving images. IEEE Comput 34(7):57–61
Carson C, Thomas M, Belongie S, Hellerstein J, Malik J (1999) Blobworld: a system for region-based image indexing and retrieval. In: Proceedings of the conference on visual information systems, Amsterdam, 1999, pp 509–516
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905
Hartigan J, Wong M (1979) A K-means clustering algorithm. Appl Stat 28(1):100–108
Li J, Gray R, Olshen RA (2000) Multiresolution image classification by hierarchical modeling with two dimensional hidden Markov models. IEEE Trans Inform Theory 34(5):1826–1841
Feng H, Shi R, Chua T (2004) A bootstrapping framework for annotating and retrieving WWW images. In: Proceedings of the ACM conference on multimedia, New York, 2004, pp 960–967
Catherine EC, Xenophon Z, Stelios CO (1997) I2Cnet medical image annotation service. Med Inform 22(4):337–347 (Special Issue)
Igor FA, Filipe C, Joaquim F, Pinto da C, Jaime SC (2010) Hierarchical medical image annotation using SVM-based approaches. In: Proceedings of the 10th IEEE international conference on information technology and applications in biomedicine, Korfu, 2010
Daniel E (2003) OXALIS: a distributed, extensible ophthalmic image annotation system, Master of Science Thesis. University of Pittsburgh
Baoli L, Ernest VG, Ashwin R (2007) Semantic annotation and inference for medical knowledge discovery. In: NSF symposium on next generation of data mining (NGDM-07), Baltimore, 2007
Bresell A, Servenius Bo, Persson B (2006) Ontology annotation Treebrowser: an interactive tool where the complementarity of medical subject headings and gene ontology improves the interpretation of gene lists. Appl Bioinformatics 5(4):225–236
Peng H, Long F, Myers EW (2009) VANO: a volume-object image annotation system. Bioinformatics 25(5):695–697
Lin I-J, Chao H (2006) CMAS, a rich media annotation system for medical imaging. Prog Biomed Opt Imaging 7(31):614506.1–614506.8
ImageCLEF. http://www.imageclef.org/. Accessed 25 Aug 2011
Hersh W, Kalpathy-Cramer J, Jensen J (2006) Medical image retrieval and automated annotation: OHSU at ImageCLEF. In: CLEF, Alicante, 2006, vol 4730, pp 660–669
Gospodnetic O, Hatcher E (2005) Lucene in action. Manning Publications, Greenwich
Nabney IT (2004) Netlab: algorithms for pattern recognition. Springer, London
Lavrenko V, Croft W (2001) Relevance-based language models. In: Proceedings of the 24th annual international ACM SIGIR conference, New Orleans, 2001, pp 120–127
Lavrenko V, Choquette M, Croft W (2002) Cross-lingual relevance models. In: Proceedings of the 25th annual international ACM SIGIR conference, Tampere, 2002, pp 175–182
db4objects. http://www.db4o.com/. Accessed 25 Aug 2011
Db4o Developer Community. http://developer.db4o.com/. Accessed 25 Aug 2011
Paterson J, Edlich S, Hoerning H, Hoerning R (2006) The definitive guide to db4o. Apress, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Stanescu, L., Burdescu, D.D., Brezovan, M., Mihai, C.G. (2012). Medical Images Annotation. In: Creating New Medical Ontologies for Image Annotation. SpringerBriefs in Electrical and Computer Engineering(). Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1909-9_5
Download citation
DOI: https://doi.org/10.1007/978-1-4614-1909-9_5
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1908-2
Online ISBN: 978-1-4614-1909-9
eBook Packages: EngineeringEngineering (R0)