Abstract
This paper explores the use of texts that are related to an image collection, also known as collateral texts, for building thesauri in specialist domains to aid in image retrieval. Corpus linguistic and information extraction methods are used for identifying key terms and semantic relationships in specialist texts that may be used for query expansion purposes. The specialist domain context imposes certain constraints on the language used in the texts, which makes the texts computationally more tractable.
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ahmad, K., Tariq, M., Vrusias, B., Handy, C. (2003). Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains. In: Sebastiani, F. (eds) Advances in Information Retrieval. ECIR 2003. Lecture Notes in Computer Science, vol 2633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36618-0_36
DOI: https://doi.org/10.1007/3-540-36618-0_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-01274-0
Online ISBN: 978-3-540-36618-8
eBook Packages: Springer Book Archive