Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains

Ahmad, Khurshid; Tariq, Mariam; Vrusias, Bogdan; Handy, Chris

doi:10.1007/3-540-36618-0_36

Khurshid Ahmad⁵,
Mariam Tariq⁵,
Bogdan Vrusias⁵ &
…
Chris Handy⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2633))

Included in the following conference series:

European Conference on Information Retrieval

1283 Accesses
15 Citations

Abstract

This paper explores the use of texts that are related to an image collection, also known as collateral texts, for building thesauri in specialist domains to aid in image retrieval. Corpus linguistic and information extraction methods are used for identifying key terms and semantic relationships in specialist texts that may be used for query expansion purposes. The specialist domain context imposes certain constraints on the language used in the texts, which makes the texts computationally more tractable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Squire, McG.D., Muller, W., Muller, H., Pun, T.: Content-Based Query of Image databases: Inspirations from Text Retrieval. Pattern Recognition Letters, Vol. 21. No. 13–14. Elsevier Science, Netherlands (2000) 1193–1198
Google Scholar
Marr, D.: Vision. W.H. Freeman, San Francisco (1982)
Google Scholar
Eakins, J.P., Graham, M.E.: Content-based Image Retrieval: A Report to the JISC Technology Applications Programme. Image Data Research Institute Newcastle, Northumbria (1999) (http://www.unn.ac.uk/iidr/report.html, visited 15/01/03)
Google Scholar
Srihari, R.K.: Use of Collateral Text in Understanding Photos. Artificial Intelligence Review, Special Issue on Integrating Language and Vision, Vol. 8. Kluwer Academic Publishers, Netherlands (1995) 409–430
Google Scholar
Picard, R.W.: Towards a Visual Thesaurus. In: Ian Ruthven (ed.): Springer Verlag Workshops in Computing, MIRO 95, Glasgow, Scotland (1995)
Google Scholar
Paek, S., Sable C.L., Hatzivassiloglou, V., Jaimes, A., Schiffman, B.H., Chang, S.F., McKeown, K.R.: Integration of Visual and Text-Based Approaches for the Content Labeling and Classification of Photographs. ACM SIGIR’99 Workshop on Multimedia Indexing and Retrieval, Berkeley, CA August (1999)
Google Scholar
Efthimiadis, E.N.: Query Expansion. In: Williams, M.E., (ed.): Annual Review of Information Systems and Technology (ARIST), Vol.31 (1996) 121–187
Google Scholar
Foskett, D.J.: Thesaurus. In: Sparck Jones, K., Willet, P. (eds.): Readings in Information Retrieval. Morgan Kaufmann Publishers, San Francisco, California (1997) 111–134
Google Scholar
Salton, G.: Experiments in Automatic Thesauri Construction for Information Retrieval. In Proceedings of the IFIP Congress, Vol. TA-2. Ljubljana, Yoguslavia (1971) 43–49
Google Scholar
Sparck Jones, K.: Automatic Keyword Classification for Information Retrieval. Butterworths, London, UK (1971)
Google Scholar
Jing, Y., Croft, W.B.: An Association Thesaurus for Information Retrieval. In: Bretano, F., Seitz, F.: (eds.): Proceedings of the RIAO’94 Conference. CIS-CASSIS, Paris, France (1994) 146–160
Google Scholar
Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, Boston, USA (1994)
MATH Google Scholar
Leech, G.: The State of the Art in Corpus Linguistics. In: Aijmer, K., Altenberg, B. (eds.): English Corpus Linguistics: Essays in Honor of Jan Svartvik. Longman, London (1991) 8–29
Google Scholar
Harris, Z.S.: Language and Information. In: Nevin, B. (ed.): Computational Linguistics Vol. 14, No. 4. Columbia University Press, New York (1988) 87–90
Google Scholar
Ahmad, K., Rogers, M.A.: Corpus-based terminology extraction. In: Budin, G., Wright S.A. (eds.): Handbook of Terminology Management, Vol. 2. John Benjamins Publishers, Amsterdam (2000) 725–760.
Google Scholar
Bourigault, D., Jacquemin, C., L’Homme, M-C. (eds.): Recent Advances in Computational Terminology. John Benjamins Publishers, Amsterdam (2001)
Google Scholar
Hearst, M.: Automatic Acquisition of Hyponyms from Large Text Corpora. In Proceedings of the Fourteenth International Conference on Computational Linguistics (COLING’92). Nantes, France. (1992) 539–545
Google Scholar
Church, K.W., Mercer, R.L.: Introduction. In: Armstrong, S. (ed): Special Issue on Using Large Corpora. Computational Linguistics, Vol. 9. No. 1–2 The MIT Press, Mass., USA (1993) 1–24
Google Scholar
Cruse, D.A.: Lexical Semantics. Cambridge University Press, Avon, Great Britain (1986)
Google Scholar
Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger. In: Proceedings of the Empirical Methods in Natural Language Processing Conference (1996) 133–141
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, School of Electronics and Physical Sciences, University of Surrey, Guildford, GU2 7XH, UK
Khurshid Ahmad, Mariam Tariq, Bogdan Vrusias & Chris Handy

Authors

Khurshid Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Mariam Tariq
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Vrusias
View author publications
You can also search for this author in PubMed Google Scholar
Chris Handy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto di Scienza e Tecnologie dell’Informazione, Consiglio Nazionale delle Ricerche, Via Giuseppe Moruzzi, 1, 56124, Pisa, Italy
Fabrizio Sebastiani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ahmad, K., Tariq, M., Vrusias, B., Handy, C. (2003). Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains. In: Sebastiani, F. (eds) Advances in Information Retrieval. ECIR 2003. Lecture Notes in Computer Science, vol 2633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36618-0_36

Download citation

DOI: https://doi.org/10.1007/3-540-36618-0_36
Published: 15 April 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-01274-0
Online ISBN: 978-3-540-36618-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics