Skip to main content

Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains

  • Conference paper
  • First Online:
Advances in Information Retrieval (ECIR 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2633))

Included in the following conference series:

Abstract

This paper explores the use of texts that are related to an image collection, also known as collateral texts, for building thesauri in specialist domains to aid in image retrieval. Corpus linguistic and information extraction methods are used for identifying key terms and semantic relationships in specialist texts that may be used for query expansion purposes. The specialist domain context imposes certain constraints on the language used in the texts, which makes the texts computationally more tractable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Squire, McG.D., Muller, W., Muller, H., Pun, T.: Content-Based Query of Image databases: Inspirations from Text Retrieval. Pattern Recognition Letters, Vol. 21. No. 13–14. Elsevier Science, Netherlands (2000) 1193–1198

    Google Scholar 

  2. Marr, D.: Vision. W.H. Freeman, San Francisco (1982)

    Google Scholar 

  3. Eakins, J.P., Graham, M.E.: Content-based Image Retrieval: A Report to the JISC Technology Applications Programme. Image Data Research Institute Newcastle, Northumbria (1999) (http://www.unn.ac.uk/iidr/report.html, visited 15/01/03)

    Google Scholar 

  4. Srihari, R.K.: Use of Collateral Text in Understanding Photos. Artificial Intelligence Review, Special Issue on Integrating Language and Vision, Vol. 8. Kluwer Academic Publishers, Netherlands (1995) 409–430

    Google Scholar 

  5. Picard, R.W.: Towards a Visual Thesaurus. In: Ian Ruthven (ed.): Springer Verlag Workshops in Computing, MIRO 95, Glasgow, Scotland (1995)

    Google Scholar 

  6. Paek, S., Sable C.L., Hatzivassiloglou, V., Jaimes, A., Schiffman, B.H., Chang, S.F., McKeown, K.R.: Integration of Visual and Text-Based Approaches for the Content Labeling and Classification of Photographs. ACM SIGIR’99 Workshop on Multimedia Indexing and Retrieval, Berkeley, CA August (1999)

    Google Scholar 

  7. Efthimiadis, E.N.: Query Expansion. In: Williams, M.E., (ed.): Annual Review of Information Systems and Technology (ARIST), Vol.31 (1996) 121–187

    Google Scholar 

  8. Foskett, D.J.: Thesaurus. In: Sparck Jones, K., Willet, P. (eds.): Readings in Information Retrieval. Morgan Kaufmann Publishers, San Francisco, California (1997) 111–134

    Google Scholar 

  9. Salton, G.: Experiments in Automatic Thesauri Construction for Information Retrieval. In Proceedings of the IFIP Congress, Vol. TA-2. Ljubljana, Yoguslavia (1971) 43–49

    Google Scholar 

  10. Sparck Jones, K.: Automatic Keyword Classification for Information Retrieval. Butterworths, London, UK (1971)

    Google Scholar 

  11. Jing, Y., Croft, W.B.: An Association Thesaurus for Information Retrieval. In: Bretano, F., Seitz, F.: (eds.): Proceedings of the RIAO’94 Conference. CIS-CASSIS, Paris, France (1994) 146–160

    Google Scholar 

  12. Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, Boston, USA (1994)

    MATH  Google Scholar 

  13. Leech, G.: The State of the Art in Corpus Linguistics. In: Aijmer, K., Altenberg, B. (eds.): English Corpus Linguistics: Essays in Honor of Jan Svartvik. Longman, London (1991) 8–29

    Google Scholar 

  14. Harris, Z.S.: Language and Information. In: Nevin, B. (ed.): Computational Linguistics Vol. 14, No. 4. Columbia University Press, New York (1988) 87–90

    Google Scholar 

  15. Ahmad, K., Rogers, M.A.: Corpus-based terminology extraction. In: Budin, G., Wright S.A. (eds.): Handbook of Terminology Management, Vol. 2. John Benjamins Publishers, Amsterdam (2000) 725–760.

    Google Scholar 

  16. Bourigault, D., Jacquemin, C., L’Homme, M-C. (eds.): Recent Advances in Computational Terminology. John Benjamins Publishers, Amsterdam (2001)

    Google Scholar 

  17. Hearst, M.: Automatic Acquisition of Hyponyms from Large Text Corpora. In Proceedings of the Fourteenth International Conference on Computational Linguistics (COLING’92). Nantes, France. (1992) 539–545

    Google Scholar 

  18. Church, K.W., Mercer, R.L.: Introduction. In: Armstrong, S. (ed): Special Issue on Using Large Corpora. Computational Linguistics, Vol. 9. No. 1–2 The MIT Press, Mass., USA (1993) 1–24

    Google Scholar 

  19. Cruse, D.A.: Lexical Semantics. Cambridge University Press, Avon, Great Britain (1986)

    Google Scholar 

  20. Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger. In: Proceedings of the Empirical Methods in Natural Language Processing Conference (1996) 133–141

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ahmad, K., Tariq, M., Vrusias, B., Handy, C. (2003). Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains. In: Sebastiani, F. (eds) Advances in Information Retrieval. ECIR 2003. Lecture Notes in Computer Science, vol 2633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36618-0_36

Download citation

  • DOI: https://doi.org/10.1007/3-540-36618-0_36

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-01274-0

  • Online ISBN: 978-3-540-36618-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics