Associative and Spatial Relationships in Thesaurus-Based Retrieval

  • Harith Alani
  • Christopher Jones
  • Douglas Tudhope
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1923)

Abstract

The OASIS (Ontologically Augmented Spatial Information System) project explores terminology systems for thematic and spatial access in digital library applications. A prototype implementation uses data from the Royal Commission on the Ancient and Historical Monuments of Scotland, together with the Getty AAT and TGN thesauri. This paper describes its integrated spatial and thematic schema and discusses novel approaches to the application of thesauri in spatial and thematic semantic distance measures. Semantic distance measures can underpin interactive and automatic query expansion techniques by ranking lists of candidate terms. We first illustrate how hierarchical spatial relationships can be used to provide more flexible retrieval for queries incorporating place names in applications employing online gazetteers and geographical thesauri. We then employ a set of experimental scenarios to investigate key issues affecting use of the associative (RT) thesaurus relationships in semantic distance measures. Previous work has noted the potential of RTs in thesaurus search aids but the problem of increased noise in result sets has been emphasised. Specialising RTs allows the possibility of dynamically linking RT type to query context. Results presented in this paper demonstrate the potential for filtering on the context of the RT link and on subtypes of RT relationships.
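As a rough illustration of how a semantic distance measure could rank candidate expansion terms and filter on RT subtype, the following is a minimal sketch only. The toy thesaurus, the relationship costs, the RT subtype labels ("part", "agent") and the function name `expand` are all hypothetical assumptions made for this example; they are not taken from the AAT, the TGN or the OASIS prototype, and the shortest-path traversal is one common way to realise such a measure rather than the method used in the paper.

```python
import heapq
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical toy thesaurus: term -> list of (relationship, neighbour).
# "BT"/"NT" are broader/narrower links; "RT:<subtype>" is an associative
# link with an illustrative subtype label. This is NOT real AAT data.
THESAURUS: Dict[str, List[Tuple[str, str]]] = {
    "castles": [("BT", "fortifications"), ("RT:part", "moats"),
                ("RT:agent", "masons")],
    "fortifications": [("NT", "castles"), ("NT", "hill forts")],
    "hill forts": [("BT", "fortifications")],
    "moats": [("RT:part", "castles")],
    "masons": [("RT:agent", "castles")],
}

# Assumed traversal cost per relationship kind (lower = semantically closer).
COSTS: Dict[str, float] = {"BT": 1.0, "NT": 1.0, "RT": 1.5}


def expand(start: str, max_distance: float,
           allowed_rt: Optional[Set[str]] = None) -> List[Tuple[str, float]]:
    """Rank terms reachable from `start` by minimum weighted path cost.

    allowed_rt: RT subtypes that may be traversed; None allows every RT
    link, and an empty set disables RT links altogether.
    """
    best: Dict[str, float] = {start: 0.0}
    queue: List[Tuple[float, str]] = [(0.0, start)]
    while queue:
        dist, term = heapq.heappop(queue)
        if dist > best.get(term, float("inf")):
            continue  # stale queue entry; a cheaper path was already found
        for rel, neighbour in THESAURUS.get(term, []):
            kind, _, subtype = rel.partition(":")
            if kind == "RT" and allowed_rt is not None and subtype not in allowed_rt:
                continue  # filter out RT subtypes not relevant to the query context
            new_dist = dist + COSTS[kind]
            if new_dist <= max_distance and new_dist < best.get(neighbour, float("inf")):
                best[neighbour] = new_dist
                heapq.heappush(queue, (new_dist, neighbour))
    # Sort by distance, then alphabetically, and drop the start term itself.
    ranked = sorted((d, t) for t, d in best.items() if t != start)
    return [(t, d) for d, t in ranked]


if __name__ == "__main__":
    print(expand("castles", 2.0))                      # all RT links allowed
    print(expand("castles", 2.0, allowed_rt={"part"})) # only "part" RT links
```

Restricting `allowed_rt` shrinks the candidate list, which is the kind of noise reduction the abstract attributes to filtering on RT subtypes; in practice the costs and subtype choices would need to be tuned against real thesaurus data.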

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Harith Alani (1)
  • Christopher Jones (2)
  • Douglas Tudhope (1)

  1. School of Computing, University of Glamorgan, Pontypridd, UK
  2. Department of Computer Science, Cardiff University, Cardiff, UK
