An Ontology-Based Approach to Information Retrieval

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10151)

Abstract

We define a general framework for ontology-based information retrieval (IR). In our approach, document and query expansion rely on a base taxonomy that is extracted from a lexical database or a Linked Data set (e.g. WordNet, Wiktionary etc.). Each term from a document or query is modelled as a vector of base concepts from the base taxonomy. We define a set of mapping functions which map multiple ontological layers (dimensions) onto the base taxonomy. This way, each concept from the included ontologies can also be represented as a vector of base concepts from the base taxonomy. We propose a general weighting schema which is used for the vector space model. Our framework can therefore take into account various lexical and semantic relations between terms and concepts (e.g. synonymy, hierarchy, meronymy, antonymy, geo-proximity, etc.). This allows us to avoid certain vocabulary problems (e.g. synonymy, polysemy) as well as to reduce the vector size in the IR tasks.

References

  1. 1.
    Aronson, A.R., Rindflesch, T.C., Browne, A.C.: Exploiting a large thesaurus for information retrieval. In: RIAO, vol. 94 (1994)Google Scholar
  2. 2.
    Baziz, M., et al.: An information retrieval driven by ontology from query to document expansion. In: Large Scale Semantic Access to Content (Text, Image, Video, and Sound). LE CENTRE DE HAUTES ETUDES INTERNATIONALES D’INFORMATIQUE DOCUMENTAIRE (2007)Google Scholar
  3. 3.
    Castells, P., Fernandez, M., Vallet, D.: An adaptation of the vector-space model for ontology-based information retrieval. IEEE Trans. Knowl. Data Eng. 19(2), 261–272 (2007)CrossRefGoogle Scholar
  4. 4.
    Carpineto, C., Romano, G.: A survey of automatic query expansion in information retrieval. ACM Comput. Surv. (CSUR) 44(1), 1 (2012)CrossRefMATHGoogle Scholar
  5. 5.
    Dragoni, M., da Costa Pereira, C., Tettamanzi, A.G.B.: A conceptual representation of documents and queries for information retrieval systems by using light ontologies. Expert Syst. Appl. 39(12), 10376–10388 (2012)CrossRefGoogle Scholar
  6. 6.
    Hersh, W.R., Greenes, R.A.: SAPHIRE–an information retrieval system featuring concept matching, automatic indexing, probabilistic retrieval, and hierarchical relationships. Comput. Biomed. Res. 23(5), 410–425 (1990)CrossRefGoogle Scholar
  7. 7.
    Mandala, R., Tokunaga T., and Tanaka H.: The use of WordNet in information retrieval. In: Proceedings of the Conference on Use of WordNet in Natural Language Processing Systems (1998)Google Scholar
  8. 8.
    Navigli, R., Velardi, P.: An analysis of ontology-based query expansion strategies. In: Proceedings of the 14th European Conference on Machine Learning, Workshop on Adaptive Text Extraction and Mining, Cavtat-Dubrovnik, Croatia (2003)Google Scholar
  9. 9.
    Luke, S., Lee S., Rager, D.: Ontology-based knowledge discovery on the world-wide web. In: Working Notes of the Workshop on Internet-Based Information Systems at the 13th National Conference on Artificial Intelligence (AAAI 1996) (1996)Google Scholar
  10. 10.
    Salton, G., Wong, A., Yang, C.-S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)CrossRefMATHGoogle Scholar
  11. 11.
    Schuhmacher, M., Ponzetto, S.P.: Knowledge-based graph document modeling. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining. ACM (2014)Google Scholar
  12. 12.
    Song, M., Song, I.Y., Hu, X., Allen, R.B.: Integration of association rules and ontologies for semantic query expansion. Data Knowl. Eng. 63(1), 63–75 (2007)CrossRefGoogle Scholar
  13. 13.
    Thomopoulos, R., Buche, P., Haemmerlé, O.: Representation of weakly structured imprecise data for fuzzy querying. Fuzzy Sets Syst. 140(1), 111–128 (2003)MathSciNetCrossRefMATHGoogle Scholar
  14. 14.
    Tsatsaronis, G., Panagiotopoulou, V.: A generalized vector space model for text retrieval based on semantic relatedness. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics (2009)Google Scholar
  15. 15.
    Voorhees, E.M.: Query expansion using lexical-semantic relations. In: Croft, B.W., van Rijsbergen, C.J. (eds.) SIGIR 1994. Springer, London (1994)Google Scholar
  16. 16.
    Waitelonis, J., Exeler, C., Sack, H.: Linked data enabled generalized vector space model to improve document retrieval. In: NLP and DBpedia Workshop, ISWC 2015, Bethlehem, 11–15th September 2015Google Scholar
  17. 17.
    Wong, S.K.M., Ziarko, W., Wong, P.C.N.: Generalized vector spaces model in information retrieval. In: Proceedings of the 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM (1985)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.University of RijekaRijekaCroatia
  2. 2.Birkbeck, University of LondonLondonUK

Personalised recommendations